hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-08 03:01:47 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	d056b610b7	fix: avoid prompt_toolkit complex tempfile bug and prefer nvim first Setting buffer.tempfile = 'prompt.md' pushed prompt_toolkit into its complex-tempfile path, which creates a temp dir and then calls os.makedirs() on that same path when no subdirectory is present. That raises EEXIST before the editor can launch. Keep prompt_toolkit on the simple tempfile path with .md suffix, and make the editor fallback chain explicit on both surfaces: $VISUAL -> $EDITOR -> nvim -> vim -> vi -> nano.	2026-04-25 20:16:50 -05:00
Teknium	2536a36f6f	fix(tui): route /save through session.save JSON-RPC The cherry-picked approach serialized the UI-shaped transcript on the Node side, producing a third JSON format alongside cli.py save_conversation and tui_gateway session.save. Simpler to call the existing session.save method, which already writes the canonical agent history (raw OpenAI messages + model) to an absolute-path file. - /save still short-circuits before the slash worker - Empty transcript -> 'no conversation yet' - No active session -> 'no active session - nothing to save' - Otherwise: rpc('session.save', {session_id}) and echo back the file path - Tests updated to assert RPC contract; new test covers the no-sid case	2026-04-25 18:11:37 -07:00
helix4u	1b8ca9254f	fix(tui): save live transcript from slash command	2026-04-25 18:11:37 -07:00
Brooklyn Nicholson	db7c5735f0	fix: prefer vim over nano for $EDITOR fallback (CLI + TUI) prompt_toolkit's default editor list is: $VISUAL, $EDITOR, /usr/bin/editor, /usr/bin/nano, /usr/bin/pico, /usr/bin/vi, /usr/bin/emacs — so when neither env var is set, the base CLI launched nano. The TUI fell back to a literal 'vi'. Same Ctrl+G keystroke, two different editors. Pick the same chain on both surfaces: $VISUAL → $EDITOR → vim → vi → nano CLI: override input_area.buffer._open_file_in_editor on the TextArea once at app build time. Local to that buffer; doesn't touch os.environ or affect other subprocesses. TUI: extract resolveEditor() into ui-tui/src/lib/editor.ts. PATH walk with accessSync(X_OK), no shelling out. Six-line unit test verifies the priority order and the multi-entry PATH walk.	2026-04-25 20:11:25 -05:00
Teknium	8bbeaea6c7	fix(config): broaden api-key ref lookup to templated base_url The raw-template lookup added in PR #15817 went through `get_compatible_custom_providers(read_raw_config())`, which calls `_normalize_custom_provider_entry` → `urlparse(base_url)`. Any entry whose `base_url` is itself an env-ref (`${NEURALWATT_API_BASE}`) was dropped as 'not a valid URL', so `api_key_ref` stayed empty and the resolved secret was still written to `model.api_key` — the exact case the original Discord report described. Replace the normalizer-gated lookup with a direct read of `raw['custom_providers']` and `raw['providers']`, indexed by name (case-insensitive, optionally qualified by model) so the loaded (expanded) entry can be matched regardless of how `base_url` is written. Add an integration regression test driving the real `select_provider_and_model` entry point with the Discord-reported NeuralWatt config (`${VAR}` in both `base_url` and `api_key`). This test fails on the PR-only fix and passes with the broadened lookup.	2026-04-25 18:10:52 -07:00
helix4u	1fdc31b214	fix(config): preserve custom provider api key refs	2026-04-25 18:10:52 -07:00
Brooklyn Nicholson	5fac6c3440	fix(cli): write editor draft to prompt.md so syntax highlighting works Base CLI was handing prompt_toolkit's Buffer.open_in_editor() a default config — Buffer.tempfile_suffix and .tempfile both empty — so it created /tmp/tmpXXXXXX with no extension. nano/vim/helix all key syntax highlighting off the file extension, so the buffer rendered plain. The TUI already writes to <mkdtemp>/prompt.md and gets full markdown highlighting + a sensible title bar. Set buffer.tempfile = 'prompt.md' on the TextArea so prompt_toolkit's complex-tempfile path produces <mkdtemp>/prompt.md to match. shutil.rmtree cleanup is built-in.	2026-04-25 20:04:04 -05:00
kshitijk4poor	2c56dce0ed	fix(model): preserve custom endpoint credentials and accept cloud models not in /v1/models When switching models on a custom endpoint (ollama-launch): - Same-provider switches no longer re-resolve credentials (fixes base_url being lost for 'custom' provider on subsequent switches) - Named providers (ollama-launch) are resolved via user_providers so switch_model can find their base_url from config - Models not in the /v1/models probe but present in the user's saved provider config are accepted with a warning instead of rejected - CLI /model and TUI /model both pass user_providers/custom_providers to switch_model so the config model list is available for validation Closes #15088	2026-04-25 18:03:47 -07:00
Teknium	01cf2c65cc	chore(release): map iris@growthpillars.co to irispillars (#15825 ) Follow-up to #15533 (merged). Prevents release notes CI from attributing the contributor to the placeholder.	2026-04-25 18:02:13 -07:00
helix4u	b2d3308f98	fix(doctor): accept bare custom provider	2026-04-25 18:01:36 -07:00
Iris Jin	25ba6a4a74	fix(gateway): make reasoning session-scoped by default	2026-04-25 18:01:31 -07:00
Brooklyn Nicholson	4c797bfae9	fix(cli): accept Alt+G as Ctrl+G fallback in VSCode/Cursor terminals Same problem as the TUI: Cursor and VSCode bind Ctrl+G to "Find Next" at the editor level, so the keystroke never reaches the terminal and the prompt_toolkit-driven Hermes CLI sees nothing. Register ('escape', 'g') alongside the existing 'c-g' on the same handler so the editor handoff works inside Cursor/VSCode too. The filter (no clarify/approval/sudo/secret prompt active) is unchanged.	2026-04-25 20:01:03 -05:00
Brooklyn Nicholson	c58956a9a2	fix(tui): accept Alt+G as Ctrl+G fallback in VSCode/Cursor terminals VSCode and Cursor bind Ctrl+G to "Find Next" at the editor level, so the keystroke never reaches the embedded terminal — Ctrl+G to open \$EDITOR was effectively dead inside those IDEs. Alt+G is unbound in both editors and reaches the TUI cleanly as `\x1bg` → `key.meta && ch === 'g'` after parse-keypress. Accept it alongside the existing isAction(key, ch, 'g') check, and document the fallback in README + the hotkeys panel.	2026-04-25 19:57:17 -05:00
Brooklyn Nicholson	3944b22506	fix(tui): suspend Ink properly when opening $EDITOR via Ctrl+G The Ctrl+G handler was toggling the alt-screen by hand (`\x1b[?1049l` ... `\x1b[?1049h`) without releasing stdin or kitty keyboard mode, so the launched editor would lose keystrokes (Ink kept swallowing them) and editors that don't speak CSI-u (e.g. nano) would print "Unknown sequence" for every Ctrl-key. Switch to `withInkSuspended` from @hermes/ink, the same helper `/setup` already uses. It pauses Ink, removes stdin listeners, drops raw mode, disables kitty/modifyOtherKeys + mouse + focus reporting, runs the editor, then restores everything with a full repaint.	2026-04-25 19:54:06 -05:00
brooklyn!	489bed6f96	Merge pull request #15478 from yes999zc/fix-deepseek-reasoning-all-assistant-messages Some checks are pending Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build and Publish / build-and-push (push) Waiting to run Details Nix / nix (macos-latest) (push) Waiting to run Details Nix / nix (ubuntu-latest) (push) Waiting to run Details Tests / test (push) Waiting to run Details Tests / e2e (push) Waiting to run Details fix: DeepSeek/Kimi thinking mode requires reasoning_content on ALL assistant messages	2026-04-25 19:19:33 -05:00
FocusFlow Dev	ad0ac89478	fix: DeepSeek/Kimi thinking mode requires reasoning_content on ALL assistant messages Previously _copy_reasoning_content_for_api only padded reasoning_content when the assistant message had tool_calls. DeepSeek V4 thinking mode requires the field on every assistant turn, including plain text replies without tool_calls. - Remove the 'source_msg.get("tool_calls") and' guard - Update test: plain assistant turns now get padded for DeepSeek/Kimi Fixes #15213	2026-04-26 07:47:13 +08:00
Teknium	dc4d92f131	docs: embed tutorial videos on webhooks + auxiliary models pages (#15809 ) - webhooks.md: adds a Video Tutorial section under the intro with a responsive YouTube iframe (WNYe5mD4fY8). - configuration.md: adds a Video Tutorial subsection under Auxiliary Models with a responsive YouTube iframe (NoF-YajElIM). Both use a 16:9 aspect-ratio wrapper so the embeds scale cleanly on mobile. Verified with `npm run build` — MDX parses clean, no new warnings or broken links introduced.	2026-04-25 16:44:53 -07:00
Teknium	47420a84b9	docs(obliteratus): link YouTube video guide in SKILL.md (#15808 ) Adds a 'Video Guide' section pointing at the walkthrough of a Hermes agent abliterating Gemma with OBLITERATUS, so the agent can surface it when the user wants a visual overview before running the workflow.	2026-04-25 16:30:38 -07:00
brooklyn!	f93d4624bf	Merge pull request #15749 from Zjianru/fix/copy-reasoning-content-ordering-and-cross-provider-isolation fix(agent): ordering fix in _copy_reasoning_content_for_api — cross-provider reasoning isolation	2026-04-25 17:21:49 -05:00
codez	5ae608152e	fix: remove has_reasoning guard — inject empty reasoning_content for DeepSeek/Kimi tool_calls unconditionally	2026-04-26 06:08:54 +08:00
brooklyn!	88b65cc82a	Update run_agent.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-26 05:49:38 +08:00
brooklyn!	edc78e258c	Merge pull request #15766 from NousResearch/bb/tui-ssh-copy fix(tui): honor client copy shortcut over ssh	2026-04-25 15:33:17 -05:00
Brooklyn Nicholson	31d7f1951a	fix(tui): clamp copied selection bounds Clamp copied selection columns to the screen width before scanning rendered cells.	2026-04-25 15:32:45 -05:00
Brooklyn Nicholson	b1c18e5a41	refactor(tui): format screen imports Keep screen.ts import ordering aligned with the ui-tui formatter.	2026-04-25 15:26:51 -05:00
Brooklyn Nicholson	bd66e55a02	fix(tui): track rendered spaces for selection copy - add a written-cell bitmap so selection can distinguish rendered spaces from blank padding - preserve code indentation without markdown-specific rendering hacks	2026-04-25 15:21:26 -05:00
Brooklyn Nicholson	1735ced93b	fix(tui): preserve code block indentation in selection Render code indentation spaces as selectable cells so copied fenced code keeps its leading whitespace.	2026-04-25 15:17:36 -05:00
Brooklyn Nicholson	bba16943f6	fix(tui): preserve rendered indentation in selections - trim only empty edge rows instead of full selected text - bound selection paint using unwritten cells so rendered indentation remains copyable	2026-04-25 15:14:26 -05:00
Brooklyn Nicholson	132620ba3d	refactor(tui): simplify remote copy hotkey hints Use an explicit conditional table instead of spread casting for SSH copy hint rows.	2026-04-25 15:09:12 -05:00
Brooklyn Nicholson	876bb60044	fix(tui): trim whitespace-only selection chrome - clamp selection highlight to real row content so blank drag margins do not render or copy - keep successful copy actions quiet while preserving usage and failure feedback	2026-04-25 15:07:29 -05:00
Brooklyn Nicholson	a68793b6c4	refactor(tui): share remote shell detection Reuse the platform helper for SSH-aware copy hints so hotkey display and input handling cannot drift.	2026-04-25 14:55:28 -05:00
Brooklyn Nicholson	bcc5362432	fix(tui): honor client copy shortcut over ssh - accept forwarded Cmd+C for selection copy in SSH sessions even when Hermes runs on Linux - keep local Linux Alt+C from acting as copy and update TUI hotkey hints for remote shells	2026-04-25 14:44:39 -05:00
brooklyn!	283c8fd6e2	Merge pull request #15755 from NousResearch/bb/tui-model-flag fix(tui): honor launch model overrides	2026-04-25 14:30:26 -05:00
Brooklyn Nicholson	919274b60e	fix(tui): align overlay q shortcut casing Keep shared overlay close behavior consistent with pager and agents overlays by binding lowercase q only.	2026-04-25 14:26:35 -05:00
Brooklyn Nicholson	6e83d90eb4	refactor(tui): tighten overlay helpers - rename overlay help text component to match its role - share picker window math across model, session, and skills overlays	2026-04-25 14:23:45 -05:00
Brooklyn Nicholson	c6fdf48b79	fix(tui): sync inference model after switches - keep HERMES_INFERENCE_MODEL aligned with HERMES_MODEL after in-TUI model switches - clarify static provider detection remapping docs	2026-04-25 14:17:57 -05:00
Brooklyn Nicholson	a046483e86	fix(tui): share overlay close controls - add reusable overlay key and help-text helpers for picker-style overlays - make model, session, skills, and pager hints consistently support Esc/q close behavior	2026-04-25 14:17:04 -05:00
Brooklyn Nicholson	fdcbd2257b	fix(tui): resolve startup model aliases statically - expand short model aliases like sonnet/opus via static catalogs during startup runtime resolution - keep startup alias resolution network-free and add regression tests in models and tui gateway suites	2026-04-25 14:13:02 -05:00
Brooklyn Nicholson	48bdd2445e	fix(tui): apply ui-tui fix pass and restore type-check - run the requested ui-tui lint+format pass and include resulting formatting updates - guard text-measure cache eviction key in hermes-ink so ui-tui type-check stays green	2026-04-25 14:08:54 -05:00
Brooklyn Nicholson	5e52011de3	fix(tui): bind provider as model alias	2026-04-25 13:58:59 -05:00
Brooklyn Nicholson	e48a497d16	fix(tui): share static model detection	2026-04-25 13:56:16 -05:00
Brooklyn Nicholson	2dfcc8087a	fix(tui): avoid network lookup during startup	2026-04-25 13:47:18 -05:00
Brooklyn Nicholson	4db58d45d4	fix(tui): address startup provider review	2026-04-25 13:29:15 -05:00
Brooklyn Nicholson	57b43fdd4b	fix(tui): preserve provider precedence on startup	2026-04-25 13:25:43 -05:00
Brooklyn Nicholson	e9c47c7042	fix(tui): honor launch model overrides	2026-04-25 13:21:59 -05:00
brooklyn!	ee0728c6c4	Merge pull request #15351 from helix4u/fix/tui-rebuild-missing-ink-bundle fix(tui): rebuild when ink bundle is missing	2026-04-25 13:14:23 -05:00
codez	9daa0620a6	fix(agent): ordering fix in _copy_reasoning_content_for_api — cross-provider reasoning isolation Fix logic-ordering bug where normalized_reasoning promotion returns before the DeepSeek/Kimi needs_empty_reasoning guard, causing cross-provider reasoning content (MiniMax → DeepSeek) to leak into reasoning_content and trigger HTTP 400. Changes: - Reorder branching: existing reasoning_content check first - Add 'not has_reasoning' guard so poisoned histories (no reasoning) still get '' injected for DeepSeek/Kimi - Healthy same-provider reasoning promotion path unchanged Refs: #15250, #15213	2026-04-26 02:04:52 +08:00
kshitij	648b89911f	fix: use output_text for assistant message content in Codex Responses API (#15690 ) The Codex Responses API rejects input_text inside assistant messages — only output_text and refusal are valid content types for assistant role. _chat_content_to_responses_parts() previously hardcoded all text content to input_text regardless of the message role. When an assistant message had list-format content (multimodal or structured), this produced invalid input_text parts that the API rejected with: Invalid value: 'input_text'. Supported values are: 'output_text' and 'refusal'. Fix: add a role parameter to _chat_content_to_responses_parts() that selects output_text for assistant messages and input_text for user messages. Thread this through _chat_messages_to_responses_input() and _preflight_codex_input_items(). Fixes #15687	2026-04-25 10:13:29 -07:00
kshitijk4poor	7c17accb29	fix: /stop now immediately aborts streaming retry loop When a user sends /stop during a streaming API call, the outer poll loop detects _interrupt_requested and closes the HTTP connection. However, the inner _call() thread catches the connection error and enters its retry loop — opening a FRESH connection without checking the interrupt flag. On slow providers like ollama-cloud, each retry attempt blocks for the full stream-read timeout (120s+). With 3 retry attempts this caused 510+ second delays between /stop and actual response — the agent appeared completely unresponsive despite the stop being acknowledged. Fix: add an _interrupt_requested check at the top of the streaming retry loop so the agent exits immediately instead of retrying. Also fix log truncation: all session key logging in gateway/run.py used [:20] or [:30] slices, which truncated 'agent:main:telegram:dm:5690190437' (33 chars) to 'agent:main:telegram:' — losing the identifying chat type and user ID. Replace with full keys to make logs debuggable. Reported by user Sidharth Pulipaka via Telegram on ollama-cloud provider.	2026-04-25 09:51:39 -07:00
Teknium	5006b2204b	fix(update): honor RestartSec when polling for gateway respawn (#15707 ) The post-graceful-drain is-active poll used a fixed 10s timeout, but systemd's hermes-gateway.service has RestartSec=30 — so systemd won't respawn the unit for 30s after exit-75, and our poll gives up during the cooldown. Result: every 'hermes update' printed ⚠ hermes-gateway drained but didn't relaunch — forcing restart followed by a redundant 'systemctl restart' that kicked the newly- respawning gateway again (and re-started WhatsApp / Discord a second time in the process). Fix: read RestartUSec from the unit via 'systemctl show' and set the poll budget to max(10s, RestartSec + 10s slack). Units without RestartSec set (or value=infinity) fall back to the original 10s. Observed timeline from journalctl before fix: 08:56:22.262 old PID exits 75 08:56:32.707 systemd logs Stopped -> Started (10.4s gap, > 10s budget) After fix the poll covers 40s — comfortably inside RestartSec + slack. Validation: - RestartUSec parser tested against '30s', '100ms', '1min 30s', 'infinity', '', 'garbage', '500us', '2min' — all correct. - Against the live hermes-gateway.service: parses to 30.0s. - tests/hermes_cli/test_update_gateway_restart.py: 41/41 pass.	2026-04-25 09:08:27 -07:00
Teknium	a9fa73a620	feat(oneshot): add --model / --provider / HERMES_INFERENCE_MODEL (#15704 ) Makes hermes -z usable by sweeper without mutating user config. - Top-level -m/--model and --provider flags that apply to -z/--oneshot (mirrors hermes chat's plumbing). - HERMES_INFERENCE_MODEL env var as the parallel to HERMES_INFERENCE_PROVIDER for CI / scripted invocations. - resolve_runtime_provider() gets the requested provider; when --model is given without --provider, detect_provider_for_model() auto-selects the provider that serves it (same semantic as /model in an interactive session). - --provider without --model errors out with exit 2 — carrying a config model across to a different provider is usually wrong, and silently picking the provider's catalog default hides the mismatch. Config defaults still used when both flags are omitted (existing behavior). Validation (all live against OpenRouter): -z 'x' ....................... uses config default (opus-4.7) -z 'x' --model haiku-4.5 ..... haiku-4.5 via auto-detected openrouter -z 'x' --model ... --provider pair as given HERMES_INFERENCE_MODEL=... -z haiku-4.5 via env var -z 'x' --provider anthropic .. exits 2 with error to stderr	2026-04-25 08:55:36 -07:00

... 21 22 23 24 25 ...

7028 commits