hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

Author	SHA1	Message	Date
Teknium	d206e1f51d	fix(dashboard): keep local file browser on home	2026-06-13 06:39:38 -07:00
konsisumer	16fb573bae	fix(gateway): clear bloated compression binding on compression-exhaustion auto-reset After compression exhaustion the auto-reset created a fresh session but discarded reset_session()'s return value and left the Telegram topic binding pointing at the oversized compressed child. The next inbound message in that topic healed the binding forward and switch_session'd the freshly-reset lane back onto the bloated transcript, re-triggering compression exhaustion in a loop with a new session id each time. Capture the fresh entry and re-sync the topic binding to it so the next message starts clean. No-op on non-topic lanes. Regression of the #9893/#10063 auto-reset fix. Fixes #35809	2026-06-13 06:38:29 -07:00
Henrik Bentel	eed61a1251	fix(gemini): add role field to systemInstruction	2026-06-13 06:12:52 -07:00
Teknium	74c5158b10	fix(model): show bare custom endpoints in gateway picker (#45597 ) Surface direct model.provider=custom endpoints in /model picker output and keep explicit bare custom switches on the current endpoint instead of requiring a named providers/custom_providers row.	2026-06-13 06:05:30 -07:00
Teknium	0333a99925	fix: merge session-only model analytics rows (#45582 )	2026-06-13 05:52:42 -07:00
Tranquil-Flow	5acd185f7c	fix(moonshot): handle union type arrays in tool schemas	2026-06-13 05:51:41 -07:00
Adalsteinn Helgason	2667601c05	fix(tui): keep reasoning-only assistant turns visible on session resume A thinking-only assistant turn (reasoning present, empty visible text) is persisted with its reasoning fields and stays recallable from the transcript, but `_history_to_messages` dropped it as "empty" before its reasoning was attached. On desktop/TUI resume or reload the turn therefore vanished from the session view while the agent could still recall it from a fresh session -- exactly the "messages disappear when the LLM uses its thinking block, but a new session can recall them" symptom reported on #44022. Keep an assistant turn when it carries reasoning, even with empty text, so the desktop "Thinking…" disclosure has something to render. Genuinely empty turns (no text, no reasoning, no tool calls) are still filtered out. Refs #44022 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-13 05:51:05 -07:00
Adalsteinn Helgason	643dc82793	Fix custom provider identity loss in session persistence _runtime_model_config persisted the live agent's RESOLVED provider into the session row's model_config JSON. For any named providers:/ custom_providers: entry, agent.provider is the literal string "custom", so the entry name was lost (and the api_key is deliberately never persisted). On session.resume or _reset_session_agent the stored provider="custom" fed resolve_runtime_provider(requested="custom"), which cannot match a named entry — the rebuild either raised "No LLM provider configured" or silently resolved placeholder credentials against the patched-back base_url. Persist the REQUESTED/entry identity instead: a new reverse lookup find_custom_provider_identity(base_url) maps the endpoint URL back to the canonical custom:<name> menu key. _runtime_model_config stores that key; _make_agent performs the same recovery for rows persisted before the fix, falling back to passing the stored base_url as explicit_base_url so the direct-alias branch still targets the session's endpoint when no entry matches. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-13 05:51:05 -07:00
Haozhe Zhang	e256f4aae4	fix(gateway): don't restore a bare billing provider as the resumed session's provider `_stored_session_runtime_overrides` restored the session provider from `billing_provider` when `model_config` had no explicit provider. For a `custom:<name>` endpoint that only ran normal turns (no `/model` switch), the persisted `billing_provider` is the bare billing bucket `"custom"`, which `agent_init` treats as non-routable, so `session.resume` failed with "No LLM provider configured" even though new chats and CLI `--resume` work. Only restore an explicit `model_config.provider`; skip a bare billing bucket (`auto`/`openrouter`/`custom`) so resume falls back to the configured default, matching the CLI path. Fixes #44022	2026-06-13 05:51:05 -07:00
Teknium	cb125c2b3f	fix(kanban): pin assigned profile toolsets for workers (#45590 )	2026-06-13 05:50:09 -07:00
Teknium	a59d5e37e8	feat(telegram): make rich messages always on (#45584 ) Remove the rich_messages config toggle entirely so Telegram replies always try the Bot API 10.1 rich-message path first, with the existing MarkdownV2 fallback/latch behavior for unsupported endpoints and per-message failures. Restore the Telegram platform hint to encourage rich Markdown tables/task lists/math now that the rich path is the default, and remove the config/docs surface for the old toggle.	2026-06-13 05:45:11 -07:00
Teknium	4b646bc21e	fix(auxiliary): preserve main provider base url (#45587 )	2026-06-13 05:44:18 -07:00
Teknium	62b4618e9a	fix(dashboard): scope sessions and analytics to selected profile (#45598 )	2026-06-13 05:42:38 -07:00
H-Ali13381	2abcae9678	fix(cli): preserve renderer state on resize	2026-06-13 05:40:18 -07:00
xxxigm	c814d3d1dd	test(installer): regression for unmerged-index update failure Functional bash test drives install.sh's autostash block against a throwaway repo with a real conflicted index and asserts the stash now succeeds and the unmerged entries are cleared (previously `git stash` failed with "could not write index"). Source-order assertions cover both scripts to ensure the `git reset` clear runs before `git stash push` (a no-op otherwise).	2026-06-13 05:19:44 -07:00
Teknium	aa0798352a	fix(auth): self-heal missing Codex access tokens Recover Codex singleton auth entries that have a refresh token but no access token by adopting a valid Codex CLI token pair, matching the cron-time failure mode before falling back to the credential pool.	2026-06-13 05:15:26 -07:00
Kennedy Umege	311ff967de	review: validate refresh_token, path-agnostic recovery log, map author email Addresses PR review feedback: - Validate refresh_token (not only access_token) before persisting the re-imported Codex token, so a half-token payload can't silently break the next refresh cycle. - Make the recovery log path-agnostic ("Codex CLI auth.json") since _import_codex_cli_tokens can read $CODEX_HOME, not only ~/.codex. - Add regression test: relogin-required + imported token missing refresh_token -> re-raise and persist nothing. - Map kenmege@yahoo.com -> Kenmege in scripts/release.py AUTHOR_MAP (fixes the check-attribution job). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-13 05:15:26 -07:00
Kennedy Umege	bd66e7e3fb	fix(auth): self-heal Codex refresh_token rotation by reimporting from ~/.codex Hermes keeps its own copy of the Codex OAuth token per profile and at the top level, separate from the Codex CLI's ~/.codex/auth.json. OAuth refresh_tokens are single-use, so when the Codex CLI (or another Hermes process) rotates the shared token, the frozen copy's refresh_token goes stale and refresh_codex_oauth_pure fails with a relogin-required error (invalid_grant / refresh_token_reused / 401). Today that surfaces as a hard 401 on the turn — idle profiles and desktop sessions 401 "token_expired" until a manual re-auth — even though ~/.codex/auth.json holds a fresh token. _refresh_codex_auth_tokens now falls back to _import_codex_cli_tokens() (the canonical Codex CLI store) when the stored refresh_token is rejected, adopts and persists the fresh token, and lets the in-flight retry succeed. This complements PR #6525 (force relogin on 401/403): we attempt automatic recovery before surfacing a relogin prompt. Transient failures (e.g. 429 quota, relogin_required=False) are never self-healed — the stored token is still valid there — so they re-raise unchanged, and the happy path is untouched. Adds tests/hermes_cli/test_auth_codex_self_heal.py covering: self-heal on invalid_grant, no self-heal on 429 quota, re-raise when ~/.codex is absent, and happy-path-unchanged. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-13 05:15:26 -07:00
Teknium	2681c5a12d	fix(photon): correct gateway start command (#45566 )	2026-06-13 05:14:59 -07:00
Teknium	905ed413d1	fix(doctor): avoid unsafe npm audit fallback Root-level npm audit fix can crash with isDescendantOf on the same monorepo tree, so workspace audit advisories should explain the lockfile-bump path instead of recommending another manual npm fix command.	2026-06-13 05:09:56 -07:00
xxxigm	bea6c1c01f	test(doctor): assert audit-fix hint avoids crashing form and explains build-tool advisories	2026-06-13 05:09:56 -07:00
Black-Kylin	202e318cb1	fix(gateway): sync compression session splits before failures Salvages PR #25747 by preserving gateway session rotation even when a post-compression model call fails before returning final content. Co-authored-by: Hermes <127238744+teknium1@users.noreply.github.com>	2026-06-13 04:51:59 -07:00
helix4u	2d474e39c7	fix(acp): preserve memory provider tools	2026-06-13 04:51:44 -07:00
Teknium	2a5dc0ef3d	fix(slack): make video attachments available to agents (#45512 )	2026-06-13 03:33:27 -07:00
Teknium	197337cc47	fix(gateway): suppress duplicate final stream sends (#45517 )	2026-06-13 03:23:44 -07:00
Teknium	8cf9d8689d	fix(desktop): keep composer usable during reconnect (#45488 ) * feat(cli): add --safe-mode troubleshooting flag Inspired by Claude Code v2.1.169 (June 2026): run Hermes with all customizations disabled to isolate setup problems from product bugs. --safe-mode implies --ignore-user-config and --ignore-rules, and additionally skips plugin discovery (hermes_cli/plugins.py) and MCP server loading (tools/mcp_tool.py) via the internal HERMES_SAFE_MODE env bridge. * fix(desktop): keep composer usable during reconnect	2026-06-13 02:36:09 -07:00
Teknium	3803e5fc28	fix(agent): don't treat custom:<name> pools as cross-provider mismatch (#45289 ) Custom endpoints carry two naming conventions for the same provider: the agent's provider attribute is the generic 'custom' label while the pool is keyed 'custom:<normalized-name>'. The defensive guard in recover_with_credential_pool compared them literally, logged 'Credential pool provider mismatch: pool=custom:<name>, agent=custom', and skipped recovery — so 401 refresh and 429 rotation never ran for ANY custom-provider user (seen in the field on a Fireworks setup whose dead key burned full retry cycles every turn with the skip warning on each one). Accept the pair only when the agent's CURRENT base_url resolves to the same pool key via get_custom_provider_pool_key, preserving the guard's original purpose (#33088/#33163): a fallback provider or a different custom endpoint still skips pool mutation.	2026-06-13 02:01:09 -07:00
xxxigm	b6c7ebf028	fix(tui): honor provider_routing config in the desktop/TUI backend (#44953 ) * fix(tui): honor provider_routing config in the desktop/TUI backend The messaging gateway and classic CLI both read `provider_routing` from config.yaml and pass the OpenRouter routing prefs (only / ignore / order / sort / require_parameters / data_collection) into the agent. The tui_gateway backend that powers the desktop app and TUI never did, so it built agents with every routing pref left at its default — OpenRouter then selected providers freely (effectively at random), ignoring the user's config. Load `provider_routing` in `_make_agent` and forward the same six prefs the gateway does, restoring parity across CLI / gateway / desktop. Background subagent kwargs already propagate these from the parent agent, so they now inherit correctly too. * test(tui): cover provider_routing forwarding in _make_agent Asserts the six OpenRouter routing prefs flow from config.yaml into AIAgent, and that an absent provider_routing section forwards None/False (unchanged behavior for users who never configured routing).	2026-06-13 02:58:15 -05:00
Teknium	7e46533d9f	test: compressed-summary metadata flag set in-process, stripped on wire	2026-06-12 16:47:15 -07:00
Teknium	8905ee6b8a	fix(agent): rewind flush cursor exactly when repair compacts before the cursor Follow-up to the #44837 clamp: a min() clamp only fixes cursor overshoot past the new end of the list. When repair_message_sequence drops/merges messages at indexes below the cursor, the clamp leaves the cursor pointing past unflushed rows and the turn-end flush silently skips them. Extract repair_message_sequence_with_cursor(): snapshot the flushed prefix by object identity before repair, then recompute the cursor as the count of surviving flushed messages. Falls back to the clamp when no snapshot is available. Keeps the safety guard in _flush_messages_to_session_db. Adds targeted tests for overshoot, before-cursor compaction, no-repair, bare-agent, and the flush guard.	2026-06-12 16:29:01 -07:00
konsisumer	aec38855b5	fix(agent): preserve recent turns during compression	2026-06-12 16:26:58 -07:00
Teknium	135fe90166	fix(profiles): backfill .env for pre-existing profiles on hermes update (#45247 ) Profiles created before #44792 have no .env. Now that the Channels/Keys endpoints are profile-scoped (no os.environ fallback), those profiles would show everything as unconfigured. hermes update now copies the default install's .env into each named profile that lacks one (0600, never overwrites, placeholder fallback when the root has no .env), so existing users keep the credentials they were effectively running with.	2026-06-12 15:42:14 -07:00
xxxigm	68536d4375	test(compressor): regression coverage for assistant-tail anchor + compaction rollup (#29824 ) 21 cases pinning the new ``_ensure_last_assistant_message_in_tail`` anchor and its interaction with the existing tail-cut path: * ``TestFindLastAssistantMessageIdx`` — helper contract: prefers a content-bearing assistant message, skips ``tool_calls``-only stubs, multimodal text-block content counts, falls back to "any assistant" when no content-bearing reply exists, honours ``head_end``, returns -1 when there's none. * ``TestEnsureLastAssistantMessageInTail`` — direct: no-op when already in the tail, walks ``cut_idx`` back when the reply is in the compressed middle, never crosses into the head region, re-aligns through a preceding ``tool_call`` / ``tool_result`` group instead of orphaning it. * ``TestFindTailCutByTokensAnchorsAssistant`` — integration: reporter repro (long tool-output run after the visible reply) now preserves the reply; user and assistant anchors compose in a single tail-cut call; a soft-ceiling-overrunning oversized tool result no longer strands the prior reply. * ``TestCompactionRollupReproduction`` — end-to-end through ``compress()`` with a stubbed ``_generate_summary``: the visible reply text survives either as its own standalone assistant message (normal path) or concatenated onto the merged summary tail (double-collision path the WebUI then re-splits). The standalone-summary case is asserted strictly (exactly one summary row, exactly one separate assistant row carrying the reply) — that's the dominant path and any drift there reintroduces the original bug. * ``TestSourceGuardrail`` — static asserts on ``agent/context_compressor.py``: the helper exists, the anchor is wired into ``_find_tail_cut_by_tokens`` AFTER the user-message anchor (so chaining is monotonic), the content-bearing preference is preserved, and the issue number is referenced so future bisects can find this fix.	2026-06-12 15:41:57 -07:00
Teknium	7a318aae22	fix(profiles): exclude session history, backups, and snapshots from --clone-all (#45246 ) --clone-all copied the source profile's state.db, sessions/, backups/, state-snapshots/, and checkpoints/ into the new profile. These are per-profile history: a 49GB copy in practice (15GB snapshots + 11GB backup archives + 16GB state.db + 6.4GB sessions), and restoring a copied backup inside the clone would resurrect the SOURCE profile's state. A clone is a fresh workspace; history stays with the source. New _CLONE_ALL_HISTORY_EXCLUDE_ROOT set, applied at root level for ANY source profile (named profiles accumulate the same artifacts), unlike the default-gated infrastructure excludes. Nested same-name dirs still copy. Docs and the post-create CLI message updated to match; profile export / hermes backup remain the full-history paths.	2026-06-12 15:41:50 -07:00
Tranquil-Flow	749b7219c4	fix(compression): always append END OF CONTEXT SUMMARY marker to standalone summaries regardless of role When the compression summary lands as an assistant-role message (head ends with user), the end marker was not appended. Models may regurgitate the summary text as their own visible output when there's no clear boundary signal (#33256). The end marker was already appended for user-role summaries (#11475, #14521) but the assistant-role path was missed in the original fix. This ensures ALL standalone summary messages carry the boundary marker, preventing summary text from leaking into user-visible chat output.	2026-06-12 15:05:00 -07:00
Teknium	a118b94a85	fix(dashboard): skill installs from the dashboard silently auto-cancel (#45150 ) The dashboard's /api/skills/hub/install (and the new-profile hub_skills path) spawned `hermes skills install <id>` with stdin=DEVNULL but without --yes. do_install()'s 'Confirm [y/N]' prompt hit EOF, defaulted to 'n', and printed 'Installation cancelled.' into a background log the user never sees — every dashboard install no-opped. Pass --yes on both spawn sites, matching the uninstall endpoint which already passed --yes. The dashboard install button is the explicit user consent, same as the TUI/slash-command skip_confirm rationale. Repro: spawned the exact argv with stdin=DEVNULL against a temp HERMES_HOME — without --yes it cancels, with --yes the skill installs.	2026-06-12 12:58:36 -07:00
Teknium	fca84fe20b	test: regression guard for Nous 429 fallback re-entry; AUTHOR_MAP entry	2026-06-12 12:21:29 -07:00
Teknium	dc467488a7	test: assert typing-stop-before-callback as an invariant, not a call count The shared _stop_typing_refresh cleanup makes up to two bounded stop_typing attempts; the old assertion pinned exactly one typing-stopped event before callback-start.	2026-06-12 12:02:41 -07:00
Flownium	331cb38e21	fix: stop Discord typing after replies	2026-06-12 12:02:41 -07:00
Teknium	fa5e98facb	fix(send): helpful error when --file gets a binary; document MEDIA: attachments (#45116 ) A user passing an image to `hermes send --file` got a raw UnicodeDecodeError ('utf-8 codec can't decode byte 0x89...') with no hint that media delivery goes through the MEDIA:<path> directive. - send_cmd: catch UnicodeDecodeError separately and print a usage error explaining --file is for text bodies, with copy-pasteable MEDIA: and [[as_document]] examples using the user's own path - --file help text + epilog now mention MEDIA: - docs: new 'Sending images and other media' section on the hermes send reference page	2026-06-12 11:48:06 -07:00
Teknium	652dd9c9f2	fix: rich messages follow-ups — reply_parameters, send latch, opt-in default - Use reply_parameters per the sendRichMessage spec instead of the undocumented reply_to_message_id scalar (silently ignored -> reply anchor quietly dropped). - Latch rich sends off after an endpoint-capability failure (old PTB / server without sendRichMessage) so every later reply doesn't pay a doomed extra roundtrip; per-message BadRequests do NOT latch. - Default rich_messages to OFF (opt-in) while the day-old Bot API 10.1 endpoint is validated live; revert the prompt-hint table guidance until the default flips on. - Tests: reply_parameters shape, send-latch behavior, BadRequest non-latch; rich tests opt in explicitly via extra.	2026-06-12 11:47:54 -07:00
ITheEqualizer	05b9c84ca4	Add Telegram Bot API 10.1 rich message support Introduce opportunistic support for Telegram Bot API 10.1 rich messages by sending raw agent Markdown via sendRichMessage and streaming previews via sendRichMessageDraft. Implements a rich-path fast‑path in gateway/platforms/telegram.py (RICH_MESSAGE_MAX_BYTES=32768, feature gate platforms.telegram.extra.rich_messages, bot capability checks, routing/thread handling, and conservative fallback rules: permanent/capability errors fall back to the legacy MarkdownV2 path, transient/network errors are surfaced without legacy-resend). Also add a latch for draft capability failures (_rich_draft_disabled) and preserve legacy chunking and draft behavior when needed. Update agent prompt hints (telegram encourages rich Markdown/tables), add CLI config example option, update English and Chinese docs to describe rich messages and fallbacks, and add/adjust tests for rich send and draft behavior.	2026-06-12 11:47:54 -07:00
teknium1	6b4073648e	fix(tui): config.yaml wins over env model seed in per-turn sync Hosted instances set HERMES_INFERENCE_MODEL as a provision-time seed in the container env. _config_model_target() previously went through _resolve_model() (env-first), so on hosted VPS the sync target stayed pinned to the seed and dashboard model changes never reached an open chat -- the exact scenario the sync exists to fix. The sync target now reads config.yaml first and only falls back to the env vars when config has no model. Startup resolution (_resolve_model) is unchanged.	2026-06-12 11:03:44 -07:00
IAvecilla	bc3f4ed70f	Skip redundant model switch	2026-06-12 11:03:44 -07:00
IAvecilla	8c3c08c50b	Update implementation to make it cleaner	2026-06-12 11:03:44 -07:00
IAvecilla	c61815232a	Update model correctly when updating from dashboard	2026-06-12 11:03:44 -07:00
ethernet	1e25358a8f	refactor(desktop): use port 0 for ephemeral port discovery instead of PortPool reservation Replace the PortPool-based port reservation system (9120-9199 range) with OS-assigned ephemeral ports via --port 0. Before: Desktop probed a hardcoded port range, reserved ports in-process to close TOCTOU races, and passed the chosen port to the dashboard via CLI arg. After: Desktop spawns dashboard with --port 0, parses the actual port from a stdout announcement line (HERMES_DASHBOARD_READY port=<N>), and uses that for WebSocket connections. Changes: - web_server.py: add --port 0 support with SO_REUSEADDR pre-bind + announcement; add EADDRINUSE preflight for explicit ports - main.cjs: remove PortPool, PORT_FLOOR/CEILING, pickPort(), isPortAvailable(); add waitForDashboardPort() stdout parser - Delete port-pool.cjs and port-pool.test.cjs (106 lines removed) Net effect: eliminates the entire TOCTOU-mitigation reservation infrastructure and arbitrary port range constraints. OS handles port allocation natively.	2026-06-12 14:02:19 -04:00
ethernet	6ff39c31ad	fix(tests): guard against real 'hermes update' subprocess spawns in conftest Extends _live_system_guard in tests/conftest.py to block any subprocess call that would run 'hermes update' (or 'python -m hermes_cli.main update') against the real checkout. These commands run git fetch origin + git pull, overwriting repo files like pyproject.toml mid-test-run and corrupting every subsequent subprocess that reads them. The spawned process uses setsid / start_new_session=True so it's invisible to pytest's process tree (PPid=1) — the corruption was essentially undetectable without explicit inotify/SHA watchdogs. Root cause of #43703 CI failures: tests in TestUpdateCommandPlatformGate called _handle_update_command() with HERMES_MANAGED='' and no Popen mock, causing the code to fall through and spawn a real 'hermes update --gateway' that overwrote pyproject.toml with origin/main's content (which still had '--timeout=30 --timeout-method=thread' in addopts while the PR had already removed pytest-timeout). The guard covers all three invocation patterns: - 'hermes update' / 'hermes update --gateway' (direct or via setsid bash -c) - 'python -m hermes_cli.main update --gateway' - '.venv/bin/hermes update' (absolute path variant) Does not false-positive on: git update-index, apt-get update, pip install --upgrade, or any command lacking 'hermes'/'hermes_cli'.	2026-06-12 13:42:42 -04:00
ethernet	c41a6534cf	fix(tests): mock subprocess.Popen in all _handle_update_command tests	2026-06-12 13:42:42 -04:00
ethernet	2f9d18711f	fix(ci): remove pytest-timeout, use per-file timeout only fix(ci): write a new cache for test durations every time change(ci): rip out error 4 retries because we found the real bug	2026-06-12 13:42:42 -04:00

1 2 3 4 5 ...

5409 commits