hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-26 01:01:40 +00:00

Author	SHA1	Message	Date
Stefan	654d61ab6f	feat(status-bar): per-prompt elapsed stopwatch Adds a per-prompt elapsed timer to the CLI status bar (live ⏱ while the turn runs, frozen ⏲ after completion, resets on next prompt). Fills the gap left by the KawaiiSpinner — the spinner only shows elapsed time while actively animating, so it disappears between tool calls and after the turn finishes. Status bar is always pinned, so users can glance down and see how long the current/last prompt has been running. - New instance vars: _prompt_start_time, _prompt_duration - Timer starts before agent_thread.start() and freezes once the thread has exited (both interrupt and normal-completion paths) - _format_prompt_elapsed() formats s/m/h/d with seconds visible at all scales, trailing zeros hidden on exact boundaries, negative clamp - Displayed in the wide (>=76 col) status bar as position 7, after the session duration timer - Uses width-1 glyphs (⏱/⏲, no variation selector) to stay aligned in monospace terminals	2026-04-20 02:59:57 -07:00
Lumen Radley	a2b5627e6d	feat(cli): add editor workflow for drafts	2026-04-20 02:53:40 -07:00
Teknium	09ced16ecc	fix(cli): apply markdown stripping to background-task and /btw response panels Follow-up to #12262 — extend final_response_markdown behavior to the other two final-response Panel render sites (background task completion and /btw responses) so users see consistent plain-text output everywhere.	2026-04-20 02:53:40 -07:00
Lumen Radley	177e6eb3da	feat(cli): strip markdown formatting from final replies	2026-04-20 02:53:40 -07:00
Lumen Radley	22655ed1e6	feat(cli): improve multiline previews	2026-04-20 02:53:40 -07:00
Teknium	2614586306	chore(release): add lumenradley to AUTHOR_MAP	2026-04-20 02:53:40 -07:00
Teknium	93f9db59b2	fix(doctor): update config validation for current auth.py API Follow-up for #3171 cherry-pick — the contributor's validation block called get_provider_credentials() which doesn't exist on current main. Replaces it with get_auth_status() limited to API-key providers in PROVIDER_REGISTRY so providers without a registry entry (openrouter, anthropic, custom) don't trigger false 'not authenticated' failures. Also runs the provider name through resolve_provider() so aliases like 'glm'/'moonshot' validate correctly. Adds StefanIsMe to AUTHOR_MAP.	2026-04-20 02:41:25 -07:00
Stefan	954dd8a4e0	fix(doctor): catch OpenRouter 402/429 and validate model/provider config Discovered via real user session where hermes doctor missed two failures: 1. OpenRouter HTTP 402 (credits exhausted) fell through to the generic 'else' branch — printed yellow but never added to issues, so 'hermes doctor --fix' couldn't surface it. User had to manually find and run 'hermes config set model.provider minimax'. 2. A provider value 'main' (from a stale gateway state or config corruption) caused 'Unknown provider main' at runtime. Doctor checked that config.yaml existed but never validated that model.provider or model.default contained sane values. Changes: - OpenRouter health-check now catches 402 (out of credits) and 429 (rate limited) separately, prints a red X, and adds a fixable issue with the exact command to run. - New config validation after the config.yaml existence check: * Validates model.provider against PROVIDER_REGISTRY. Unknown provider names fail red with the full valid list. * Warns when model.default uses a provider-prefixed name (e.g. 'anthropic/claude-opus-4') but provider is not openrouter/custom. * Warns when model.provider is configured but no API key or base_url is set for it. Both fixes are fully general — they catch classes of errors, not hardcoded values specific to one user's setup.	2026-04-20 02:41:25 -07:00
Teknium	c470a325f7	chore(release): add Linux2010 and elmatadorgh to AUTHOR_MAP	2026-04-20 02:40:20 -07:00
elmatadorgh	1ec4a34dcd	test(error_classifier): broaden non-string message type coverage Adds regression tests for list-typed, int-typed, and None-typed message fields on top of the dict-typed coverage from #11496. Guards against other provider quirks beyond the original Pydantic validation case. Credit to @elmatadorgh (#11264) for the broader type coverage idea.	2026-04-20 02:40:20 -07:00
Linux2010	b869bf206c	fix(error_classifier): handle dict-typed message fields without crashing When API providers return Pydantic-style validation errors where body['message'] or body['error']['message'] is a dict (e.g. {"detail": [...]}), the error classifier was crashing with AttributeError: 'dict' object has no attribute 'lower'. The 'or ""' fallback only handles None/falsy values. A non-empty dict is truthy and passes through to .lower(), which fails. Fix: Wrap all 5 call sites with str() before calling .lower(). This is a no-op for strings and safely converts dicts to their repr for pattern matching (no false positives on classification patterns like 'rate limit', 'context length', etc.). Closes #11233	2026-04-20 02:40:20 -07:00
Teknium	acca428c81	chore: add haileymarshall to AUTHOR_MAP	2026-04-20 02:10:53 -07:00
haileymarshall	49282b6e04	fix(gemini): assign unique stream indices to parallel tool calls The streaming translator in agent/gemini_cloudcode_adapter.py keyed OpenAI tool-call indices by function name, so when the model emitted multiple parallel functionCall parts with the same name in a single turn (e.g. three read_file calls in one response), they all collapsed onto index 0. Downstream aggregators that key chunks by index would overwrite or drop all but the first call. Replace the name-keyed dict with a per-stream counter that persists across SSE events. Each functionCall part now gets a fresh, unique index, matching the non-streaming path which already uses enumerate(parts). Add TestTranslateStreamEvent covering parallel-same-name calls, index persistence across events, and finish-reason promotion to tool_calls.	2026-04-20 02:10:53 -07:00
Roy-oss1	d990fa52ed	docs(feishu): tighten processing reactions section Change-Id: I9547777b9a09f9cfeb333af9b016e4659a934e24	2026-04-20 02:04:57 -07:00
Roy-oss1	520edd3499	feat(feishu): show processing state via reactions on user messages Replaces the permanent "OK" receipt reaction with a 3-phase visual lifecycle: - Typing animation appears when the agent starts processing. - Cleared when processing succeeds — the reply message is the signal. - Replaced with CrossMark when processing fails. - Cleared when processing is cancelled or interrupted. When Feishu rejects the reaction-delete call, we keep the Typing in place and skip adding CrossMark. Showing both at once would leave the user seeing both "still working" and "done/failed" simultaneously, which is worse than a stuck Typing. A FEISHU_REACTIONS env var (default on) disables the whole lifecycle. User-added reactions with the same emoji still route through to the agent; only bot-origin reactions are filtered to break the feedback loop. Change-Id: I527081da31f0f9d59b451f45de59df4ddab522ba	2026-04-20 02:04:57 -07:00
Ruzzgar	60236862ee	fix(agent): fall back when rg is blocked for @folder references	2026-04-20 01:56:41 -07:00
Teknium	8a6aa5882e	fix(cli): sync session_id after compression and preserve original end_reason (#12920 ) After context compression (manual /compress or auto), run_agent's _compress_context ends the current session and creates a new continuation child session, mutating agent.session_id. The classic CLI held its own self.session_id that never resynced, so /status showed the ended parent, the exit-summary --resume hint pointed at a closed row, and any later end_session() call (from /resume <other> or /branch) targeted the wrong row AND overwrote the parent's 'compression' end_reason. This only affected the classic prompt_toolkit CLI. The gateway path was already fixed in PR #1160 (March 2026); --tui and ACP use different session plumbing and were unaffected. Changes: - cli.py::_manual_compress — sync self.session_id from self.agent.session_id after _compress_context, clear _pending_title - cli.py chat loop — same sync post-run_conversation for auto-compression - cli.py hermes -q single-query mode — same sync so stderr session_id output points at the continuation - hermes_state.py::end_session — guard UPDATE with 'ended_at IS NULL' so the first end_reason wins; reopen_session() remains the explicit escape hatch for re-ending a closed row Tests: - 3 new in tests/cli/test_manual_compress.py (split sync, no-op guard, pending_title behavior) - 2 new in tests/test_hermes_state.py (preserve compression end_reason on double-end; reopen-then-re-end still works) Closes #12483. Credits @steve5636 for the same-day bug report and @dieutx for PR #3529 which proposed the CLI sync approach.	2026-04-20 01:48:20 -07:00
Ruzzgar	f23123e7b4	fix(gateway): prevent scoped lock and resource leaks on connection failure	2026-04-20 01:44:36 -07:00
Teknium	a5063ff105	docs(providers): drop stale 'TODO: Phase 4' from get_provider docstring (#12902 ) User-defined providers from config.yaml are already resolved via resolve_provider_full() (which layers resolve_user_provider and resolve_custom_provider on top of get_provider). Refresh the docstring to reflect current reality and point future readers at the right entry point. No behaviour change. Closes #12309.	2026-04-20 01:41:27 -07:00
teyrebaz33	2d59afd3da	fix(docker): pass docker_mount_cwd_to_workspace and docker_forward_env to container_config in file_tools file_tools._get_file_ops() built a container_config dict for Docker/ Singularity/Modal/Daytona backends but omitted docker_mount_cwd_to_workspace and docker_forward_env. Both are read by _create_environment() from container_config, so file tools (read_file, write_file, patch, search) silently ignored those config values when running in Docker. Add the two missing keys to match the container_config already built by terminal_tool.terminal_tool(). Fixes #2672.	2026-04-20 00:58:16 -07:00
Junass1	4c50b4689e	fix(gateway): make Telegram DM topic config writes atomic	2026-04-20 00:57:53 -07:00
Teknium	4f24db4258	fix(compression): enforce 64k floor on aux model + auto-correct threshold (#12898 ) Context compression silently failed when the auxiliary compression model's context window was smaller than the main model's compression threshold (e.g. GLM-4.5-air at 131k paired with a 150k threshold). The feasibility check warned but the session kept running and compression attempts errored out mid-conversation. Two changes in _check_compression_model_feasibility(): 1. Hard floor: if detected aux context < MINIMUM_CONTEXT_LENGTH (64k), raise ValueError so the session refuses to start. Mirrors the existing main-model rejection at AIAgent.__init__ line 1600. A compression model below 64k cannot summarise a full threshold-sized window. 2. Auto-correct: when aux context is >= 64k but below the computed threshold, lower the live compressor's threshold_tokens to aux_context (and update threshold_percent to match so later update_model() calls stay in sync). Warning reworded to say what was done and how to persist the fix in config.yaml. Only ValueError re-raises; other exceptions in the check remain swallowed as non-fatal.	2026-04-20 00:56:04 -07:00
helix4u	03e3c22e86	fix(config): add stale timeout settings	2026-04-20 00:52:50 -07:00
Teknium	440764e013	chore(release): add salt-555 to AUTHOR_MAP	2026-04-20 00:47:40 -07:00
salt-555	12c8cefbce	fix(backup): handle files with pre-1980 timestamps ZipFile.write() raises ValueError for files with mtime before 1980-01-01 (the ZIP format uses MS-DOS timestamps which can't represent earlier dates). This crashes the entire backup. Add ValueError to the existing except clause so these files are skipped and reported in the warnings summary, matching the existing behavior for PermissionError and OSError.	2026-04-20 00:47:40 -07:00
helix4u	afba54364e	docs(config): document session_search auxiliary controls	2026-04-20 00:47:39 -07:00
helix4u	6ab78401c9	fix(aux): add session_search extra_body and concurrency controls Adds auxiliary.<task>.extra_body config passthrough so reasoning-heavy OpenAI-compatible providers can receive provider-specific request fields (e.g. enable_thinking: false on GLM) on auxiliary calls, and bounds session_search summary fan-out with auxiliary.session_search.max_concurrency (default 3, clamped 1-5) to avoid 429 bursts on small providers. - agent/auxiliary_client.py: extract _get_auxiliary_task_config helper, add _get_task_extra_body, merge config+explicit extra_body with explicit winning - hermes_cli/config.py: extra_body defaults on all aux tasks + session_search.max_concurrency; _config_version 19 -> 20 - tools/session_search_tool.py: semaphore around _summarize_all gather - tests: coverage in test_auxiliary_client, test_session_search, test_aux_config - docs: user-guide/configuration.md + fallback-providers.md Co-authored-by: Teknium <teknium@nousresearch.com>	2026-04-20 00:47:39 -07:00
cresslank	904f20d622	fix(tui): stop empty idle dequeue from triggering ready-state OOM	2026-04-20 00:42:10 -07:00
Teknium	edf1aecacd	chore(release): add cresslank to AUTHOR_MAP	2026-04-20 00:42:10 -07:00
helix4u	e96758291b	fix(signal): normalize direct recipients to UUIDs	2026-04-20 00:35:55 -07:00
kshitijk4poor	fd5df5fe8e	fix(camofox): honor auxiliary vision temperature\n\n- forward auxiliary.vision.temperature in camofox screenshot analysis\n- add regression tests for configured and default behavior	2026-04-20 00:32:09 -07:00
kshitijk4poor	9d88bdaf11	fix(browser): honor auxiliary.vision.temperature for screenshot analysis\n\n- mirror the vision tool's config bridge in browser_vision - add regression tests for configured and default temperature forwarding	2026-04-20 00:32:09 -07:00
kshitijk4poor	098d554aac	test: cover vision config temperature wiring\n\n- add regression tests for auxiliary.vision.temperature and timeout\n- add bugkill3r to AUTHOR_MAP for the salvaged commit	2026-04-20 00:32:09 -07:00
Saurabh	088bf9057f	fix: vision tool respects auxiliary.vision.temperature from config (#4661 ) The vision tool hardcoded temperature=0.1, ignoring the user's config.yaml setting. This broke providers like Kimi/Moonshot that require temperature=1 for vision models. Now reads temperature from auxiliary.vision.temperature, falling back to 0.1.	2026-04-20 00:32:09 -07:00
kshitijk4poor	e485bc60cd	test(kimi): cover api.moonshot.cn direct-call regressions\n\n- add run_agent coverage for the Moonshot China endpoint\n- add sync/async trajectory compressor coverage for api.moonshot.cn	2026-04-20 00:32:06 -07:00
kagura-agent	9b60ffc47f	fix: include api.moonshot.cn in public API temperature override (#12745 ) kimi-k2.5 on api.moonshot.cn/v1 rejects temperature=0.6 with HTTP 400, same as api.moonshot.ai. The public API check now matches both domains.	2026-04-20 00:32:06 -07:00
helix4u	8155ebd7c4	fix(gemini): sanitize tool schemas for Google providers	2026-04-20 00:26:18 -07:00
Teknium	a33e890644	fix(acp): silence 'Background task failed' noise on liveness-probe requests (#12855 ) Clients like acp-bridge send periodic bare `ping` JSON-RPC requests as a liveness probe. The acp router correctly returns JSON-RPC -32601 to the caller, which those clients already handle as 'agent alive'. But the supervisor task that ran the request then surfaces the raised RequestError via `logging.exception('Background task failed', ...)`, dumping a full traceback to stderr on every probe interval. Install a logging filter on the stderr handler that suppresses 'Background task failed' records only when the exception is an acp RequestError(-32601) for one of {ping, health, healthcheck}. Real method_not_found for any other method, other exception classes, other log messages, and -32601 logged under a different message all pass through untouched. The protocol response is unchanged — the client still receives a standard -32601 'Method not found' error back. Only the server-side stderr noise is silenced. Closes #12529	2026-04-20 00:10:27 -07:00
Teknium	e330112aa8	refactor(telegram): use entity-only mention detection Replaces the word-boundary regex scan with pure MessageEntity-based detection. Telegram's server emits MENTION entities for real @username mentions and TEXT_MENTION entities for @FirstName mentions; the text- scanning fallback was both redundant (entities are always present for real mentions) and broken (matched raw substrings like email addresses, URLs, code-block contents, and forwarded literal text). Entity-only detection: - Closes bug #12545 ("foo@hermes_bot.example" false positive). - Also fixes edge cases the regex fix would still miss: @handles inside URLs and code blocks, where Telegram does not emit mention entities. Tests rewritten to exercise realistic Telegram payloads (real mentions carry entities; substring false positives don't).	2026-04-20 00:10:22 -07:00
Tranquil-Flow	1e18e0503f	fix(telegram): use word-boundary matching for bot mention detection (#12545 )	2026-04-20 00:10:22 -07:00
JackJin	5157f5427f	chore(release): add jackjin1997 qq email to AUTHOR_MAP	2026-04-19 22:46:47 -07:00
JackJin	6c0c625952	fix(gateway): accept finalize kwarg in all platform edit_message overrides stream_consumer._send_or_edit unconditionally passes finalize= to adapter.edit_message(), but only DingTalk's override accepted the kwarg. Streaming on Telegram/Discord/Slack/Matrix/Mattermost/Feishu/ WhatsApp raised TypeError the first time a segment break or final edit fired. The REQUIRES_EDIT_FINALIZE capability flag only gates the redundant final edit (and the identical-text short-circuit), not the kwarg itself — so adapters that opt out of finalize still receive the keyword argument and must accept it. Add *, finalize: bool = False to the 7 non-DingTalk signatures; the body ignores the arg since those platforms treat edits as stateless (consistent with the base class contract in base.py). Add a parametrized signature check over every concrete adapter class so a future override cannot silently drop the kwarg — existing tests use MagicMock which swallows any kwarg and cannot catch this. Fixes #12579	2026-04-19 22:46:47 -07:00
Teknium	fc5fda5e38	fix(display): render <missing old_text> in memory previews instead of empty quotes (#12852 ) When the model omits old_text on memory replace/remove, the tool preview rendered as '~memory: ""' / '-memory: ""', which obscured what went wrong. Render '<missing old_text>' in that case so the failure mode is legible in the activity feed. Narrow salvage from #12456 / #12831 — only the display-layer fix, not the schema/API changes.	2026-04-19 22:45:47 -07:00
Tranquil-Flow	6a228d52f7	fix(webhook): validate HMAC signature before rate limiting (#12544 )	2026-04-19 22:45:08 -07:00
Tranquil-Flow	35e7bf6b00	fix(models): validate MiniMax models against static catalog (#12611 , #12460 , #12399 , #12547 )	2026-04-19 22:44:47 -07:00
Teknium	a4ba0754ed	test: drop platform-dependent _resolve_verify test file The new tests/test_resolve_verify_ssl_context.py used ssl.get_default_verify_paths().cafile which is None on macOS and several Linux builds, causing 3 of its 6 tests to fail portably. The existing tests/hermes_cli/test_auth_nous_provider.py already covers every _resolve_verify return path with tmp_path + monkeypatched ssl.create_default_context, which is platform-agnostic.	2026-04-19 22:44:35 -07:00
Tranquil-Flow	b53f74a489	fix(auth): use ssl.SSLContext for CA bundle instead of deprecated string path (#12706 )	2026-04-19 22:44:35 -07:00
Teknium	65a31ee0d5	fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 ) Third-party gateways that speak the native Anthropic protocol (MiniMax, Zhipu GLM, Alibaba DashScope, Kimi, LiteLLM proxies) now work end-to-end with the same feature set as direct api.anthropic.com callers. Synthesizes eight stale community PRs into one consolidated change. Five fixes: - URL detection: consolidate three inline `endswith("/anthropic")` checks in runtime_provider.py into the shared _detect_api_mode_for_url helper. Third-party /anthropic endpoints now auto-resolve to api_mode=anthropic_messages via one code path instead of three. - OAuth leak-guard: all five sites that assign `_is_anthropic_oauth` (__init__, switch_model, _try_refresh_anthropic_client_credentials, _swap_credential, _try_activate_fallback) now gate on `provider == "anthropic"` so a stale ANTHROPIC_TOKEN never trips Claude-Code identity injection on third-party endpoints. Previously only 2 of 5 sites were guarded. - Prompt caching: new method `_anthropic_prompt_cache_policy()` returns `(should_cache, use_native_layout)` per endpoint. Replaces three inline conditions and the `native_anthropic=(api_mode=='anthropic_messages')` call-site flag. Native Anthropic and third-party Anthropic gateways both get the native cache_control layout; OpenRouter gets envelope layout. Layout is persisted in `_primary_runtime` so fallback restoration preserves the per-endpoint choice. - Auxiliary client: `_try_custom_endpoint` honors `api_mode=anthropic_messages` and builds `AnthropicAuxiliaryClient` instead of silently downgrading to an OpenAI-wire client. Degrades gracefully to OpenAI-wire when the anthropic SDK isn't installed. - Config hygiene: `_update_config_for_provider` (hermes_cli/auth.py) clears stale `api_key`/`api_mode` when switching to a built-in provider, so a previous MiniMax custom endpoint's credentials can't leak into a later OpenRouter session. - Truncation continuation: length-continuation and tool-call-truncation retry now cover `anthropic_messages` in addition to `chat_completions` and `bedrock_converse`. Reuses the existing `_build_assistant_message` path via `normalize_anthropic_response()` so the interim message shape is byte-identical to the non-truncated path. Tests: 6 new files, 42 test cases. Targeted run + tests/run_agent, tests/agent, tests/hermes_cli all pass (4554 passed). Synthesized from (credits preserved via Co-authored-by trailers): #7410 @nocoo — URL detection helper #7393 @keyuyuan — OAuth 5-site guard #7367 @n-WN — OAuth guard (narrower cousin, kept comment) #8636 @sgaofen — caching helper + native-vs-proxy layout split #10954 @Only-Code-A — caching on anthropic_messages+Claude #7648 @zhongyueming1121 — aux client anthropic_messages branch #6096 @hansnow — /model switch clears stale api_mode #9691 @TroyMitchell911 — anthropic_messages truncation continuation Closes: #7366, #8294 (third-party Anthropic identity + caching). Supersedes: #7410, #7367, #7393, #8636, #10954, #7648, #6096, #9691. Rejects: #9621 (OpenAI-wire caching with incomplete blocklist — risky), #7242 (superseded by #9691, stale branch), #8321 (targets smart_model_routing which was removed in #12732). Co-authored-by: nocoo <nocoo@users.noreply.github.com> Co-authored-by: Keyu Yuan <leoyuan0099@gmail.com> Co-authored-by: Zoee <30841158+n-WN@users.noreply.github.com> Co-authored-by: sgaofen <135070653+sgaofen@users.noreply.github.com> Co-authored-by: Only-Code-A <bxzt2006@163.com> Co-authored-by: zhongyueming <mygamez@163.com> Co-authored-by: Xiaohan Li <hansnow@users.noreply.github.com> Co-authored-by: Troy Mitchell <i@troy-y.org>	2026-04-19 22:43:09 -07:00
Teknium	491cf25eef	test(voice): update existing voice_mode tests for platform-prefixed keys Follow-up to `40164ba1`. - _handle_voice_channel_join/leave now use event.source.platform instead of hardcoded Platform.DISCORD (consistent with other voice handlers). - Update tests/gateway/test_voice_command.py to use 'platform:chat_id' keys matching the new _voice_key() format. - Add platform isolation regression test for the bug in #12542. - Drop decorative test_legacy_key_collision_bug (the fix makes the collision impossible; the test mutated a single key twice, not a real scenario). - Adapter mocks in _sync_voice_mode_state_to_adapter tests now set adapter.platform = Platform.* (required by new isinstance check).	2026-04-19 22:36:00 -07:00
Tranquil-Flow	52a972e927	fix(gateway): namespace voice mode state by platform to prevent cross-platform collision (#12542 )	2026-04-19 22:36:00 -07:00

... 5 6 7 8 9 ...

5319 commits