hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-15 09:21:36 +00:00

Author	SHA1	Message	Date
Teknium	cecf84daf7	fix: extend hostname-match provider detection across remaining call sites Aslaaen's fix in the original PR covered _detect_api_mode_for_url and the two openai/xai sites in run_agent.py. This finishes the sweep: the same substring-match false-positive class (e.g. https://api.openai.com.evil/v1, https://proxy/api.openai.com/v1, https://api.anthropic.com.example/v1) existed in eight more call sites, and the hostname helper was duplicated in two modules. - utils: add shared base_url_hostname() (single source of truth). - hermes_cli/runtime_provider, run_agent: drop local duplicates, import from utils. Reuse the cached AIAgent._base_url_hostname attribute everywhere it's already populated. - agent/auxiliary_client: switch codex-wrap auto-detect, max_completion_tokens gate (auxiliary_max_tokens_param), and custom-endpoint max_tokens kwarg selection to hostname equality. - run_agent: native-anthropic check in the Claude-style model branch and in the AIAgent init provider-auto-detect branch. - agent/model_metadata: Anthropic /v1/models context-length lookup. - hermes_cli/providers.determine_api_mode: anthropic / openai URL heuristics for custom/unknown providers (the /anthropic path-suffix convention for third-party gateways is preserved). - tools/delegate_tool: anthropic detection for delegated subagent runtimes. - hermes_cli/setup, hermes_cli/tools_config: setup-wizard vision-endpoint native-OpenAI detection (paired with deduping the repeated check into a single is_native_openai boolean per branch). Tests: - tests/test_base_url_hostname.py covers the helper directly (path-containing-host, host-suffix, trailing dot, port, case). - tests/hermes_cli/test_determine_api_mode_hostname.py adds the same regression class for determine_api_mode, plus a test that the /anthropic third-party gateway convention still wins. Also: add asslaenn5@gmail.com → Aslaaen to scripts/release.py AUTHOR_MAP.	2026-04-20 22:14:29 -07:00
Teknium	5125a78283	chore(release): map yukipukikedy@gmail.com to Yukipukii1	2026-04-20 22:13:07 -07:00
jerilynzheng	e1b29c474e	chore: register contributor in AUTHOR_MAP for release-note attribution Adds zheng.jerilyn@gmail.com → jerilynzheng to scripts/release.py so the check-attribution CI workflow passes.	2026-04-20 21:02:28 -07:00
Teknium	34c5c2538e	chore: map Es1la contributor email for AUTHOR_MAP (#13294 ) Credit preserved for PR #13270 (WhatsApp Windows disconnect fix).	2026-04-20 20:53:10 -07:00
Teknium	5031aa37a2	chore(release): map mavrickdeveloper email for attribution	2026-04-20 20:52:50 -07:00
Teknium	e00d9630c5	fix: thread api_key through ollama num_ctx probe + author map Follow-up for salvaged PR #3185: - run_agent.py: pass self.api_key to query_ollama_num_ctx() so Ollama behind an auth proxy (same issue class as the LM Studio fix) can be probed successfully. - scripts/release.py AUTHOR_MAP: map @tannerfokkens-maker's local-hostname commit email.	2026-04-20 20:51:56 -07:00
Teknium	36e8435d3e	fix: follow-up for salvaged PRs #6293 , #7387 , #9091 , #13131 - Fix duplicate 'timezone' import in e2e conftest - Fix test_text_before_command_not_detected asserting send() is awaited when no agent is present in mock setup (text messages don't produce command output)	2026-04-20 14:56:04 -07:00
cdanis	4a424f1fbb	feat(send_message): add media delivery support for Signal Cherry-picked from PR #13159 by @cdanis. Adds native media attachment delivery to Signal via signal-cli JSON-RPC attachments param. Signal messages with media now follow the same early-return pattern as Telegram/Discord/Matrix — attachments are sent only with the last chunk to avoid duplicates. Follow-up fixes on top of the original PR: - Moved Signal into its own early-return block above the restriction check (matches Telegram/Discord/Matrix pattern) - Fixed media_files being sent on every chunk in the generic loop - Restored restriction/warning guards to simple form (Signal exits early) - Fixed non-hermetic test writing to /tmp instead of tmp_path	2026-04-20 13:24:15 -07:00
Teknium	5a2118a70b	test: add _resolve_path tests + AUTHOR_MAP entry for aniruddhaadak80	2026-04-20 12:29:31 -07:00
Teknium	f01e65196a	chore: add MassiveMassimo to AUTHOR_MAP	2026-04-20 11:56:19 -07:00
Mibayy	3273f301b7	fix(stt): map cloud-only model names to valid local size for faster-whisper (#2544 ) Cherry-picked from PR #2545 by @Mibayy. The setup wizard could leave stt.model: "whisper-1" in config.yaml. When using the local faster-whisper provider, this crashed with "Invalid model size 'whisper-1'". Voice messages were silently ignored. _normalize_local_model() now detects cloud-only names (whisper-1, gpt-4o-transcribe, etc.) and maps them to the default local model with a warning. Valid local sizes (tiny, base, small, medium, large-v3) pass through unchanged. - Renamed _normalize_local_command_model -> _normalize_local_model (backward-compat wrapper preserved) - 6 new tests including integration test - Added lowercase AUTHOR_MAP alias for @Mibayy Closes #2544	2026-04-20 05:18:48 -07:00
Teknium	9725b452a1	fix: extract _repair_tool_call_arguments helper, add tests, bound loop Follow-up for PR #12252 salvage: - Extract 75-line inline repair block to _repair_tool_call_arguments() module-level helper for testability and readability - Remove redundant 'import re as _re' (re already imported at line 33) - Bound the while-True excess-delimiter removal loop to 50 iterations - Add 17 tests covering all 6 repair stages - Add sirEven to AUTHOR_MAP in release.py	2026-04-20 05:12:55 -07:00
Sanjays2402	570f8bab8f	fix(compression): exclude completion tokens from compression trigger (#12026 ) Cherry-picked from PR #12481 by @Sanjays2402. Reasoning models (GLM-5.1, QwQ, DeepSeek R1) inflate completion_tokens with internal thinking tokens. The compression trigger summed prompt_tokens + completion_tokens, causing premature compression at ~42% actual context usage instead of the configured 50% threshold. Now uses only prompt_tokens — completion tokens don't consume context window space for the next API call. - 3 new regression tests - Added AUTHOR_MAP entry for @Sanjays2402 Closes #12026	2026-04-20 05:12:10 -07:00
Teknium	34ae13e6ed	chore: add jplew to AUTHOR_MAP	2026-04-20 05:10:23 -07:00
PStarH	00192d51f1	fix(install): quote PYTHON_PATH and UV_CMD for paths with spaces on macOS (#10009 ) Cherry-picked from PR #10019 by @PStarH. On macOS, uv stores Python in ~/Library/Application Support/uv/... which contains a space. Unquoted $PYTHON_PATH and $UV_CMD caused word-splitting under set -e, silently aborting install.sh. Quotes all variable expansions in check_python(): - "$PYTHON_PATH" in command invocations - "$UV_CMD" in uv calls - Outer quotes on $(...) assignments Closes #10009	2026-04-20 05:03:14 -07:00
houguokun	6cdab70320	fix(batch_runner): mark discarded no-reasoning prompts as completed (#9950 ) Cherry-picked from PR #10005 by @houziershi. Discarded prompts (has_any_reasoning=False) were skipped by `continue` before being added to completed_in_batch. On --resume they were retried forever. Now they are added to completed_in_batch before the continue. - Added AUTHOR_MAP entry for @houziershi Closes #9950	2026-04-20 04:56:06 -07:00
Teknium	7242afaa5f	chore: defer WhatsApp bridge install to first use (#12992 ) Remove eager npm install of @whiskeysockets/baileys during install.sh, install.ps1, and Docker build. The bridge deps are already installed on-demand by `hermes whatsapp` (Step 4 checks for node_modules and runs npm install if missing), so there is no need to pay the cost at initial install for users who never use WhatsApp.	2026-04-20 04:55:33 -07:00
Teknium	3218d58fc5	chore(release): add Swift42 to AUTHOR_MAP	2026-04-20 04:15:04 -07:00
Teknium	9f22977fc0	chore(release): add haileymarshall to AUTHOR_MAP	2026-04-20 03:10:19 -07:00
Teknium	0cff992f0a	chore(release): add alexzhu0 to AUTHOR_MAP	2026-04-20 03:07:32 -07:00
Teknium	649ef5c8f1	chore(release): add sjz-ks to AUTHOR_MAP	2026-04-20 03:04:06 -07:00
Teknium	8f4db7bbd5	chore(release): map withapurpose37@gmail.com -> StefanIsMe Author mapping for the salvaged PR #8191 contributor.	2026-04-20 02:59:57 -07:00
Teknium	2614586306	chore(release): add lumenradley to AUTHOR_MAP	2026-04-20 02:53:40 -07:00
Teknium	93f9db59b2	fix(doctor): update config validation for current auth.py API Follow-up for #3171 cherry-pick — the contributor's validation block called get_provider_credentials() which doesn't exist on current main. Replaces it with get_auth_status() limited to API-key providers in PROVIDER_REGISTRY so providers without a registry entry (openrouter, anthropic, custom) don't trigger false 'not authenticated' failures. Also runs the provider name through resolve_provider() so aliases like 'glm'/'moonshot' validate correctly. Adds StefanIsMe to AUTHOR_MAP.	2026-04-20 02:41:25 -07:00
Teknium	c470a325f7	chore(release): add Linux2010 and elmatadorgh to AUTHOR_MAP	2026-04-20 02:40:20 -07:00
Teknium	acca428c81	chore: add haileymarshall to AUTHOR_MAP	2026-04-20 02:10:53 -07:00
Teknium	440764e013	chore(release): add salt-555 to AUTHOR_MAP	2026-04-20 00:47:40 -07:00
Teknium	edf1aecacd	chore(release): add cresslank to AUTHOR_MAP	2026-04-20 00:42:10 -07:00
kshitijk4poor	098d554aac	test: cover vision config temperature wiring\n\n- add regression tests for auxiliary.vision.temperature and timeout\n- add bugkill3r to AUTHOR_MAP for the salvaged commit	2026-04-20 00:32:09 -07:00
JackJin	5157f5427f	chore(release): add jackjin1997 qq email to AUTHOR_MAP	2026-04-19 22:46:47 -07:00
Teknium	65a31ee0d5	fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 ) Third-party gateways that speak the native Anthropic protocol (MiniMax, Zhipu GLM, Alibaba DashScope, Kimi, LiteLLM proxies) now work end-to-end with the same feature set as direct api.anthropic.com callers. Synthesizes eight stale community PRs into one consolidated change. Five fixes: - URL detection: consolidate three inline `endswith("/anthropic")` checks in runtime_provider.py into the shared _detect_api_mode_for_url helper. Third-party /anthropic endpoints now auto-resolve to api_mode=anthropic_messages via one code path instead of three. - OAuth leak-guard: all five sites that assign `_is_anthropic_oauth` (__init__, switch_model, _try_refresh_anthropic_client_credentials, _swap_credential, _try_activate_fallback) now gate on `provider == "anthropic"` so a stale ANTHROPIC_TOKEN never trips Claude-Code identity injection on third-party endpoints. Previously only 2 of 5 sites were guarded. - Prompt caching: new method `_anthropic_prompt_cache_policy()` returns `(should_cache, use_native_layout)` per endpoint. Replaces three inline conditions and the `native_anthropic=(api_mode=='anthropic_messages')` call-site flag. Native Anthropic and third-party Anthropic gateways both get the native cache_control layout; OpenRouter gets envelope layout. Layout is persisted in `_primary_runtime` so fallback restoration preserves the per-endpoint choice. - Auxiliary client: `_try_custom_endpoint` honors `api_mode=anthropic_messages` and builds `AnthropicAuxiliaryClient` instead of silently downgrading to an OpenAI-wire client. Degrades gracefully to OpenAI-wire when the anthropic SDK isn't installed. - Config hygiene: `_update_config_for_provider` (hermes_cli/auth.py) clears stale `api_key`/`api_mode` when switching to a built-in provider, so a previous MiniMax custom endpoint's credentials can't leak into a later OpenRouter session. - Truncation continuation: length-continuation and tool-call-truncation retry now cover `anthropic_messages` in addition to `chat_completions` and `bedrock_converse`. Reuses the existing `_build_assistant_message` path via `normalize_anthropic_response()` so the interim message shape is byte-identical to the non-truncated path. Tests: 6 new files, 42 test cases. Targeted run + tests/run_agent, tests/agent, tests/hermes_cli all pass (4554 passed). Synthesized from (credits preserved via Co-authored-by trailers): #7410 @nocoo — URL detection helper #7393 @keyuyuan — OAuth 5-site guard #7367 @n-WN — OAuth guard (narrower cousin, kept comment) #8636 @sgaofen — caching helper + native-vs-proxy layout split #10954 @Only-Code-A — caching on anthropic_messages+Claude #7648 @zhongyueming1121 — aux client anthropic_messages branch #6096 @hansnow — /model switch clears stale api_mode #9691 @TroyMitchell911 — anthropic_messages truncation continuation Closes: #7366, #8294 (third-party Anthropic identity + caching). Supersedes: #7410, #7367, #7393, #8636, #10954, #7648, #6096, #9691. Rejects: #9621 (OpenAI-wire caching with incomplete blocklist — risky), #7242 (superseded by #9691, stale branch), #8321 (targets smart_model_routing which was removed in #12732). Co-authored-by: nocoo <nocoo@users.noreply.github.com> Co-authored-by: Keyu Yuan <leoyuan0099@gmail.com> Co-authored-by: Zoee <30841158+n-WN@users.noreply.github.com> Co-authored-by: sgaofen <135070653+sgaofen@users.noreply.github.com> Co-authored-by: Only-Code-A <bxzt2006@163.com> Co-authored-by: zhongyueming <mygamez@163.com> Co-authored-by: Xiaohan Li <hansnow@users.noreply.github.com> Co-authored-by: Troy Mitchell <i@troy-y.org>	2026-04-19 22:43:09 -07:00
Teknium	be3bec55be	chore(release): add draix to AUTHOR_MAP	2026-04-19 22:16:37 -07:00
Teknium	09195be979	docs: repoint tui.md skin reference to features/skins.md The example-skin.yaml was removed as part of the stale docs cleanup. Docusaurus features/skins.md covers the same material. Also update AUTHOR_MAP for balyan.sid@gmail.com → alt-glitch (actual GitHub login; balyansid returns 404).	2026-04-19 20:39:49 -07:00
Teknium	5d01fc4e6f	chore(attribution): add taeng02@icloud.com → taeng0204 Salvaged commit `0c652e9b` in this branch is authored by taeng02@icloud.com. check-attribution CI blocks PRs whose new author emails aren't in AUTHOR_MAP, so add the mapping to unblock #12680's salvage PR. GitHub username confirmed via `gh api users/taeng0204` (Taein Lim).	2026-04-19 18:54:35 -07:00
Teknium	af53039dbc	chore(release): add etherman-os and mark-ramsell to AUTHOR_MAP	2026-04-19 16:47:20 -07:00
Teknium	a3a4932405	fix(mcp-oauth): bidirectional auth_flow bridge + absolute expires_at (salvage #12025 ) (#12717 ) * [verified] fix(mcp-oauth): bridge httpx auth_flow bidirectional generator HermesMCPOAuthProvider.async_auth_flow wrapped the SDK's auth_flow with 'async for item in super().async_auth_flow(request): yield item', which discards httpx's .asend(response) values and resumes the inner generator with None. This broke every OAuth MCP server on the first HTTP response with 'NoneType' object has no attribute 'status_code' crashing at mcp/client/auth/oauth2.py:505. Replace with a manual bridge that forwards .asend() values into the inner generator, preserving httpx's bidirectional auth_flow contract. Add tests/tools/test_mcp_oauth_bidirectional.py with two regression tests that drive the flow through real .asend() round-trips. These catch the bug at the unit level; prior tests only exercised _initialize() and disk-watching, never the full generator protocol. Verified against BetterStack MCP: Before: 'Connection failed (11564ms): NoneType...' after 3 retries After: 'Connected (2416ms); Tools discovered: 83' Regression from #11383. * [verified] fix(mcp-oauth): seed token_expiry_time + pre-flight AS discovery on cold-load PR #11383's consolidation fixed external-refresh reloading and 401 dedup but left two latent bugs that surfaced on BetterStack and any other OAuth MCP with a split-origin authorization server: 1. HermesTokenStorage persisted only a relative 'expires_in', which is meaningless after a process restart. The MCP SDK's OAuthContext does NOT seed token_expiry_time in _initialize, so is_token_valid() returned True for any reloaded token regardless of age. Expired tokens shipped to servers, and app-level auth failures (e.g. BetterStack's 'No teams found. Please check your authentication.') were invisible to the transport-layer 401 handler. 2. Even once preemptive refresh did fire, the SDK's _refresh_token falls back to {server_url}/token when oauth_metadata isn't cached. For providers whose AS is at a different origin (BetterStack: mcp.betterstack.com for MCP, betterstack.com/oauth/token for the token endpoint), that fallback 404s and drops into full browser re-auth on every process restart. Fix set: - HermesTokenStorage.set_tokens persists an absolute wall-clock expires_at alongside the SDK's OAuthToken JSON (time.time() + TTL at write time). - HermesTokenStorage.get_tokens reconstructs expires_in from max(expires_at - now, 0), clamping expired tokens to zero TTL. Legacy files without expires_at fall back to file-mtime as a best-effort wall-clock proxy, self-healing on the next set_tokens. - HermesMCPOAuthProvider._initialize calls super(), then update_token_expiry on the reloaded tokens so token_expiry_time reflects actual remaining TTL. If tokens are loaded but oauth_metadata is missing, pre-flight PRM + ASM discovery runs via httpx.AsyncClient using the MCP SDK's own URL builders and response handlers (build_protected_resource_metadata_discovery_urls, handle_auth_metadata_response, etc.) so the SDK sees the correct token_endpoint before the first refresh attempt. Pre-flight is skipped when there are no stored tokens to keep fresh-install paths zero-cost. Test coverage (tests/tools/test_mcp_oauth_cold_load_expiry.py): - set_tokens persists absolute expires_at - set_tokens skips expires_at when token has no expires_in - get_tokens round-trips expires_at -> remaining expires_in - expired tokens reload with expires_in=0 - legacy files without expires_at fall back to mtime proxy - _initialize seeds token_expiry_time from stored tokens - _initialize flags expired-on-disk tokens as is_token_valid=False - _initialize pre-flights PRM + ASM discovery with mock transport - _initialize skips pre-flight when no tokens are stored Verified against BetterStack MCP: hermes mcp test betterstack -> Connected (2508ms), 83 tools mcp_betterstack_telemetry_list_teams_tool -> real team data, not 'No teams found. Please check your authentication.' Reference: mcp-oauth-token-diagnosis skill, Fix A. * chore: map hermes@noushq.ai to benbarclay in AUTHOR_MAP Needed for CI attribution check on cherry-picked commits from PR #12025. --------- Co-authored-by: Hermes Agent <hermes@noushq.ai>	2026-04-19 16:31:07 -07:00
Teknium	13febe60ca	chore(release): add dodo-reach to AUTHOR_MAP	2026-04-19 11:57:51 -07:00
Teknium	d48d6fadff	test(run_agent): pin proxy-env forwarding through keepalive transport Adds a regression guard for the #11277 → proxy-bypass regression fixed in `42b394c3`. With HTTPS_PROXY / HTTP_PROXY / ALL_PROXY set, the custom httpx transport used for TCP keepalives must still route requests through an HTTPProxy pool; without proxy env, no HTTPProxy mount should exist. Also maps zrc <zhurongcheng@rcrai.com> → heykb in scripts/release.py AUTHOR_MAP so the salvage PR passes the author-attribution CI check.	2026-04-19 11:44:43 -07:00
Teknium	eb247e6c0a	chore: add bingo906 numeric qq email to AUTHOR_MAP Maps 906014227@qq.com → bingo906 for PR #12450 attribution in the weekly release notes.	2026-04-19 11:36:04 -07:00
Teknium	5a23f3291a	fix(model_switch): section 3 base_url/model/dedup follow-up On top of the salvaged PR #12505 (Jason/farion1231, which adds dict-format models: enumeration to both sections), three section-3 refinements from competing PR #11534 (YangManBOBO): - accept base_url as canonical (matches Hermes's writer and custom_providers entries); keep api/url as fallbacks for legacy/hand-edited configs - accept singular model as a default_model synonym, matching custom_providers - add seen_slugs guard so the same provider slug appearing in both providers: dict and custom_providers: list emits exactly one picker row (providers: dict wins since section 3 runs first) Two regression tests cover the new behavior. AUTHOR_MAP entry added for farion1231 so CI doesn't reject the cherry-picked commit.	2026-04-19 11:07:29 -07:00
Teknium	a3b76ae36d	chore(attribution): add AUTHOR_MAP entry for Mibayy Adds the Mibayy noreply email to the AUTHOR_MAP so CI attribution checks pass for the #3884 maps skill feat commit (`7fa01faf`).	2026-04-19 05:19:51 -07:00
kshitijk4poor	4f0e49dc7b	chore: add sgaofen to AUTHOR_MAP	2026-04-19 03:30:03 -07:00
kshitijk4poor	175cf7e6bb	fix: tighten quiet-mode salvage follow-ups Follow-up for the helix4u easy-fix salvage batch: - route remaining context-engine quiet-mode output through _should_emit_quiet_tool_messages() so non-CLI/library callers stay silent consistently - drop the extra senderAliases computation from WhatsApp allowlist-drop logging and remove the now-unused import This keeps the batch scoped to the intended fixes while avoiding leaked quiet-mode output and unnecessary duplicate work in the bridge.	2026-04-19 00:28:25 -07:00
helix4u	3ade655999	fix(whatsapp): log allowlist drops in bridge	2026-04-19 00:28:25 -07:00
kshitijk4poor	0a8d48809f	chore: add LeonSGP43 numeric noreply email to AUTHOR_MAP The cherry-picked commit from #11434 uses the 154585401+ prefixed noreply format. Add it alongside the existing bare entry so the contributor audit passes.	2026-04-18 22:50:55 -07:00
Teknium	41560192c4	chore(attribution): add AUTHOR_MAP entry for nish3451 Adds the nish3451 noreply email to the AUTHOR_MAP so CI attribution checks pass for the #6100 Telegram DM fallback fix merged in `1a9a2d7f`.	2026-04-18 18:52:41 -07:00
Teknium	11ee87e605	chore(attribution): add AUTHOR_MAP entry for kshitijk4poor@gmail.com Covers the non-noreply email used on commit `dd3e6424` (rename of the TouchDesigner skill to touchdesigner-mcp).	2026-04-18 17:43:42 -07:00
LVT382009	f7af90e2da	fix: wire _ephemeral_max_output_tokens into chat_completions and add NVIDIA NIM default Based on #12152 by @LVT382009. Two fixes to run_agent.py: 1. _ephemeral_max_output_tokens consumption in chat_completions path: The error-recovery ephemeral override was only consumed in the anthropic_messages branch of _build_api_kwargs. All chat_completions providers (OpenRouter, NVIDIA NIM, Qwen, Alibaba, custom, etc.) silently ignored it. Now consumed at highest priority, matching the anthropic pattern. 2. NVIDIA NIM max_tokens default (16384): NVIDIA NIM falls back to a very low internal default when max_tokens is omitted, causing models like GLM-4.7 to truncate immediately (thinking tokens exhaust the budget before the response starts). 3. Progressive length-continuation boost: When finish_reason='length' triggers a continuation retry, the output budget now grows progressively (2x base on retry 1, 3x on retry 2, capped at 32768) via _ephemeral_max_output_tokens. Previously the retry loop just re-sent the same token limit on all 3 attempts.	2026-04-18 12:51:30 -07:00
jarvischer	0f778f7768	fix: prevent tool name duplication in streaming accumulator (MiniMax/NVIDIA NIM) Based on #11984 by @maxchernin. Fixes #8259. Some providers (MiniMax M2.7 via NVIDIA NIM) resend the full function name in every streaming chunk instead of only the first. The old accumulator used += which concatenated them into 'read_fileread_file'. Changed to simple assignment (=), matching the OpenAI Node SDK, LiteLLM, and Vercel AI SDK patterns. Function names are atomic identifiers delivered complete — no provider splits them across chunks, so concatenation was never correct semantics.	2026-04-18 12:50:32 -07:00
Teknium	0bebf5b948	chore(attribution): add AUTHOR_MAP entry for Honghua Yang (honghua)	2026-04-18 12:40:56 -07:00

1 2 3 4 5

232 commits