hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-09 08:21:50 +00:00

History

Teknium c6fd2619f7 fix(gemini-cli): surface MODEL_CAPACITY_EXHAUSTED cleanly + drop retired gemma-4-26b (#11833 ) Google-side 429 Code Assist errors now flow through Hermes' normal rate-limit path (status_code on the exception, Retry-After preserved via error.response) instead of being opaque RuntimeErrors. User sees a one-line capacity message instead of a 500-char JSON dump. Changes - CodeAssistError grows status_code / response / retry_after / details attrs. _extract_status_code in error_classifier picks up status_code and classifies 429 as FailoverReason.rate_limit, so fallback_providers triggers the same way it does for SDK errors. run_agent.py line ~10428 already walks error.response.headers for Retry-After — preserving the response means that path just works. - _gemini_http_error parses the Google error envelope (error.status + error.details[].reason from google.rpc.ErrorInfo, retryDelay from google.rpc.RetryInfo). MODEL_CAPACITY_EXHAUSTED / RESOURCE_EXHAUSTED / 404 model-not-found each produce a human-readable message; unknown shapes fall back to the previous raw-body format. - Drop gemma-4-26b-it from hermes_cli/models.py, hermes_cli/setup.py, and agent/model_metadata.py — Google returned 404 for it today in local repro. Kept gemma-4-31b-it (capacity-constrained but not retired). Validation \| \| Before \| After \| \|---------------------------\|--------------------------------\|-------------------------------------------\| \| Error message \| 'Code Assist returned HTTP 429: {500 chars JSON}' \| 'Gemini capacity exhausted for gemini-2.5-pro (Google-side throttle...)' \| \| status_code on error \| None (opaque RuntimeError) \| 429 \| \| Classifier reason \| unknown (string-match fallback) \| FailoverReason.rate_limit \| \| Retry-After honored \| ignored \| extracted from RetryInfo or header \| \| gemma-4-26b-it picker \| advertised (404s on Google) \| removed \| Unit + E2E tests cover non-streaming 429, streaming 429, 404 model-not-found, Retry-After header fallback, malformed body, and classifier integration. Targeted suites: tests/agent/test_gemini_cloudcode.py (81 tests), full tests/hermes_cli (2203 tests) green. Co-authored-by: teknium1 <teknium@nousresearch.com>		2026-04-17 15:34:12 -07:00
..
__init__.py	chore: release v0.10.0 (2026.4.16) (#11209 )	2026-04-16 12:53:06 -07:00
auth.py	feat(providers): add native NVIDIA NIM provider	2026-04-17 13:47:46 -07:00
auth_commands.py	fix(auth): codex auth remove no longer silently undone by auto-import (#11485 )	2026-04-17 04:10:17 -07:00
backup.py	feat: fix SQLite safety in hermes backup + add --quick snapshots + /snapshot command (#8971 )	2026-04-13 04:46:13 -07:00
banner.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
callbacks.py	fix: ESC cancels secret/sudo prompts, clearer skip messaging (#9902 )	2026-04-14 16:11:37 -07:00
claw.py	fix: unify OpenClaw detection, add isatty guard, fix print_warning import	2026-04-12 16:40:37 -07:00
cli_output.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
clipboard.py	feat(gateway): WSL-aware gateway with smart systemd detection (#7510 )	2026-04-10 21:15:47 -07:00
codex_models.py	fix: add gpt-5.4-mini to Codex fallback catalog (#3855 )	2026-03-29 20:10:00 -07:00
colors.py	feat: respect NO_COLOR env var and TERM=dumb (#4079 )	2026-03-30 17:07:21 -07:00
commands.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
completion.py	fix: preserve profile name completion in dynamic shell completion	2026-04-14 10:45:42 -07:00
config.py	fix(qqbot): add back-compat for env var rename; drop qrcode core dep	2026-04-17 15:31:14 -07:00
copilot_auth.py	fix(copilot): resolve GHE token poisoning when GITHUB_TOKEN is set	2026-04-13 05:12:36 -07:00
cron.py	feat(cron): track delivery failures in job status (#6042 )	2026-04-07 22:49:01 -07:00
curses_ui.py	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
debug.py	fix: bump debug share paste TTL from 1 hour to 6 hours (#11240 )	2026-04-16 14:34:46 -07:00
default_soul.py	fix: reset default SOUL.md to baseline identity text (#3159 )	2026-03-26 01:34:27 -07:00
dingtalk_auth.py	test(dingtalk): cover QR device-flow auth + OpenClaw branding disclosure	2026-04-17 05:08:07 -07:00
doctor.py	fix(providers): complete NVIDIA NIM parity with other providers	2026-04-17 13:47:46 -07:00
dump.py	fix(providers): complete NVIDIA NIM parity with other providers	2026-04-17 13:47:46 -07:00
env_loader.py	fix: detect and strip non-ASCII characters from API keys (#6843 )	2026-04-14 20:20:31 -07:00
gateway.py	refactor(qqbot): change qrcode style	2026-04-17 15:31:14 -07:00
logs.py	feat: component-separated logging with session context and filtering (#7991 )	2026-04-11 17:23:36 -07:00
main.py	fix(providers): complete NVIDIA NIM parity with other providers	2026-04-17 13:47:46 -07:00
mcp_config.py	fix(mcp): consolidate OAuth handling, pick up external token refreshes (#11383 )	2026-04-16 21:57:10 -07:00
memory_setup.py	fix(memory): discover user-installed memory providers from $HERMES_HOME/plugins/ (#10529 )	2026-04-15 14:25:40 -07:00
model_normalize.py	fix(copilot): normalize vendor-prefixed and dash-notation model IDs (#6879 ) (#11561 )	2026-04-17 04:19:36 -07:00
model_switch.py	fix(opencode): strip /v1 from base_url on mid-session /model switch to Anthropic-routed models (#11286 )	2026-04-16 19:41:41 -07:00
models.py	fix(gemini-cli): surface MODEL_CAPACITY_EXHAUSTED cleanly + drop retired gemma-4-26b (#11833 )	2026-04-17 15:34:12 -07:00
nous_subscription.py	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
pairing.py	chore: fix 154 f-strings, simplify getattr/URL patterns, remove dead code (#3119 )	2026-03-25 19:47:58 -07:00
platforms.py	feat(gateway): unify QQBot branding, add PLATFORM_HINTS, fix streaming, restore missing setup functions	2026-04-14 00:11:49 -07:00
plugins.py	feat(plugins): add dispatch_tool() to PluginContext (#10763 )	2026-04-15 22:23:01 -07:00
plugins_cmd.py	fix: no auto-activation + unified hermes plugins UI with provider categories	2026-04-10 19:15:50 -07:00
profiles.py	fix: improve profile creation UX — seed SOUL.md + credential warning (#8553 )	2026-04-12 12:22:34 -07:00
providers.py	feat(providers): add native NVIDIA NIM provider	2026-04-17 13:47:46 -07:00
runtime_provider.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
setup.py	fix(gemini-cli): surface MODEL_CAPACITY_EXHAUSTED cleanly + drop retired gemma-4-26b (#11833 )	2026-04-17 15:34:12 -07:00
skills_config.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
skills_hub.py	feat(skills): add 'hermes skills reset' to un-stick bundled skills (#11468 )	2026-04-17 00:41:31 -07:00
skin_engine.py	fix(cli): handle null/non-dict display config in skin initialization	2026-04-16 06:35:31 -07:00
status.py	fix(qqbot): add back-compat for env var rename; drop qrcode core dep	2026-04-17 15:31:14 -07:00
tips.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
tools_config.py	test(dingtalk): cover get_connected_platforms + null platform_toolsets	2026-04-17 06:26:18 -07:00
uninstall.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
web_server.py	dashboard: show GATEWAY_HEALTH_URL instead of PID for remote gateways	2026-04-16 16:48:14 -07:00
webhook.py	refactor: replace inline HERMES_HOME re-implementations with get_hermes_home()	2026-04-07 10:40:34 -07:00