hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	c5511bbc5a	fix: leading ./ thingy	2026-04-09 16:27:06 -05:00
Brooklyn Nicholson	b7d4ea1550	feat: better hyperlink formatting	2026-04-09 15:13:43 -05:00
Ari Lotter	74241328f0	direnv: watch lockfiles/nix files; gitignore .nix-stamps	2026-04-09 15:50:24 -04:00
Ari Lotter	df5874c119	nix: add bundled TUI build-time verification check	2026-04-09 15:50:24 -04:00
Ari Lotter	21afb3fa3c	nix: delegate devShell setup to package passthru hooks - use inputsFrom to inherit build inputs from packages - concat passthru.devShellHook from each package	2026-04-09 15:50:24 -04:00
Ari Lotter	31b2c12f0f	nix: bundle TUI in main package with passthru hooks - build tui.nix, copy to $out/ui-tui/ (same layout as dev) - set HERMES_TUI_DIR, HERMES_PYTHON in wrapper - add passthru.devShellHook with stamp-checked venv setup - expose tui as separate package output	2026-04-09 15:50:24 -04:00
Ari Lotter	405c1b4e84	nix: add TUI derivation with buildNpmPackage - fetchNpmDeps for reproducibilty - compile ts to js - passthru.devShellHook for dev shell stamp-checked auto dep install	2026-04-09 15:50:24 -04:00
Ari Lotter	5ff96551d5	cli: support bundled TUI at HERMES_TUI_DIR (for nix) - Fix cwd to use bundled TUI dir, not PROJECT_ROOT - Set HERMES_ROOT from env with cwd fallback	2026-04-09 15:50:24 -04:00
Ari Lotter	2b4272ef5b	ui-tui: update package-lock.json	2026-04-09 15:35:54 -04:00
Ari Lotter	670dcea8f4	ui-tui: add tsc build pipeline - Switch tsconfig to nodenext module resolution for Node 22 (used by installer script) - Add shebang to entry.tsx, preserved into index.js - Add HERMES_ROOT env var fallback for repo root resolution	2026-04-09 15:35:29 -04:00
Brooklyn Nicholson	17f13013eb	chore: fmt	2026-04-09 14:17:45 -05:00
Brooklyn Nicholson	00e1d42b9e	feat: image pasting	2026-04-09 13:45:23 -05:00
Brooklyn Nicholson	b2ea9b4176	Merge branch 'main' of github.com:NousResearch/hermes-agent into feat/ink-refactor	2026-04-09 12:31:20 -05:00
Brooklyn Nicholson	0d7c19a42f	fix(ui-tui): ref-based input buffer, gateway listener stability, usage display, and 6 correctness bugs	2026-04-09 12:21:24 -05:00
ethernet	637ad443bf	nix: add tirith to runtime deps (#6721 )	2026-04-09 22:28:00 +05:30
Devorun	a8b85bb887	fix(nix): make setupSecrets activation script optional (#6227 ) (#6261 )	2026-04-09 22:09:20 +05:30
Sergei Korolev	d9753720f3	fix(nix): switch nixpkgs input from nixos-24.11 to nixos-unstable (#5520 ) * fix(nix): switch nixpkgs input from nixos-24.11 to nixos-unstable nixos-24.11 reached EOL on 2025-06-30. For a dev tool, tracking a frozen release branch causes dependency versions to go stale. nixos-unstable provides rolling updates and is the conventional choice for development packages. * docs(website): update nix flake example --------- Co-authored-by: sk <sk@mercury>	2026-04-09 21:30:38 +05:30
Dilek	dbc11abcb6	fix(ci): pin floating GitHub Actions tags and ascii-guard to explicit versions (#3982 ) * fix(ci): pin floating GitHub Actions tags and ascii-guard to explicit versions Actions pinned to @main pull whatever is at that ref at execution time, so a compromised upstream org could execute arbitrary code in CI. - Pin DeterminateSystems/nix-installer-action to commit SHA (v22) - Pin DeterminateSystems/magic-nix-cache-action to commit SHA (v13) - Pin ascii-guard to 2.3.0 in docs-site-checks workflow SHA comments include the version tag for human readability; Renovate or Dependabot can keep these updated automatically. * Add skill metadata extraction step in workflow Add step to extract skill metadata for dashboard in CI workflow. --------- Co-authored-by: Siddharth Balyan <52913345+alt-glitch@users.noreply.github.com>	2026-04-09 21:27:20 +05:30
Teknium	268ee6bdce	fix: add turn-exit diagnostic logging to agent loop (#6549 ) Every turn now logs WHY the agent loop ended to agent.log with a structured INFO line capturing: exit reason, model, api_calls/max, budget usage, tool turn count, last message role, response length, and session ID. When the last message is a tool result and the turn was NOT interrupted, emits WARNING level (visible in errors.log) — this is the 'just stops' scenario users report where a tool call completes but no continuation or final response follows. 10 tracked exit reasons: text_response, interrupted_by_user, interrupted_during_api_call, budget_exhausted, max_iterations_reached, all_retries_exhausted_no_response, fallback_prior_turn_content, empty_response_exhausted, error_near_max_iterations, unknown.	2026-04-09 04:15:22 -07:00
Teknium	173289b64f	docs: add hermes dump and hermes logs to CLI commands reference (#6552 ) Documents both debugging commands with full option tables, examples, and usage guidance. Adds both to the top-level commands table and as detailed sections with subsections for log files, filtering behavior, and log rotation.	2026-04-09 04:11:03 -07:00
Teknium	1a3ae6ac6e	feat: structured API error classification for smart failover (#6514 ) Add agent/error_classifier.py with a priority-ordered classification pipeline that replaces scattered inline string-matching in the retry loop with structured error taxonomy and recovery hints. FailoverReason enum (14 categories): auth, auth_permanent, billing, rate_limit, overloaded, server_error, timeout, context_overflow, payload_too_large, model_not_found, format_error, thinking_signature, long_context_tier, unknown. ClassifiedError dataclass carries reason + recovery action hints (retryable, should_compress, should_rotate_credential, should_fallback). Key improvements over inline matching: - 402 disambiguation: 'insufficient credits' = billing (immediate rotate), 'usage limit, try again' = rate_limit (backoff first) - OpenRouter 403 'key limit exceeded' correctly classified as billing - Error cause chain walking (walks __cause__/__context__ up to 5 levels) - Body message included in pattern matching (SDK str() misses it) - Server disconnect + large session check ordered before generic transport catch so RemoteProtocolError triggers compression when appropriate - Chinese error message support for context overflow run_agent.py: replaced 6 inline detection blocks with classifier calls, net -55 lines. All recovery actions (pool rotation, fallback activation, compression, transport recovery) unchanged. 65 new unit tests + 10 E2E tests + live tests with real SDK error objects. Inspired by OpenClaw's failover error classification system.	2026-04-09 04:10:11 -07:00
Teknium	78e6b06518	feat: add 'hermes dump' command for copy-pasteable setup summary (#6550 ) Adds a new CLI command that outputs a compact, plain-text dump of the user's Hermes setup — version, OS, model/provider, API key presence, toolsets, gateway status, platforms, cron jobs, skills, and any non-default config overrides. Designed for support context: no ANSI colors, ready to paste into Discord/GitHub/Telegram. Secrets shown as 'set/not set' by default; --show-keys reveals redacted prefixes (first/last 4 chars). Files: - hermes_cli/dump.py (new) — run_dump() implementation - hermes_cli/main.py — parser + cmd_dump wiring - hermes_cli/profiles.py — shell completions + subcommand set	2026-04-09 04:00:41 -07:00
Teknium	b650957b40	docs(bluebubbles): fix pairing instructions to use existing approve flow (#6548 ) The docs incorrectly referenced 'hermes pairing generate bluebubbles' which doesn't exist. The existing reactive pairing flow already handles this — when an unknown user messages the bot, it sends them a code automatically, and the owner approves with 'hermes pairing approve'.	2026-04-09 03:57:11 -07:00
Teknium	ad06bfccf0	fix: remove dead LLM_MODEL env var — add migration to clear stale .env entries (#6543 ) The old setup wizard (pre-March 2026) wrote LLM_MODEL to ~/.hermes/.env across 12 provider flows. Commit `9302690e` removed the writes but never cleaned up existing .env files, leaving a dead variable that: - Nothing in the codebase reads (zero os.getenv calls) - The docs incorrectly claimed the gateway still used as fallback - Caused user confusion when debugging model resolution issues Changes: - config.py: Bump _config_version 12 → 13, add migration to clear LLM_MODEL and OPENAI_MODEL from .env (both dead since March 2026) - environment-variables.md: Remove LLM_MODEL row, fix HERMES_MODEL description to stop referencing it - providers.md: Update deprecation notice from 'deprecated' to 'removed'	2026-04-09 03:56:40 -07:00
Teknium	8dfc96dbbb	feat: capture provider rate limit headers and show in /usage (#6541 ) Parse x-ratelimit-* headers from inference API responses (Nous Portal, OpenRouter, OpenAI-compatible) and display them in the /usage command. - New agent/rate_limit_tracker.py: parse 12 rate limit headers (RPM/RPH/ TPM/TPH limits, remaining, reset timers), format as progress bars (CLI) or compact one-liner (gateway) - Hook into streaming path in run_agent.py: stream.response.headers is available on the OpenAI SDK Stream object before chunks are consumed - CLI /usage: appends rate limit section with progress bars + warnings when any bucket exceeds 80% - Gateway /usage: appends compact rate limit summary - 24 unit tests covering parsing, formatting, edge cases Headers captured per response: x-ratelimit-{limit,remaining,reset}-{requests,tokens}{,-1h} Example CLI display: Nous Rate Limits (captured just now): Requests/min [░░░░░░░░░░░░░░░░░░░░] 0.1% 1/800 used (799 left, resets in 59s) Tokens/hr [░░░░░░░░░░░░░░░░░░░░] 0.0% 49/336.0M (336.0M left, resets in 52m)	2026-04-09 03:43:14 -07:00
konsisumer	3c8ec7037c	fix(agent): catch PermissionError in subdirectory hint discovery Wrap is_dir() in _is_valid_subdir() and is_file() in _load_hints_for_directory() with OSError handlers so that inaccessible directories (e.g. /root from a non-root Daytona host user) are silently skipped instead of crashing the agent. The existing PermissionError PRs for prompt_builder.py (#6247, #6321, #6355) do not cover subdirectory_hints.py, which was identified as a separate crash path in the #6214 comments. Ref: #6214	2026-04-09 03:10:30 -07:00
Kira	161c2c4da4	fix(skills): archive OpenClaw cron store without config	2026-04-09 03:06:11 -07:00
Lumen Radley	e22416dd9b	fix: handle empty sudo password and false prompts	2026-04-09 02:50:07 -07:00
Teknium	a94099908a	fix(state): orphan children instead of cascade-deleting in prune/delete (#6513 ) prune_sessions and delete_session only handled direct children when satisfying the parent_session_id FK constraint. Multi-level chains (A -> B -> C) caused IntegrityError because deleting B while C still referenced it was blocked by the FK. Fix: NULL out parent_session_id for any session whose parent is about to be deleted. This orphans children instead of cascade-deleting them, which also respects the prune retention window — newer child sessions are no longer deleted just because an ancestor is old. Reported by Aaryan2304 in PR #6463.	2026-04-09 02:41:56 -07:00
cokemine	851857e413	fix(models): correct probed_url selection logic Updated the logic for determining the probed_url in the probe_api_models function to use the first tried URL instead of the last. This change ensures that the most relevant URL is returned when probing for models. Additionally, improved the output message in the _model_flow_custom function to provide clearer guidance based on the suggested_base_url.	2026-04-09 02:38:09 -07:00
Teknium	b408379e9d	fix: reduce credential exhaustion TTL from 24 hours to 1 hour (#6504 ) The 24-hour default cooldown for 402-exhausted credentials was far too aggressive — if a user tops up credits or the 402 was caused by an oversized max_tokens request rather than true billing exhaustion, they shouldn't have to wait a full day. Reduce to 1 hour (matching the existing 429 TTL). Inspired by PR #6493 (michalkomar).	2026-04-09 02:37:23 -07:00
Kira	e1b0b135cb	fix(discord): accept .log attachments and raise document size limit	2026-04-09 02:26:33 -07:00
Teknium	1eabbe905e	fix: retry 3 times when model returns truly empty response (#6488 ) When a model returns no content, no structured reasoning, and no tool calls (common with open models), the agent now silently retries up to 3 times before falling through to (empty). Silent retry (no synthetic messages) keeps the conversation history clean, preserves prompt caching, and respects the no-synthetic-user- injection invariant. Most empty responses from open models are transient (provider hiccups, rate limits, sampling flukes) so a simple retry is sufficient. This fills the last gap in the empty-response recovery chain: 1. _last_content_with_tools fallback (prior tool turn had content) 2. Thinking-only prefill continuation (#5931 — structured reasoning) 3. Empty response silent retry (NEW — truly empty, no reasoning) 4. (empty) terminal (last resort after all retries exhausted) Inline <think> blocks are excluded — the model chose to reason, it just produced no visible text. That differs from truly empty. Tests: - Updated test_truly_empty to expect 4 API calls (1 + 3 retries) - Added test_truly_empty_response_succeeds_on_nudge	2026-04-09 02:06:12 -07:00
Teknium	b962801f6a	fix(bluebubbles): add setup wizard integration and OPTIONAL_ENV_VARS (#6494 ) The BlueBubbles adapter was merged but missing setup wizard support: - Add _setup_bluebubbles() guided setup (server URL, password, allowlist, home channel, webhook port) - Add to _GATEWAY_PLATFORMS registry so it appears in 'hermes setup gateway' - Add to any_messaging check and home channel missing warning - Add to gateway status display in 'hermes setup' - Add BLUEBUBBLES_SERVER_URL, BLUEBUBBLES_PASSWORD, BLUEBUBBLES_ALLOWED_USERS to OPTIONAL_ENV_VARS with descriptions and categories Previously the only way to configure BlueBubbles was manually editing .env.	2026-04-09 02:05:41 -07:00
Cherif Yaya	5cf4fac2aa	fix: restore codex fallback auth-store lookup	2026-04-09 01:56:10 -07:00
Hunter B	894e8c8a8f	fix: resolve opencode.ai context window to 1M and clean up display formatting Two issues resolved: 1. Add opencode.ai to _URL_TO_PROVIDER mapping so base_url routes through models.dev lookup (which has mimo-v2-pro at 1M context) instead of falling back to probing /models (404) and defaulting to 128K. 2. Fix _format_context_length to round cleanly: 1048576 → '1M' instead of '1.048576M'. Applies same rounding logic to K values.	2026-04-09 01:43:22 -07:00
Teknium	18140199c3	fix(ci): build and push multi-arch Docker image (amd64 + arm64) (#6124 ) Add QEMU cross-compilation and multi-arch manifest support so Apple Silicon (M1/M2/M3) and other ARM-based systems get native images. - Add docker/setup-qemu-action for arm64 emulation on amd64 runners - Smoke test stays amd64-only (load:true can't export multi-arch) - Both push steps (main + release) now build linux/amd64,linux/arm64 - Bump timeout 30->60min for QEMU cross-compilation overhead - Add permissions: contents: read (least-privilege hardening) Salvaged from PR #3998 by Mibayy. Also addresses #5005 and #3913. Co-authored-by: Mibayy <Mibayy@users.noreply.github.com>	2026-04-09 00:29:45 -07:00
Teknium	7120d6cdd6	fix(bluebubbles): add missing integration points and documentation (#6460 ) - hermes_cli/skills_config.py: add platform label for per-platform skill config - gateway/session.py: add to PII-safe platforms (no mention system) - website/docs/user-guide/messaging/bluebubbles.md: full setup guide - website/sidebars.ts: sidebar navigation entry - 10 docs pages: add BlueBubbles to all platform enumerations (env vars, toolsets, cron delivery, gateway internals, etc.)	2026-04-09 00:19:05 -07:00
Teknium	d40264d53b	test: add coverage for token-budget tail protection Tests for the new behavior paths: - Large tool outputs no longer block compaction (motivating scenario) - Hard minimum of 3 tail messages always protected - 1.5x soft ceiling for oversized messages - Small conversations still compress (min 8 messages) - Token-budget prune path in _prune_old_tool_results - Fallback to message-count when no token budget	2026-04-08 23:54:23 -07:00
BongSuCHOI	c506126123	fix(tests): update context_compressor tests for min_tail=3 PR #6240 changed tail protection from protect_last_n to min(3, ...) which increased the minimum compressible message count and shifted tail boundaries. Three tests broke: - test_summary_role_avoids_consecutive_user_messages: 6→8 msgs - test_double_collision_user_head_assistant_tail: 7→8 msgs - test_no_collision_scenarios_still_work: 6→8 msgs All tests now exceed the new min_for_compress threshold (6) and maintain proper role alternation in both head and tail sections.	2026-04-08 23:54:23 -07:00
BongSuCHOI	d12f8db0b8	fix(compaction): token-budget primary tail protection Tail protection was effectively message-count based despite having a token budget, because protect_last_n=20 acted as a hard floor. A single 50K-token tool output would cause all 20 recent messages to be preserved regardless of budget, leaving little room for summarization. Changes: - _find_tail_cut_by_tokens: min_tail reduced from protect_last_n (20) to 3; token budget is now the primary criterion - Soft ceiling at 1.5x budget to avoid cutting mid-oversized-message - _prune_old_tool_results: accepts optional protect_tail_tokens so pruning also respects the token budget instead of a fixed count - compress() minimum message check relaxed from protect_first_n + protect_last_n + 1 to protect_first_n + 3 + 1 - Tool group alignment (no splitting tool_call/result) preserved	2026-04-08 23:54:23 -07:00
Nicolò Boschi	25757d631b	feat(hindsight): feature parity, setup wizard, and config improvements Port missing features from the hindsight-hermes external integration package into the native plugin. Only touches plugin files — no core changes. Features: - Tags on retain/recall (tags, recall_tags, recall_tags_match) - Recall config (recall_max_tokens, recall_max_input_chars, recall_types, recall_prompt_preamble) - Retain controls (retain_every_n_turns, auto_retain, auto_recall, retain_async via aretain_batch, retain_context) - Bank config via Banks API (bank_mission, bank_retain_mission) - Structured JSON retain with per-message timestamps - Full session accumulation with document_id for dedup - Custom post_setup() wizard with curses picker - Mode-aware dep install (hindsight-client for cloud, hindsight-all for local) - local_external mode and openai_compatible LLM provider - OpenRouter support with auto base URL - Auto-upgrade of hindsight-client to >=0.4.22 on session start - Comprehensive debug logging across all operations - 46 unit tests - Updated README and website docs	2026-04-08 23:54:15 -07:00
Teknium	d97f6cec7f	feat(gateway): add BlueBubbles iMessage platform adapter (#6437 ) Adds Apple iMessage as a gateway platform via BlueBubbles macOS server. Architecture: - Webhook-based inbound (event-driven, no polling/dedup needed) - Email/phone → chat GUID resolution for user-friendly addressing - Private API safety (checks helper_connected before tapback/typing) - Inbound attachment downloading (images, audio, documents cached locally) - Markdown stripping for clean iMessage delivery - Smart progress suppression for platforms without message editing Based on PR #5869 by @benjaminsehl (webhook architecture, GUID resolution, Private API safety, progress suppression) with inbound attachment downloading from PR #4588 by @1960697431 (attachment cache routing). Integration points: Platform enum, env config, adapter factory, auth maps, cron delivery, send_message routing, channel directory, platform hints, toolset definition, setup wizard, status display. 27 tests covering config, adapter, webhook parsing, GUID resolution, attachment download routing, toolset consistency, and prompt hints.	2026-04-08 23:54:03 -07:00
Teknium	241bd4fc7e	fix: add size cap to assistant thread metadata cache Prevents unbounded memory growth in _assistant_threads dict. Evicts oldest entries when exceeding _ASSISTANT_THREADS_MAX (5000), matching the pattern used by _mentioned_threads and _seen_messages.	2026-04-08 23:53:50 -07:00
helix4u	30a0fcaec8	fix(slack): handle assistant thread lifecycle events	2026-04-08 23:53:50 -07:00
Teknium	5449c01d26	fix: clean env vars in pairing regression test The test_non_internal_event_without_user_triggers_pairing test relied on no Discord auth env vars being set, but gateway/run.py loads dotenv at module level. In environments with DISCORD_ALLOW_ALL_USERS=True in .env, the auth check passed instead of triggering the pairing flow. Clear DISCORD_ALLOW_ALL_USERS, DISCORD_ALLOWED_USERS, GATEWAY_ALLOW_ALL_USERS, and GATEWAY_ALLOWED_USERS via monkeypatch to ensure test isolation.	2026-04-08 23:01:04 -07:00
xingkongliang	1d8d4f28ae	fix(gateway): prevent background process notifications from triggering false pairing requests When a background process with notify_on_complete=True finishes, the gateway injects a synthetic MessageEvent to notify the session. This event was constructed without user_id, causing _is_user_authorized() to reject it and — for DM-origin sessions — trigger the pairing flow, sending "Hi~ I don't recognize you yet!" with a pairing code to the chat owner. Add an `internal` flag to MessageEvent that bypasses authorization checks for system-generated synthetic events. Only the process watcher sets this flag; no external/adapter code path can produce it. Includes 4 regression tests covering the fix and the normal pairing path.	2026-04-08 23:01:04 -07:00
Brooklyn Nicholson	8755b9dfc0	fix: resizing etc	2026-04-09 00:46:35 -05:00
Brooklyn Nicholson	54bd25ff4a	fix(tui): -c resume, ctrl z, pasting updates, exit summary, session fix	2026-04-09 00:36:53 -05:00
Brooklyn Nicholson	b66550ed08	fix(tui): stabilize multiline input, persist tool traces, and port CLI-style context status bar	2026-04-08 23:59:56 -05:00

1 2 3 4 5 ...

3614 commits