hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-08 03:01:47 +00:00

Author	SHA1	Message	Date
Kshitij	d78c34928f	feat(tui): collapsible sections in startup banner (skills, system prompt, MCP) The TUI SessionPanel banner now uses collapsible ▸/▾ toggle sections matching the existing Chevron convention used for runtime agent details. Skills, system prompt, and MCP server lists are collapsed by default; tools remain expanded as the most actionable info. - tui_gateway/server.py: _session_info() now passes agent._cached_system_prompt through to the TUI frontend - ui-tui/src/types.ts: added system_prompt?: string to SessionInfo - ui-tui/src/components/branding.tsx: rewrote SessionPanel with CollapseToggle helper + per-section useState toggles Default states: tools=open, skills=collapsed, system=collapsed, mcp=collapsed. Clicking any ▸/▾ header toggles that section.	2026-05-06 03:34:00 -07:00
Kshitij Kapoor	3ebdd26449	fix(browser): surface Lightpanda Chrome fallback warnings	2026-05-06 03:23:19 -07:00
kshitijk4poor	395dbcc873	feat(browser): add Lightpanda engine support with automatic Chrome fallback Add Lightpanda as an optional browser engine for local mode. Lightpanda is a headless browser built from scratch in Zig -- faster navigation than Chrome with significantly less memory. One config line to enable: browser: engine: lightpanda New functions in browser_tool.py: - _get_browser_engine() -- config/env reader with validation + caching - _should_inject_engine() -- only inject in local non-cloud mode - _needs_lightpanda_fallback() -- detect empty/failed LP results - _chrome_fallback_screenshot() -- temporary Chrome session for screenshots - Engine injection in _run_browser_command (--engine flag) - browser_vision pre-routes screenshots to Chrome when engine=lightpanda Config: - browser.engine in DEFAULT_CONFIG (auto/lightpanda/chrome) - AGENT_BROWSER_ENGINE in OPTIONAL_ENV_VARS - /browser status shows engine info in local mode Rebased from PR #7144 onto current main. All existing code preserved -- pure additions only (+520/-2). 25 new tests + 81 total browser tests pass (0 failures).	2026-05-06 03:23:19 -07:00
kshitijk4poor	aa88dcc57b	fix: salvage batch — compaction guidance, memory authority, cache eviction after compression - Fix /compact → /compress in context-overflow tips (closes #20020) - Evict cached agent after session hygiene and /compress so system prompt refreshes with current SOUL.md, memory, and skills - Restore memory authority across compaction: change 'informational background data' to 'authoritative reference data' in memory block and SUMMARY_PREFIX, with backward-compatible regex Based on: - PR #20027 by @LeonSGP43 - PR #18767 by @MacroAnarchy - PR #17380 by @vominh1919 PR #17121 boundary marker fix already merged to main (`2eef395e1`). PR #9262 user-message anchoring already on main via _ensure_last_user_message_in_tail().	2026-05-05 22:33:45 -07:00
Teknium	f27fcb6a82	feat(models): add x-ai/grok-4.3 to OpenRouter + Nous Portal curated lists (#20497) Endpoint validated over 6 conversational turns with tool calls (9 API calls, 3 tool calls, 0 failures) and an 8-request burst (8/8 ok, 0 rate limits). Latency ~5-10s/call — slower than grok-4.20 but expected for a reasoning model. - hermes_cli/models.py: add to OPENROUTER_MODELS and _PROVIDER_MODELS['nous'] - website/static/api/model-catalog.json: regenerated	2026-05-05 19:15:10 -07:00
Teknium	477e4a2fe6	feat(models): add deepseek/deepseek-v4-pro to OpenRouter + Nous Portal curated lists (#20495) Endpoint re-tested over 6 conversational turns (9 API calls, 3 tool calls) and an 8-request burst — no rate limits, no errors, ~2-3s latency. The historical rate-limit issues that caused its removal are gone. - hermes_cli/models.py: add to OPENROUTER_MODELS and _PROVIDER_MODELS['nous'] - website/static/api/model-catalog.json: regenerated via build_model_catalog.py	2026-05-05 19:11:58 -07:00
Teknium	e598e18529	docs: document custom model aliases for /model command (#20475) User-defined model aliases (config.yaml model_aliases: and model.aliases.*) have worked since early versions but were entirely undocumented. Add a dedicated 'Custom model aliases' section to slash-commands.md covering both YAML config formats and the 'hermes config set' shell form, mirror a shorter version into the configuring-models 'Alternative methods' section, and cross-link from the two /model table rows. Flagged by @weehowe on Twitter — he wasn't aware the feature existed.	2026-05-05 19:11:20 -07:00
etherman-os	39f451f5ad	fix: add Turkish locale references in config, tests, and docs - hermes_cli/config.py: add tr to supported languages comment - locales/en.yaml: add tr to locale file list comment - tests/agent/test_i18n.py: add Turkish alias tests + explicit lang test - website/docs/user-guide/configuration.md: add tr to supported values	2026-05-05 17:29:12 -07:00
etherman-os	985133852a	feat(i18n): add Turkish (tr) locale - Add locales/tr.yaml with Turkish translations for all approval.* and gateway.* keys - Register 'tr' in SUPPORTED_LANGUAGES - Add Turkish aliases: turkish, türkçe, tr-tr	2026-05-05 17:29:12 -07:00
Teknium	fab3ad9777	chore(release): AUTHOR_MAP entries for suncokret12 and mioimotoai-lgtm	2026-05-05 17:26:15 -07:00
LeonSGP43	a49670c21b	fix(kanban): wire dependency selects	2026-05-05 17:26:15 -07:00
Brecht-H	3f97297413	feat(kanban): surface task_runs.summary on dashboard cards + ``kanban show`` The kanban-worker skill (built into the gateway dispatcher's spawn prompt) instructs every worker to hand off via ``kanban_complete(summary=..., metadata=...)``. That writes the summary onto the closing ``task_runs`` row, NOT onto ``tasks.result`` — the latter is left NULL unless the caller passes ``result=`` explicitly. Result: a glance at the dashboard or ``hermes kanban show <id>`` shows a blank "Result:" section even when the worker did real work, which on 2026-05-05 caused a Mac false-alarm ("Hermes did nothing") on a task that had a 10-line completion summary on its run. This patch surfaces the latest non-null run summary as ``latest_summary`` so the worker's actual handoff lands in front of operators. * New helpers ``kanban_db.latest_summary(conn, task_id)`` and ``kanban_db.latest_summaries(conn, task_ids)``. The batch variant uses a single window-function SELECT so the dashboard board endpoint doesn't pay an N+1 cost on multi-hundred-task boards. * CLI ``hermes kanban show <id>`` prints a "Latest summary:" block when ``tasks.result`` is empty but a run has produced a summary (the existing "Result:" section still wins when populated, so the back-compat path for hand-edited results is untouched). JSON output gains a top-level ``latest_summary`` field. * Dashboard ``/board`` and ``/tasks/{id}`` now include a ``latest_summary`` field on every task. Cards on /board carry a 200-character preview (cheap to render, plenty for "what did this worker do?" at a glance); the drawer/detail endpoint returns the full summary. * Five new tests cover: empty-runs case, post-complete surface, newest-of-multiple selection, empty-string skip, batch with missing tasks + empty input. 
Smoke-tested locally against the live profile DB on the three acceptance-criterion targets (t_f08fef91 cron-hygiene-audit, t_007b7f1c EMA-analysis, t_05746fa4 self-assessment) — all three now return their populated summaries via both ``latest_summary`` and ``latest_summaries``. Test plan: 255/255 kanban tests pass + 91/91 dashboard plugin tests pass. No regression on tasks where ``tasks.result`` is explicitly populated (the existing "Result:" branch is preserved). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 17:26:15 -07:00
daixin1204	d2c6eceed9	fix(kanban): prevent child task dispatch when parent is not done Add parent dependency guard to _set_status_direct so dragging a task to the ready column is rejected (409) when its parents are not all done. Previously the guard only existed in recompute_ready, allowing direct status writes via the dashboard API to bypass the dependency engine. Root cause: after reclaiming stale workers, both T3 and T4 were set to ready via dashboard status writes in quick succession, causing the writer to be spawned while the analyst was blocked — upstream work wasn't done yet.	2026-05-05 17:26:15 -07:00
Teknium	8a1a42d098	test(kanban): backdate task_runs.started_at alongside tasks.started_at After #19473 landed (enforce_max_runtime reads from task_runs.started_at rather than tasks.started_at), a regression test added earlier still only backdated the tasks column. Backdate both so the test is robust regardless of which column the enforcer reads from.	2026-05-05 17:26:15 -07:00
澪 / Mio	b28ab4fc3f	fix(kanban): measure max runtime from current run	2026-05-05 17:26:15 -07:00
LeonSGP43	6d302b340e	fix(kanban): accept created_cards linked as child of completing task Widens _verify_created_cards to also accept ids that are children of the completing task in task_links. Previously we only accepted cards where created_by matched the completing task's assignee, which was too strict for legitimate orchestrator flows: a specifier creates a card (so created_by=specifier, not worker), then a worker picks it up and passes parents=[current_task] to kanban_create. The explicit link proves the relationship and should be trusted. Salvaged from #20022 @LeonSGP43 (full PR superseded by #20232 + this patch; the linked-children relaxation was the portable improvement).	2026-05-05 17:26:15 -07:00
suncokret12	eda326df16	fix(doctor): report Kanban worker tools as runtime-gated	2026-05-05 17:26:15 -07:00
Teknium	f0b95cc93d	test(arcee): cover Trinity Large Thinking temperature + compression overrides Salvage follow-up for PR #20344: - AUTHOR_MAP entry for rob-maron (required by CI) - 17 parametrized tests covering _is_arcee_trinity_thinking, _fixed_temperature_for_model Trinity override, and _compression_threshold_for_model, including sibling-model negatives (trinity-large-preview, trinity-mini) and the OpenRouter slug form.	2026-05-05 17:23:45 -07:00
rob-maron	2d4eaed111	arcee temperature + compression	2026-05-05 17:23:45 -07:00
teknium1	735349c679	chore: AUTHOR_MAP entry for olisikh	2026-05-05 17:21:59 -07:00
Oleksii Lisikh	c4b287ba53	feat(i18n): add Ukrainian locale	2026-05-05 17:21:59 -07:00
Miniding	0d41e94ca9	feat(i18n): add French (fr) locale support - Add fr.yaml with French translations for approval prompts and gateway messages - Register 'fr' in SUPPORTED_LANGUAGES - Add French aliases: french, français, fr-fr, fr-be, fr-ca, fr-ch - Update locale sync comment in en.yaml	2026-05-05 15:13:57 -07:00
Teknium	ee8edd4169	chore: AUTHOR_MAP entry for bogerman1	2026-05-05 15:13:36 -07:00
bogerman1	3188e63b05	fix(api_server): SSE token batching + error handling for Open WebUI performance Reduces SSE event rate ~500/turn → ~20/turn via 50ms text-delta batching in _dispatch(), which eliminates markdown re-render storms on Open WebUI. Also: - Trim tool_call.arguments in the response.completed event to 100KB (prevents silent hangs on 848KB+ single-line SSE events). - Catch-all exception handlers in _write_sse_responses() + _write_sse_chat_completion() emit a proper error chunk instead of TransferEncodingError from incomplete chunked encoding when the agent crashes mid-stream. - MAX_REQUEST_BYTES 1MB → 10MB; pass client_max_size to aiohttp Application to avoid silent 400s on truncated request bodies for long conversations. Salvage of #17552 (api_server portion only). The contrib/openwebui-filter/ payload from that PR — Open WebUI Filter Function + benchmark writeup — is a client-side user-installable add-on and doesn't need to live in the repo; dropped here. Closes #17537. Co-authored-by: bogerman1 <93757150+bogerman1@users.noreply.github.com>	2026-05-05 15:13:36 -07:00
Nicolò Boschi	3082fa0829	feat(hindsight): probe API for update_mode='append' support, dedupe across processes Mirrors the pattern already shipping in hindsight-integrations/openclaw: probe `<api_url>/version` once per process, gate on Hindsight ≥ 0.5.0. When supported, retains use a stable session-scoped `document_id` (`session_id`) plus `update_mode='append'` so cross-process retains for the same session merge into one document instead of producing N-different-process-stamped duplicates. When unsupported (or probe fails), fall back to the existing per-process unique `f"{session_id}-{start_ts}"` document_id with no `update_mode` — the resume-overwrite fix (#6654) keeps working unchanged on legacy servers. Closes the dedup half of #20115. The proposed `document_id_strategy` config knob isn't needed: auto-detection via the same /version probe the OpenClaw plugin already uses gives the same outcome with no extra config burden, and the choice is purely a function of what the server can do. Plumbing -------- - Module-level helpers (`_meets_minimum_version`, `_fetch_hindsight_api_version`, `_check_api_supports_update_mode_append`) cache the result per api_url so every provider in the process gets one /version round-trip. - One-time WARN logged when the API is older than 0.5.0, telling the user to upgrade for cross-session deduplication. - New instance helper `_resolve_retain_target(fallback_doc_id)` returns `(document_id, update_mode)` based on cached capability. Wired into `sync_turn` and the `on_session_switch` flush path. - For local_embedded mode, the probe URL is taken from the running client (`client.url`) so we hit the actual daemon port rather than the configured default. - `update_mode` is set on the per-item dict; `aretain_batch` already threads `item['update_mode']` into the API call. Tests ----- - `TestUpdateModeAppendCapability` (5 cases): legacy fallback, modern stable+append, per-url cache, one-time warn, flush-on-switch resolves against the OLD session. 
- Existing `_make_hindsight_provider` factory in the manager-side test file extended to seed `_mode`/`_api_url`/`_api_key`/`_client` and stub `_resolve_retain_target` so the bypass-init pattern keeps working. E2E verified against installed `~/.hermes/hermes-agent`: - Legacy probe (unreachable host) → `legacy-session-<ts>` doc_id, no `update_mode`. - Modern probe (live local_embedded 0.5.6 daemon) → stable `modern-session` doc_id + `update_mode='append'`. - `test_hermes_embedded_smoke.py` passes (90s).	2026-05-05 15:09:59 -07:00
Teknium	1efed67056	chore(release): AUTHOR_MAP entries for momowind and misery-hl	2026-05-05 15:09:28 -07:00
misery-hl	56b4795115	guard kanban worker lifecycle by run id	2026-05-05 15:09:28 -07:00
Moonyeah	f0d278412f	feat(gateway): respect kanban.max_spawn config to limit concurrent tasks The dispatch_once function already accepts a max_spawn parameter but the gateway was calling it without passing any value, effectively ignoring the configuration. This change reads kanban.max_spawn from config.yaml and passes it through, allowing users to limit concurrent kanban tasks. This prevents resource exhaustion scenarios where kanban dispatcher spawns too many parallel workers on constrained hardware.	2026-05-05 15:09:28 -07:00
0xVox	0b9cbc8b23	test(kanban): cover metadata handoff round-trip	2026-05-05 15:09:28 -07:00
Teknium	50ab0a85a7	chore: AUTHOR_MAP entry for formulahendry	2026-05-05 14:16:30 -07:00
Jun Han	0d945d1541	docs: update VS Code setup instructions for ACP Client integration	2026-05-05 14:16:30 -07:00
Teknium	f97d022149	chore: AUTHOR_MAP entry for zhanggttry	2026-05-05 14:15:05 -07:00
zhangguangtao	05cdcac362	docs: add Chinese (zh-CN) README translation Closes #12954 - Add README.zh-CN.md with complete Simplified Chinese translation - Add language switcher badge in README.md linking to Chinese version - Add language switcher badge in README.zh-CN.md linking to English version	2026-05-05 14:15:05 -07:00
haidao1919	74e4f5f97a	docs(i18n): add zh-Hans Tool Gateway, image gen, and Windows WSL guide Made-with: Cursor	2026-05-05 14:14:03 -07:00
Teknium	a321874ab4	chore: AUTHOR_MAP entry for liu-collab	2026-05-05 14:12:49 -07:00
liuyuqi	a11234dd68	docs(browser): document WSL-to-Windows Chrome MCP bridge	2026-05-05 14:12:49 -07:00
Teknium	a860a1098f	chore: AUTHOR_MAP entry for acesjohnny	2026-05-05 14:12:09 -07:00
Zhen Liu	1c42d8ff53	docs: add Open WebUI bootstrap script	2026-05-05 14:12:09 -07:00
Teknium	92a08c633f	chore: AUTHOR_MAP entry for binhnt92	2026-05-05 14:11:16 -07:00
binhnt92	9a0a4c5831	docs(guides): add guide for running Hermes locally with Ollama Step-by-step guide covering Ollama installation, model selection, Hermes configuration, speed optimization, and optional gateway bot setup — all running on local hardware with zero API cost. Includes hardware requirements, model comparison table with tool-call support status, context window tuning, GPU offloading tips, fallback provider setup, troubleshooting, and cost comparison.	2026-05-05 14:11:16 -07:00
Teknium	1fc8733a69	fix(kanban): unify failure counter across spawn/timeout/crash outcomes (#20410) The dispatcher's circuit breaker only protected against spawn-side failures (profile missing, workspace mount error, exec failure). Workers that successfully spawned but then timed out or crashed re-queued to ``ready`` with no counter increment, so the next tick re-spawned them — loops forever until someone noticed. Reported externally on Twitter (Forbidden Seeds) and confirmed by walking the kernel: ``enforce_max_runtime`` flipped the task back to ready, emitted a ``timed_out`` event, and never touched ``spawn_failures``; same for ``detect_crashed_workers``. Fix: unify the counter across all non-success outcomes. Schema ------ * ``tasks.spawn_failures`` → ``tasks.consecutive_failures`` * ``tasks.last_spawn_error`` → ``tasks.last_failure_error`` * Migration renames the columns in-place on existing DBs (``ALTER TABLE RENAME COLUMN`` — SQLite >= 3.25) so historical counter values are preserved. Row mappers fall through to the legacy names if both column renames and a migration somehow got out of sync. Counter lifecycle ----------------- New helper ``_record_task_failure(conn, task_id, error, *, outcome, release_claim, end_run, event_payload_extra)`` is the single point every non-success outcome funnels through: * ``spawn_failed`` → ``_record_spawn_failure`` (kept as alias) calls it with ``release_claim=True, end_run=True`` — transitions running→ready, clears claim, closes run. * ``timed_out`` → ``enforce_max_runtime`` already does the status transition + run close + event emission, then calls ``_record_task_failure`` with ``release_claim=False, end_run=False`` just to bump the counter (and trip the breaker if needed). * ``crashed`` → ``detect_crashed_workers`` same pattern, but the counter increment runs after the main write_txn closes (SQLite doesn't nest write transactions). 
If the counter hits the breaker threshold (``DEFAULT_FAILURE_LIMIT=5``, same as before), the task transitions to ``blocked`` with a ``gave_up`` event on top of whatever outcome-specific event was already emitted. Reset semantics changed: the counter now clears only on successful ``complete_task`` (and operator ``reclaim_task`` — an explicit "I've looked at this, try again with a fresh budget"). Previously ``_clear_spawn_failures`` ran on every successful spawn, which would have wiped the counter before a timeout could accumulate past threshold — exactly the loop this fix prevents. Diagnostics ----------- * ``_rule_repeated_spawn_failures`` → ``_rule_repeated_failures``. Now fires regardless of which outcome is at fault. Classifies the most recent failure (spawn_failed / timed_out / crashed) from the run history so the title ("Agent timeout x3", "Agent crash x4", "Agent spawn x5") and suggested action (``doctor`` for spawn, ``log`` for timeout/crash) stay outcome-specific without N duplicate rules. * ``_rule_repeated_crashes`` kept as a narrower early-warning at threshold 2 (vs 3 for the unified rule), but now suppresses itself when the unified rule would also fire — avoids double-flagging. * Diagnostic ``data`` payload now carries ``{consecutive_failures, most_recent_outcome, last_error}`` instead of spawn-specific keys. CLI --- * ``Task.consecutive_failures`` / ``Task.last_failure_error`` are the public fields now. Existing callers that referenced the old names get migrated (tests updated in this commit). * Backward-compat: ``DEFAULT_SPAWN_FAILURE_LIMIT``, ``_clear_spawn_failures``, ``_record_spawn_failure`` stay as aliases. Tests ----- * 6 new kernel tests: timeout increments counter, 3 consecutive timeouts trip the breaker (was the reported gap), crash increments counter, reclaim clears counter, completion clears counter, spawn success does NOT clear counter. 
* Diagnostic tests: updated ``repeated_spawn_failures`` cases to use the new kind name and add a timeout-loop test. * Dashboard API test: spawn_failures column update → consecutive_failures. 389/389 kanban-suite tests pass. Live verification ----------------- Seeded 4 tasks in an isolated HERMES_HOME: 3 timeouts, 4 crashes, 2-spawn-failed + 2-timed-out, and a task that had prior failures but completed successfully. Board correctly shows "!! 3 tasks need attention" (the successful one has no badge because the counter reset). Drawer for the timeout-loop task renders "Agent timeout x3" with most_recent_outcome=timed_out and the "Check logs" suggested action (not the spawn-flavoured "Verify profile"). The successful task has zero diagnostics. Closes the Forbidden-Seeds-reported gap.	2026-05-05 13:55:37 -07:00
Teknium	587ef55f2c	chore: AUTHOR_MAP entry for xsfX20	2026-05-05 13:55:21 -07:00
xsfx20	144ba71a33	docs(faq): use messaging extra for gateway deps	2026-05-05 13:55:21 -07:00
Teknium	391e3fff56	chore: AUTHOR_MAP entry for Hypnus-Yuan	2026-05-05 13:54:33 -07:00
Yuan Tao-Wen	39560c948d	docs(voice): add Doubao speech integration examples (TTS + STT)	2026-05-05 13:54:33 -07:00
LeonSGP43	ca8e68822d	docs(codex): clarify OAuth auth prerequisite	2026-05-05 13:53:55 -07:00
LeonSGP43	f13b349b9a	docs: clarify Telegram group chat troubleshooting	2026-05-05 13:53:19 -07:00
Teknium	bb2b129549	chore: AUTHOR_MAP entry for Fearvox	2026-05-05 13:52:46 -07:00
0xVox	5bd75c73ed	docs(kanban): document handoff evidence metadata	2026-05-05 13:52:46 -07:00
Teknium	79902a0278	chore: AUTHOR_MAP entry for counterposition	2026-05-05 13:51:56 -07:00

7396 commits