hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-19 15:18:03 +00:00

Author	SHA1	Message	Date
Teknium	a8b689f0c2	test(kanban): regression for status=running rejection at dashboard PATCH Reporter of #19535 explicitly asked for a regression test — covers it here so a future refactor of _set_status_direct can't silently re-enable the direct ready/todo -> running bypass. Asserts both: (a) HTTP 400 with 'running' in the detail message, and (b) the task's status is unchanged after the rejected PATCH (pre-request status preserved, no partial mutation).	2026-05-04 04:46:47 -07:00
luyao618	6b3efcee49	fix(kanban): reject direct status transition to 'running' via dashboard API The PATCH /tasks/:id endpoint allows setting status='running' via _set_status_direct(), bypassing the dispatcher/claim path that creates run rows, claim locks, expiry, and worker process metadata. This can leave tasks stuck in 'running' with no active worker. Fix: reject status='running' with HTTP 400, requiring all transitions to 'running' to go through the canonical claim_task() path. Closes #19535	2026-05-04 04:46:47 -07:00
vominh1919	652f8e6f3e	fix(test): correct _coerce_number inf/nan test assertions The test 'test_inf_stays_string_for_integer_only' incorrectly asserted that _coerce_number('inf') returns float('inf'), but the function correctly returns the original string 'inf' because infinity is not JSON-serializable. Fixed the assertion to expect the string 'inf', and added two new tests for negative infinity and NaN edge cases to improve coverage of the non-JSON-serializable number guard in _coerce_number().	2026-05-04 04:45:55 -07:00
Yoimex	edf9c75621	fix(env): pass -- to cd for hyphen-prefixed workdirs	2026-05-04 04:45:03 -07:00
Teknium	ae40fca955	fix(profiles): keep validate_profile_name strict; callers normalize first Follow-up to @changchun989's cherry-pick: reverts the validate-via- normalize change so validate_profile_name remains a strict regex check on the input AS-GIVEN. Callers that accept mixed-case user input (dashboard UI, CLI args, import flows) call normalize_profile_name() first, then validate the result. This keeps validate honest about what the on-disk directory name must look like — e.g. ' jules ' (trailing whitespace) is now rejected instead of silently trimmed and accepted. - validate_profile_name: strict lowercase/regex check again, 'UPPER' back in the invalid-names parametrize - 8 call sites in profiles.py (create_profile, delete_profile, set_active_profile, export_profile, import_profile, rename_profile, resolve_profile_env, plus the clone_from branch): swap the normalize-then-validate order - scripts/release.py: add changchun989@proton.me -> changchun989 to AUTHOR_MAP so CI doesn't block on the unmapped contributor email All kanban + profile tests pass (268 across test_profiles.py + test_kanban_db.py + test_kanban_core_functionality.py, plus 73 in test_kanban_tools.py + test_kanban_dashboard_plugin.py). Closes #18498.	2026-05-04 04:44:37 -07:00
changchun989	a31477dabb	fix(profiles): normalize profile IDs for Kanban assignees and lookups - Add normalize_profile_name() for lowercase canonical IDs and Default alias - Use canonical names in create/delete/rename/export/import/set_active paths - Canonicalize Kanban assignee on create/assign, list filter, and worker spawn - Tests for mixed-case assignees and profile resolution (fixes #18498)	2026-05-04 04:44:37 -07:00
Yuyang Xu	60c4bc96fd	fix(security): restore .env/auth.json/state.db with 0600 perms `hermes import` was creating secret files with the process umask (typically 0644) instead of 0600. zipfile.open() does not honor the Unix mode bits stored in zip member external_attr; the restore loop used open(target, "wb") which always falls back to umask. Threat: silent privilege downgrade after a routine restore on multi-user systems (shared dev boxes, CI runners, jump hosts) — any local user could read API keys and OAuth tokens from ~/.hermes/. Fix mirrors the convention already used at file creation (hermes_cli/auth.py: stat.S_IRUSR \| stat.S_IWUSR for auth.json). The quick-snapshot restore path (restore_quick_snapshot) is unaffected — it uses shutil.copy2 which preserves perms via copystat(). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 04:43:53 -07:00
MichaelWDanko	da8654bb41	fix(dashboard): show custom theme palette swatches	2026-05-04 04:43:27 -07:00
Cameron Aragon	239ea1bdea	fix(image-gen): preserve xAI API error status	2026-05-04 04:43:07 -07:00
atongrun	75b4a34670	fix(cli): check updates against upstream/main for fork users	2026-05-04 04:42:44 -07:00
Teknium	5ec6baa400	feat(kanban): multi-project boards — one install, many kanbans (#19653 ) Adds first-class board support to kanban so users can separate unrelated streams of work (projects, repos, domains) into isolated queues. Single- project users stay on the 'default' board and see no UI change. Isolation model --------------- - Each board is a directory at `~/.hermes/kanban/boards/<slug>/` with its own `kanban.db`, `workspaces/`, and `logs/`. The 'default' board keeps its legacy path (`~/.hermes/kanban.db`) for back-compat — fresh installs and pre-boards users get zero migration. - Workers spawned by the dispatcher have `HERMES_KANBAN_BOARD` pinned in their env alongside the existing `HERMES_KANBAN_DB` / `HERMES_KANBAN_WORKSPACES_ROOT` pins, so workers physically cannot see other boards' tasks. - The gateway's single dispatcher loop now sweeps every board per tick; per-tick cost is a few extra filesystem stats. - CAS concurrency guarantees are preserved per-board (each board is its own SQLite DB, same WAL+IMMEDIATE machinery as before). CLI --- hermes kanban boards list\|create\|switch\|show\|rename\|rm hermes kanban --board <slug> <any-subcommand> Board resolution order: `--board` flag → `HERMES_KANBAN_BOARD` env → `~/.hermes/kanban/current` file → `default`. Slug validation is strict: lowercase alphanumerics + hyphens + underscores, 1-64 chars, starts with alphanumeric. Uppercase is auto-downcased; slashes / dots / `..` / control chars are rejected so boards can't name their way out of the boards/ directory. Passive discoverability: when more than one board exists, `hermes kanban list` prints a one-line header ("Board: foo (2 other boards …)") so users who stumble across multi-project never have to hunt for the feature. Invisible for single-board installs. Dashboard --------- - New `BoardSwitcher` component at the top of the Kanban tab: dropdown with all boards + task counts, `+ New board` button, `Archive` button (non-default only). Hidden entirely when only `default` exists and is empty — single-project users never see it. - New `NewBoardDialog` modal: slug / display name / description / icon + "switch to this board after creating" checkbox. - Selected board persists to `localStorage` so browser users don't shift the CLI's active board out from under a terminal they left open. - New `?board=<slug>` query param on every existing endpoint plus a new `/boards` CRUD surface (`GET /boards`, `POST /boards`, `PATCH /boards/<slug>`, `DELETE /boards/<slug>`, `POST /boards/<slug>/switch`). - Events WebSocket is pinned to a board at connection time; switching opens a fresh WS against the new board. Also fixes a pre-existing bug in the plugin's tenant / assignee filters: the SDK's `Select` uses `onValueChange(value)`, not native `onChange(event)`, so those filters silently didn't work. New `selectChangeHandler` helper wires both signatures. Tests ----- 49 new tests in `tests/hermes_cli/test_kanban_boards.py` covering: slug validation (valid / invalid / auto-downcase), path resolution (default = legacy path, named = `boards/<slug>/`, env var override), current-board resolution chain (env > file > default), board CRUD + archive / hard-delete, per-board connection isolation (tasks don't leak), worker spawn env injection (`HERMES_KANBAN_BOARD`, `HERMES_KANBAN_DB`, `HERMES_KANBAN_WORKSPACES_ROOT` all point at the right board), and end-to-end CLI surface. Regression surface: all 264 pre-existing kanban tests continue to pass. Live-tested via the dashboard: created 3 boards (default, hermes-agent, atm10-server), created tasks on each via both CLI (`--board <slug> create`) and dashboard (inline create on the Ready column), confirmed zero cross-board leakage, confirmed `BoardSwitcher` + `NewBoardDialog` work end-to-end in the browser.	2026-05-04 04:42:38 -07:00
vominh1919	135b4c8b35	fix(mcp): decouple AnyUrl import from mcp dependency AnyUrl was imported inside the same try block as mcp.client.auth, so when the mcp package was not installed, AnyUrl was undefined and _build_client_metadata raised NameError at runtime. Moved the AnyUrl import to its own try/except block so it's available whenever pydantic is installed (which is a core dependency), regardless of whether the mcp SDK is present. Also added pytest.importorskip('mcp') to the three test_build_client_metadata tests that exercise _build_client_metadata, since that function depends on OAuthClientMetadata from the mcp package.	2026-05-04 04:42:18 -07:00
vominh1919	0d563621fb	fix(test): skip bedrock adapter tests when botocore is not installed Six tests in test_bedrock_adapter.py import botocore.exceptions directly (ConnectionClosedError, EndpointConnectionError, ReadTimeoutError, ClientError) without guarding the import. When botocore is not installed (it's an optional dependency), these tests fail with ModuleNotFoundError instead of being gracefully skipped. Added pytest.importorskip('botocore') to each affected test function, following the same pattern used elsewhere in the test suite (e.g. test_voice_mode.py for numpy, test_mcp_oauth.py for mcp). Tests affected: - TestIsStaleConnectionError: 3 tests - TestCallConverseInvalidatesOnStaleError: 3 tests Before: 6 FAIL with ModuleNotFoundError After: 6 SKIP with reason message	2026-05-04 04:41:55 -07:00
vominh1919	d1d2d43387	fix(test): add skip marker for transcription tests requiring faster_whisper TestTranscribeLocalExtended patches faster_whisper.WhisperModel, which triggers an ImportError when the faster_whisper package is not installed. Added a pytest.mark.skipif marker using importlib.util.find_spec so these tests are gracefully skipped instead of failing with ModuleNotFoundError.	2026-05-04 04:41:36 -07:00
Teknium	844d4a32ce	chore(release): AUTHOR_MAP entries for Tier 1e salvage batch	2026-05-04 04:40:34 -07:00
Teknium	110387d149	docs(open-webui): fill gaps in quick setup — verify curls, ollama flag, restart note (#19654 ) Reported by @neopabo — the Open WebUI page was missing several steps users hit in practice: - Use hermes config set instead of hand-editing .env (matches current UX) - Restart-gateway note after enabling API_SERVER_ENABLED - curl /health + /v1/models verification step before jumping to Docker - ENABLE_OLLAMA_API=false in both docker run and compose snippets to suppress the empty Ollama backend that otherwise clutters the picker - 15-30s startup wait note for first-run embedding model download - Troubleshooting entry for the empty-Ollama-shadowing case - /v1/models troubleshoot command now includes the Authorization header	2026-05-04 04:36:18 -07:00
Siddharth Balyan	af6f9bc2a1	fix: refresh systemd unit on gateway boot (not just start/restart) (#19684 ) The resilient restart settings from PR #18639 only took effect when the gateway was started via `hermes gateway start` or `hermes gateway restart` — both of which call refresh_systemd_unit_if_needed() which writes the new unit and runs daemon-reload. However, when the gateway self-restarts via exit-code-75 (stale-code detection after `hermes update`, or the /restart command), systemd respawns the process directly without going through any CLI function. The unit file on disk stays stale, and systemd keeps using the old cached settings (StartLimitBurst=5, RestartSec=30) until someone manually runs `hermes gateway restart`. This meant that after PR #18639 was deployed, users who never ran `hermes gateway restart` manually were still vulnerable to the permanent-death-on-network-outage bug. Fix: call refresh_systemd_unit_if_needed() at the top of run_gateway() (the foreground entry point that systemd's ExecStart invokes). This ensures that on every boot — whether triggered by systemd restart, exit-75 respawn, or manual foreground run — the unit definition and daemon state are current. The call is best-effort (exceptions caught) and a no-op when the unit is already current (one stat + string compare).	2026-05-04 16:27:51 +05:30
Teknium	33f554d83c	feat(kanban-dashboard): workspace kind + path inputs in inline create form (#19679 ) Closes #18718. Exposes the existing `workspace_kind` + `workspace_path` fields (already accepted by POST /api/plugins/kanban/tasks) in the dashboard's per-column inline-create form so users can create tasks targeting a git worktree or an explicit directory without dropping back to the CLI. - Add a workspace-kind Select (scratch / worktree / dir) to InlineCreate in plugins/kanban/dashboard/dist/index.js. - Conditionally render a workspace_path Input next to the select when kind != scratch; placeholder tells the user whether the path is required (dir) or optional (worktree — derived from assignee when blank). - Submit wires `workspace_kind` / `workspace_path` into the POST body only when they're non-default, keeping the request shape small and interoperable with older dispatcher versions. E2E verified in a dashboard pointed at the worktree: selecting dir + typing /tmp/test-18718 produces a POST body with {workspace_kind: 'dir', workspace_path: '/tmp/test-18718'} and the task lands in sqlite with those fields set. 42/42 kanban dashboard plugin tests pass.	2026-05-04 03:40:39 -07:00
Grey0202	a219a0a4df	fix(anthropic): strip top-level oneOf/allOf/anyOf from tool input_schema Extends the existing _normalize_tool_input_schema to also drop top-level union keywords that Anthropic's tool schema validator rejects with HTTP 400. Several upstream and plugin tools ship schemas with a top-level oneOf/ allOf/anyOf (common for Pydantic discriminated unions). The existing strip_nullable_unions pass only handles anyOf-with-null patterns; a non-null top-level union keyword sails through and hits the API. Salvage of #16471 — approach folded into the existing normalize helper rather than introducing a parallel _sanitize_input_schema function, to avoid two schema-munging code paths running against the same input. Co-authored-by: Grey0202 <grey0202@users.noreply.github.com>	2026-05-04 03:17:35 -07:00
charliekerfoot	412f2389f1	fix(google_oauth): close TOCTOU window when saving credentials	2026-05-04 03:16:19 -07:00
Ioodu	e50809b771	fix(file-tools): cap read_file result size to prevent context window overflow Set max_result_size_chars=100_000 on the read_file registry entry (was float('inf')), closing the Layer 2 defense-in-depth gap in tool_result_storage.py. The existing Layer 1 guard inside _handle_read_file already returns a JSON error for oversized reads; this aligns the registry cap with every other tool. Update test_read_file_never_persisted → test_read_file_result_size_cap to assert 100_000, and add test_read_file_registry_cap_is_100k as an explicit regression guard against re-introducing float('inf'). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 03:14:59 -07:00
Teknium	5b6d413476	fix(cli,gateway): surface title errors from /new <name> The contributor's PR silently swallowed ValueError from SessionDB.set_session_title() with bare except Exception: pass. Users typing /new <title> with an already-in-use title got an untitled session and no feedback. Changes: - cli.py: catch ValueError from both sanitize_title() and set_session_title(); print the error and mark the session untitled in the banner (never echo the rejected title back). - gateway/run.py: append a warning note to the reset reply on title rejection; reflect the accepted title in the header. - Add regression tests for the duplicate-title path in CLI and gateway. Also map exx@example.com -> @exxmen in scripts/release.py.	2026-05-04 03:14:50 -07:00
Exx	f720751d79	feat(cli,gateway): /new accepts optional session name argument Allow users to start a fresh session and immediately set its title by passing a name to /new (or /reset): /new Refactor auth module Changes: - hermes_cli/commands.py: add args_hint='[name]' to /new command - cli.py: parse title argument in process_command(), pass to new_session() - cli.py: new_session() accepts title=None, sets title via SessionDB - gateway/run.py: _handle_reset_command() parses title, sets on new entry - gateway/session.py: reset_session() accepts optional display_name - tests: add test_new_session_with_title, test_reset_command_with_title, test_new_command_in_help_output All 36 affected tests pass.	2026-05-04 03:14:50 -07:00
ms-alan	055fde40e0	fix(doctor): check global agent-browser when local install not found When agent-browser is globally installed via 'npm install -g agent-browser' but not present in the local node_modules, doctor falsely warns that it's not installed. Add shutil.which('agent-browser') as a fallback check after the local path check. Closes #15951	2026-05-04 03:13:22 -07:00
xyiy001	e69d11d30c	fix(browser): allow CDP override to pass requirement checks Treat explicit CDP override mode as a valid browser backend even when agent-browser is absent, and add a regression test to prevent false-negative availability gating.	2026-05-04 03:12:30 -07:00
kshitijk4poor	46072425fe	fix(model-picker): exclude providers with empty credential pool entries The auth check in list_authenticated_providers used mere key presence in credential_pool to conclude a provider is authenticated. An empty entry (pool_store key with no actual credentials) caused providers like ollama-cloud to appear as authenticated in the model picker even when no OLLAMA_API_KEY was set. The user's picker then offered nemotron-3-super under Ollama Cloud; selecting it routed every subsequent turn to https://ollama.com/v1, which rejected the requests with HTTP 400. Fix: drop the pool_store key-existence check from both section 2 (HERMES_OVERLAYS) and section 2b (CANONICAL_PROVIDERS). The following load_pool().has_credentials() call already handles the legitimate pooled- credential case; checking for an empty key just ahead of it was redundant and actively harmful.	2026-05-04 03:12:12 -07:00
briandevans	c8ecb56f27	fix(cli): reject invalid argv values from -p/--profile before resolving `_apply_profile_override()` scans `sys.argv` for `-p / --profile` at module import time. When `hermes_cli.main` is imported inside pytest with `-p no:xdist` on the command line, it picks up `'no:xdist'` as a profile name candidate, then passes it to `resolve_profile_env()` which raises `ValueError` (invalid format), and the function calls `sys.exit(1)` — aborting test collection with an INTERNALERROR before any test runs. The same conflict affects any tool or wrapper that uses `-p` for its own flag and then imports `hermes_cli.main`. Fix: add a format guard immediately after step 1 (explicit flag scan). If `consume == 2` (the value came from `-p <value>`, not `--profile=value`) and the candidate doesn't match the canonical profile-name pattern `[a-z0-9][a-z0-9_-]{0,63}` (mirrored from `hermes_cli.profiles._PROFILE_ID_RE`), discard it and continue as if no `-p` flag was found. The `active_profile` file-based fallback (step 2) only reads a file written by hermes itself, so it always produces valid names and needs no guard. Regression guard: with the guard reverted, importing `hermes_cli.main` with `sys.argv = ['pytest', '-p', 'no:xdist', ...]` raises `SystemExit(1)`. With the guard in place, the import succeeds and `sys.argv` is left intact for pytest. Legitimate `-p coder` still flows through to `resolve_profile_env()` unchanged. Rebased onto current `origin/main` (``e5dad4ac5``) — the prior branch base (``4fade39c9``) was 824 commits behind and the PR was DIRTY / CONFLICTING. The 1.5 HERMES_HOME-set early-return block has since landed between the original insertion point and step 2; the new guard is positioned correctly before the early return so a bogus `-p` value no longer prevents the early return from kicking in. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 03:11:47 -07:00
ChanlerDev	e3461e0b2a	fix(cli): remove dead 'q' check from quit command resolution The 'q' alias is defined for 'queue' command in commands.py:93. The hardcoded 'q' in cli.py:5910 was dead code - resolve_command('q') returns the queue CommandDef, so canonical would never be 'q'. Removes the misleading check without changing any behavior: - /quit and /exit still exit (defined aliases) - /q still maps to queue (as intended)	2026-05-04 03:11:30 -07:00
YAMAGUCHI Seiji	cba86b7303	fix(cronjob): treat bare 'custom' provider as unspecified in override `_resolve_model_override` treated any non-empty `provider` string from the LLM as user-specified and skipped the pin-to-current-provider fallback. When the LLM wrote bare `'custom'` (instead of the canonical `'custom:<name>'` referring to a custom_providers entry), the value serialized into jobs.json as `"provider": "custom"` and the scheduler could never resolve a provider from it — the cron job failed silently at run time. Treat bare `'custom'` as "no provider supplied" so the current main provider gets pinned instead, matching behaviour for the omitted case. Defence-in-depth complement to a schema-description fix (#15477) that discourages the LLM from emitting bare `'custom'` in the first place.	2026-05-04 03:11:11 -07:00
pander	6b88f46c54	fix(compressor): trigger fallback on timeout errors alongside model-not-found Previously only HTTP 404/503 and specific error strings triggered a fallback to the main model when the summary model was unavailable. Timeout errors (HTTP 408/429/502/504, or error strings containing 'timeout') entered a short cooldown instead, leaving context to grow unbounded for the rest of the session. Add _is_timeout detection alongside _is_model_not_found so that transient timeout errors on the summary model also trigger immediate fallback to the main model, preventing compression failure from cascading. Closes #15935	2026-05-04 03:10:53 -07:00
DaniuXie	a45bd28598	fix(wecom): set SUPPORTS_MESSAGE_EDITING=False to prevent broken streaming	2026-05-04 03:10:36 -07:00
zng8418	d2ea959fe9	fix(doctor): skip /models health check for MiniMax CN (returns 404) MiniMax China (api.minimaxi.com) does not expose a /v1/models endpoint. The doctor command was probing it and reporting HTTP 404 as a warning, even though the API works correctly for chat completions. Set supports_health_check=False for MiniMax CN so doctor shows "(key configured)" instead of the false 404 warning. Refs #12768, #13757	2026-05-04 03:10:17 -07:00
ideathinklab01-source	d17eff29d5	fix(delegate): guard _load_config() against delegation: null in config.yaml YAML parses `delegation: null` as Python None. `dict.get(key, {})` only uses the default when the key is missing, not when it exists with a None value, so `cfg.get("max_concurrent_children")` crashes with `'NoneType' object has no attribute 'get'`. Same pattern as `fd9b692d` (fix(tui): tolerate null top-level sections). Use `dict.get(key) or {}` to handle both missing and None-valued keys. Closes: delegation null config crash (same class as #7215, #7346)	2026-05-04 03:09:59 -07:00
ygd58	2d3d1d9736	fix(tui): use --outdir instead of --outfile in hermes-ink build script esbuild raises 'Must use outdir when there are multiple input files' on Android/Termux ARM64 with esbuild >=0.25. The build script used --outfile=dist/ink-bundle.js which is only valid for a single entry point with no code splitting. Switching to --outdir=dist fixes the error and names the output file dist/entry-exports.js (matching the input file name). Update index.js to import from the new path. Fixes #16072	2026-05-04 03:09:41 -07:00
LLing486	145a38a875	fix(agent): preserve dots in model names for Xiaomi MiMo provider Add 'xiaomi' to the _anthropic_preserve_dots() provider whitelist and 'xiaomimimo.com' to the URL-based fallback check. Without this, normalize_model_name() converts mimo-v2.5 to mimo-v2-5, which the Xiaomi API rejects with HTTP 400. Fixes #16156	2026-05-04 03:09:24 -07:00
YAMAGUCHI Seiji	0896944382	fix(cronjob): advertise 'custom:<name>' provider format in tool schema The `provider` field in CRONJOB_SCHEMA only showed examples like 'openrouter' and 'anthropic', with no mention of the canonical 'custom:<name>' form required for custom_providers entries. When the user has custom providers configured, LLMs tend to write the bare type name ('custom') because the schema does not advertise the ':<name>' suffix. The bare value then serializes into jobs.json and causes the cron job to fail silently at run time — `_resolve_model_override` treats it as a user-specified provider and skips the pin-to-current fallback, but no provider ever resolves from the bare 'custom' string. Clarifying the schema so the canonical form is discoverable addresses the root cause at the tool-definition boundary.	2026-05-04 03:09:07 -07:00
jjjojoj	9c64d09610	fix(status): show NVIDIA NIM api key status hermes status was missing NVIDIA API key from its API keys display. Now shows NVIDIA NIM ✓/✗ with key hash like other providers. Fixes #16082	2026-05-04 03:08:50 -07:00
Teknium	64b39d835e	chore(release): AUTHOR_MAP entries for Tier 1d salvage batch	2026-05-04 03:07:30 -07:00
taeng0204	20a06c586f	fix(dashboard): render null instead of flashing spinner during plugin load	2026-05-04 03:06:45 -07:00
taeng0204	06a6d6967a	fix(dashboard): defer unknown-route redirect while dashboard plugins load	2026-05-04 03:06:45 -07:00
Teknium	986ec04048	docs: document /kanban slash command (#19584 ) * docs: document /kanban slash command The kanban user guide and slash-commands reference only mentioned the /kanban slash command in passing. Add a proper section covering: - CLI and gateway both expose the full hermes kanban surface via hermes_cli.kanban.run_slash (identical argument surface) - Mid-run usage: /kanban bypasses the running-agent guard, so reads and writes land immediately while an agent is still in a turn - Auto-subscribe on /kanban create from the gateway — originating chat is subscribed to terminal events, with a worked example - Output truncation (~3800 chars) in messaging - Autocomplete hint list vs full subcommand surface Also adds /kanban rows to both slash-command tables (CLI + messaging) in reference/slash-commands.md and moves it into the 'works in both' notes bucket. * docs(kanban): frame the model's tool surface as primary, CLI as the human surface The kanban user guide and CLI reference read as if you drive the board by running `hermes kanban` commands everywhere. In practice: - You (human, scripts, cron, dashboard) use the `hermes kanban …` CLI, the `/kanban …` slash command, or the REST/dashboard. - Workers spawned by the dispatcher use a dedicated `kanban_` toolset (`kanban_show`, `kanban_complete`, `kanban_block`, `kanban_heartbeat`, `kanban_comment`, `kanban_create`, `kanban_link`) and never shell out to the CLI. Changes to `user-guide/features/kanban.md`: - New 'Two surfaces' intro distinguishes the two front doors up front. - Quick-start section re-labelled so each step says who is running it (you vs. orchestrator vs. worker). - 'How workers interact with the board' rewritten: - Lead with "Workers do not shell out to `hermes kanban`." - Tool table extended with required params. - Concrete worker-turn example (`kanban_show` → `kanban_heartbeat` → `kanban_complete`) and an orchestrator fan-out example (`kanban_create` x N with `parents=[...]`). - Moved 'Why tools not CLI' from a defensive aside to a clean follow-up section. - 'Worker skill' section explicitly says the lifecycle is taught in tool calls, not CLI commands. - 'Pinning extra skills' reordered — orchestrator tool form first (the usual case), human/CLI second, dashboard third. - 'Orchestrator skill' now shows a canonical `kanban_create` / `kanban_link` / `kanban_complete` tool-call sequence instead of only describing what the skill teaches. - CLI-command-reference heading now clarifies this is the human surface, with a cross-link to the tool-surface section. - 'Runs — one row per attempt' structured-handoff example replaced: the primary example is now `kanban_complete(summary=..., metadata=...)` (what a worker actually does), with the CLI form retained as "when you, the human, need to close a task a worker can't." Changes to `reference/cli-commands.md`: - `hermes kanban` intro marks itself as the human / scripting surface and links out to the worker tool surface. - Corrected `comment <id>` description — the next worker reads it via `kanban_show()`, not by running `hermes kanban show`. docs(kanban-tutorial): reframe worker actions as tool calls Honest answer to Teknium's follow-up: no, the first pass missed the tutorial. The four stories all showed `hermes kanban claim / complete / block / unblock` as if the backend-dev, pm, and reviewer personas were humans running CLI commands. In a real hermes kanban run those agents are dispatcher-spawned workers driving the board through the `kanban_` tool surface. Changes: - Setup intro now distinguishes the three surfaces up front (dashboard / CLI for you, `kanban_` tools for workers) and establishes the convention: `bash` blocks are commands you run, `# worker tool calls` blocks are what the agent emits. - Story 1 (solo dev schema): 'Claim the schema task, do the work, hand off' block replaced with the dispatcher spawning the backend-dev worker and a `kanban_show → kanban_heartbeat → kanban_complete` tool-call sequence. The 'On the CLI' `hermes kanban show / runs` block re-labelled as 'you peeking at the board' to keep it correct as a human inspection step. - Story 2 (fleet farming): note about structured handoff updated from `--summary` / `--metadata` CLI flags to `kanban_complete(summary=..., metadata=...)` tool form. - Story 3 (role pipeline): the big PM/engineer/reviewer block fully rewritten as three worker tool-call sequences — PM worker completes spec, engineer worker blocks, human/reviewer `hermes kanban unblock` (or `/kanban unblock`), engineer worker respawns and completes. The respawn-as-new-run mechanic is now explicit. - Reviewer paragraph: `build_worker_context` replaced with `kanban_show()` — that's the tool that delivers the parent handoff to the model. - Structured handoff section heading and body updated: `--summary`/`--metadata` → `summary`/`metadata` (tool params), with a note that the tool surface doesn't expose a bulk variant for the same reason the CLI refuses multi-task `complete`. Story 4 (circuit breaker) unchanged — its workers fail to spawn, so there are no tool calls to show; the `hermes kanban create` and `hermes kanban runs` commands in it are correctly human-driven.	2026-05-04 03:05:34 -07:00
Teknium	0628004709	docs(model-catalog): rename x-ai/grok-4.20-beta to x-ai/grok-4.20 (#19640 ) OpenRouter and Nous Portal dropped the -beta suffix from the Grok 4.20 slug. The OpenRouter section already used the new slug; this updates the Nous Portal section and bumps updated_at.	2026-05-04 02:48:30 -07:00
ms-alan	c659a16899	fix(cli): detect quoted relative paths in _detect_file_drop Closes #15197	2026-05-04 02:48:20 -07:00
ms-alan	08b8465ca9	fix(email): add required Date header to send_message_tool._send_email Adds RFC 5322 Date header to the _send_email tool path in tools/send_message_tool.py. Issue #15160 noted that both gateway/platforms/email.py and tools/send_message_tool.py construct MIMEMultipart/MIMEText messages without setting a Date header. RFC 5322 requires the Date header; mail filters reject messages that lack it. PR #15207 fixed the gateway/platforms/email.py path but did not cover tools/send_message_tool._send_email, which is used by the send_message tool for cross-channel messaging. This change adds msg["Date"] = formatdate(localtime=True) to _send_email, mirroring the fix applied to the gateway email adapter. Closes #15160	2026-05-04 02:48:20 -07:00
thchen	51dc98d314	fix(agent): detect Qwen3/Ollama inline thinking after tool calls Ollama serves Qwen3 thinking inside the content field as <think>...</think> blocks rather than in the API-level reasoning_content field. This means _has_structured was False for these responses, so an empty-looking reply after a tool call triggered the nudge instead of the prefill continuation, causing a double-response loop. Fix: detect <think>/<thinking>/<reasoning> in final_response and: 1. Skip the nudge when thinking is present (model is still reasoning) 2. Include _has_inline_thinking in _has_structured so prefill kicks in	2026-05-04 02:47:29 -07:00
LeonSGP43	0df7e61d2c	fix(cli): omit empty api_mode when probing custom models	2026-05-04 02:46:41 -07:00
QifengKuang	52c539d53a	fix(agent): disable SDK retries on per-request OpenAI clients Per-request OpenAI-wire clients (used by both non-streaming and streaming chat-completions paths in _interruptible_api_call) should not run the SDK's built-in retry loop: the agent's outer loop owns retries with credential rotation, provider fallback, and backoff that the SDK can't see. Leaving SDK retries on (default 2) compounds with our outer retries and lets a single hung provider request stretch to ~3x the per-call timeout before our stale detector reports it. Shared/primary clients and Anthropic / Bedrock paths are unaffected (they don't go through here). Salvage of #15811 core improvement — the timeout push-down in the original PR required scaffolding that has since been refactored on main, so only the max_retries=0 change is preserved. Co-authored-by: QifengKuang <k2767567815@gmail.com>	2026-05-04 02:43:20 -07:00
Teknium	3c070f9f9d	fix(curator): only mark agent-created for background-review sediment (#19621 ) Tighten the provenance semantics added in #19618: skills a user asks a foreground agent to write via skill_manage(create) now stay invisible to the curator. Only skills the background self-improvement review fork sediments through skill_manage get the created_by=agent marker. - tools/skill_provenance.py — new ContextVar module mirroring the _approval_session_key pattern: set_current_write_origin / reset / get / is_background_review. Default origin is 'foreground'; the review fork sets 'background_review'. - run_agent.py — run_conversation() binds the ContextVar from self._memory_write_origin at the top of each call. The review fork runs on its own thread (fresh context), so foreground and review contexts never cross-contaminate. - tools/skill_manager_tool.py — skill_manage(action='create') now only calls mark_agent_created() when is_background_review(). All other cases (foreground create, patch, edit, write_file, delete) continue as before. - tests: test_skill_provenance.py (6 tests covering the ContextVar surface), split test_full_create_via_dispatcher into foreground vs. review-fork variants, curator status tests now mark-first. Why: the agent routinely edits existing user skills on the user's behalf; those writes must never flip provenance. And when a user explicitly asks the foreground agent to create a skill, that skill belongs to the user. The curator should only be cleaning up after its own autonomous sediment from the review nudge loop.	2026-05-04 02:42:16 -07:00
Teknium	bff484a51b	fix(kanban-dashboard): widen drawer, bump body fonts, fix code-block contrast (#19638 ) Closes #18576. Addresses three of four complaints from the readability report; live-verified in a dashboard against a seeded task with body, comments, and run history. - Drawer default width 480px → 640px, exposed as the CSS var `--hermes-kanban-drawer-width` so deployments / user themes can override without forking the plugin. - Bump body/meta/pre/log/run-history font sizes from the 0.65-0.75rem cluster to the 0.78-0.85rem cluster. Long paths and code snippets in task bodies, run metadata, and worker logs are legible again instead of requiring a squint. - Fix the black-text-on-dark-theme regression in fenced markdown code blocks. Root cause: themes that don't define `--color-foreground` (NERV, at least) leave `color: var(--color-foreground)` resolving empty on <code>, which then falls back to the UA default (near-black) instead of inheriting from the drawer's <body>. Fix: force `color: inherit` on both inline and fenced code, and give the fenced block background via `currentColor` instead of `--color-foreground` so there's a visible card even when the theme var is absent. Out of scope for this PR (comments added to #18576): - Draggable resize handle (structural JS work; plugin ships built-only, no src/ in-tree). - Live worker-log viewer for running tasks (backend WS + component). - Sibling fix: themes like NERV should define --color-foreground. The current changes make the drawer robust against that gap, but the root fix belongs in the theme layer.	2026-05-04 02:41:51 -07:00
alt-glitch	2a52e28568	fix(setup): skip AUXILIARY_VISION_MODEL write when input is blank Guard the save_env_value('AUXILIARY_VISION_MODEL', ...) call with 'if _selected_vision_model:' so blank input at the non-OpenAI vision model prompt doesn't nuke existing values in .env. save_env_value has no internal guard against empty strings — it faithfully writes whatever it receives, including empty values that shadow the previously-configured model. Salvage of #15504 (core hunk). Contributor's test was dropped because it collided with subsequent test refactors; the fix stands on its own. Co-authored-by: alt-glitch <balyan.sid@gmail.com>	2026-05-04 02:41:47 -07:00

1 2 3 4 5 ...

7111 commits