hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-07 02:51:50 +00:00

Author	SHA1	Message	Date
Siddharth Balyan	1fa76607c0	feat: trigram FTS5 index for CJK search, replace LIKE fallback (#16651 ) * fix: bypass FTS5 for CJK queries in session_search FTS5 default tokenizer splits CJK characters into individual tokens, so multi-character queries like "大别山项目" become AND of single chars. This produces few/no results compared to LIKE substring search. For CJK queries, skip FTS5 entirely and use LIKE for accurate phrase matching. Fixes NousResearch/hermes-agent#15500 * fix: cache _contains_cjk, escape LIKE wildcards, add regression tests On top of the CJK FTS5 bypass from #15509: - Cache _contains_cjk() result in a local var to avoid redundant O(n) scans on every CJK query - Escape %, _ in LIKE queries so literal wildcards in user input are not treated as SQL wildcards (consistent with other LIKE queries in hermes_state.py that use ESCAPE '\') - Fix misleading comment ('or CJK fallback' → accurate description) - Add 3 regression tests: - test_cjk_partial_fts5_results_supplemented_by_like (#15500 / #14829) - test_cjk_like_dedup_no_duplicates - test_cjk_like_escapes_wildcards (new wildcard escaping) * feat: trigram FTS5 index for CJK search, replace LIKE fallback Replace the LIKE '%query%' full-table-scan fallback for CJK queries with a proper trigram FTS5 index (messages_fts_trigram). The trigram tokenizer creates overlapping 3-byte sequences so substring matching works natively for any script — CJK, Thai, etc. For queries with 3+ CJK characters: uses the trigram FTS5 table with proper ranking, snippets, and indexed lookups. For shorter queries (1-2 CJK chars): falls back to LIKE since the trigram tokenizer needs ≥9 UTF-8 bytes (3 CJK chars) minimum. Schema v10 migration creates the trigram table and backfills existing messages. Triggers keep the index in sync on INSERT/UPDATE/DELETE. Builds on top of #16276 (bypass FTS5 for CJK, escape LIKE wildcards). --------- Co-authored-by: vominh1919 <vominh1919@gmail.com>	2026-04-28 00:12:07 +05:30
brooklyn!	e80504b088	Merge pull request #16656 from NousResearch/bb/tui-parity-mutating-commands fix(tui): route mutating slash commands through live gateway state	2026-04-27 13:30:19 -05:00
Brooklyn Nicholson	ed4f7f0ba3	test(tui): skip slash parity matrix when Python registry is unavailable Keep the parity test backed by the real Python command registry while avoiding hard failures in Node-only Vitest environments that cannot import hermes_cli.commands.	2026-04-27 13:19:11 -05:00
kshitijk4poor	56724147ef	fix(providers/gmi): post-salvage review fixes - config.py: remove dead ENV_VARS_BY_VERSION[17] entry (current _config_version is 22, so all users are past version 17 and would never be prompted for GMI_API_KEY on upgrade — consistent with how arcee was added) - auxiliary_client.py: use google/gemini-3.1-flash-lite-preview as GMI aux model instead of anthropic/claude-opus-4.6 (matches cheap fast-model pattern used by all other providers: zai→glm-4.5-flash, kimi→kimi-k2-turbo-preview, stepfun→step-3.5-flash, kilocode→google/gemini-3-flash-preview) - test_gmi_provider.py: fix malformed write_text() call in doctor test (was: write_text("GMI_API_KEY=* encoding="utf-8") → missing closing quote, wrote literal string 'GMI_API_KEY=* encoding=' to .env file) - test_gmi_provider.py + test_auxiliary_client.py: update aux model assertions to match new cheaper default - docs/integrations/providers.md: add 'gmi' to inline 'Supported providers' fallback list (was only in the table, not the inline list at line ~1181) - docs/reference/cli-commands.md: add 'gmi' to --provider choices list	2026-04-27 11:17:59 -07:00
Isaac Huang	c53fcb0173	feat(providers): add GMI Cloud as a first-class API-key provider (#11955 ) Add GMI Cloud (api.gmi-serving.com) as a full first-class API-key provider with built-in auth, aliases, model catalog, CLI entry points, auxiliary client routing, context length resolution, doctor checks, env var tracking, and docs. - auth.py: ProviderConfig for 'gmi' (api_key, GMI_API_KEY / GMI_BASE_URL) - providers.py: HermesOverlay with extra_env_vars for models.dev detection - models.py: curated slash-form model catalog; live /v1/models fetch - main.py: 'gmi' in _named_custom_provider_map and --provider choices - model_metadata.py: _URL_TO_PROVIDER, _PROVIDER_PREFIXES, dedicated context-length probe block (GMI's /models has authoritative data) - auxiliary_client.py: alias entries; _compat_model fix for slash-form models on cached aggregator-style clients; gmi aux default model - doctor.py: GMI in provider connectivity checks - config.py: GMI_API_KEY / GMI_BASE_URL in OPTIONAL_ENV_VARS - conftest.py: explicit GMI_BASE_URL clearing (not caught by _API_KEY suffix) - docs: providers.md, environment-variables.md, fallback-providers.md, configuration.md, quickstart.md (expands provider table) Co-authored-by: Isaac Huang <isaachuang@Isaacs-MacBook-Pro.local>	2026-04-27 11:17:59 -07:00
Brooklyn Nicholson	8a33ed6136	fix(tui): address rollback guard and parity registry review Load slash command names from the Python registry instead of regex-parsing source, and guard native rollback when no TUI session is active.	2026-04-27 13:10:13 -05:00
brooklyn!	41f70e6fc4	Merge pull request #16664 from NousResearch/bb/fix-tui-forceredraw-export fix(tui): expose forceRedraw in Ink type shim	2026-04-27 13:08:16 -05:00
Brooklyn Nicholson	adbd173ddd	fix(tui): expose forceRedraw in Ink type shim	2026-04-27 13:07:48 -05:00
Brooklyn Nicholson	4f59510dd4	fix(tui): tighten fast-mode support validation Distinguish missing model from unsupported model before enabling fast mode and cover both cases so config and live agent state remain untouched on invalid fast toggles.	2026-04-27 13:00:11 -05:00
Brooklyn Nicholson	4a08f1015a	fix(tui): reject fast mode for unsupported live models Match classic CLI parity by refusing to enable fast mode when the active model cannot produce fast request overrides, avoiding a misleading fast status with no runtime effect.	2026-04-27 12:55:41 -05:00
Brooklyn Nicholson	8bd5d0667a	Merge origin/main into bb/tui-parity-mutating-commands Resolve session command merge conflict and keep the branch current with main so PR #16656 is mergeable.	2026-04-27 12:51:11 -05:00
brooklyn!	6d24880604	Merge pull request #16657 from NousResearch/bb/tui-keybinding-model-parity fix(tui): align Ctrl+L and /model default scope with classic CLI	2026-04-27 12:49:37 -05:00
Brooklyn Nicholson	b8556eb15e	fix(tui): address fast-mode live sync review feedback Make `config.set fast status` read-only and keep live agent request overrides in sync with fast-mode toggles so runtime API kwargs match the selected mode.	2026-04-27 12:47:42 -05:00
Brooklyn Nicholson	b3e7a412e2	fix(tui): wire Ctrl+L to Ink forceRedraw path Expose a small forceRedraw API from @hermes/ink and use it for Ctrl/Cmd+L so the hotkey performs a real terminal clear + full repaint instead of a no-op state patch.	2026-04-27 12:44:24 -05:00
Brooklyn Nicholson	da6f8449a5	test(tui): tighten redraw hotkey review follow-ups Use explicit repaint patch semantics for Ctrl/Cmd+L and narrow the hotkey assertion to the actual +L entry so unrelated descriptions do not cause false failures.	2026-04-27 12:30:40 -05:00
Brooklyn Nicholson	a13449a40a	fix(tui): address Copilot review feedback on mutating command parity Harden busy mode config reads against invalid display config shapes and align /fast help+usage text with accepted aliases, with regression coverage for non-dict display values.	2026-04-27 12:30:30 -05:00
Brooklyn Nicholson	17029a64e8	chore(ui-tui): apply npm run fix formatting pass Run ui-tui lint autofix + prettier and commit the resulting formatting-only changes for the keybinding/model parity branch.	2026-04-27 12:25:27 -05:00
Brooklyn Nicholson	487da4b72b	chore(ui-tui): apply npm run fix formatting pass Run ui-tui lint autofix + prettier and commit the resulting formatting-only changes for the parity PR branch.	2026-04-27 12:25:21 -05:00
Brooklyn Nicholson	4909b94f99	fix(tui): align Ctrl+L and /model with classic CLI semantics Make Ctrl+L non-destructive by redrawing the current screen state instead of starting a new session, and stop auto-appending --global for typed /model commands so session scope remains the default unless explicitly requested.	2026-04-27 12:23:56 -05:00
Brooklyn Nicholson	a4cb3ef66c	fix(tui): make mutating slash paths native and lifecycle-safe Route /browser, /reload-mcp, /rollback, /stop, /fast, and /busy through direct TUI RPC handlers so state changes hit the live gateway session instead of slash-worker fallback. Add TUI session finalize/reset parity hooks (memory commit + plugin boundaries) and parity matrix tests to keep mutating commands off fallback.	2026-04-27 12:20:08 -05:00
brooklyn!	d5a89283b7	Merge pull request #16625 from NousResearch/bb/fix-tui-title-session-sync fix(tui): keep /title session names in sync	2026-04-27 12:05:54 -05:00
Brooklyn Nicholson	633f74504f	fix(ci): resolve follow-up title edge case and flaky checks Handle queued-title ValueError cleanup during session init, harden Discord message source building for test stubs, and fix the Dockerfile contract test syntax error. Also refresh the TUI lockfile and Nix build flags so nix ubuntu-latest no longer fails on npm lock/peer resolution drift.	2026-04-27 11:49:02 -05:00
Brooklyn Nicholson	27936ee02d	fix(tui-gateway): keep queued user titles from being dropped Retry queued pending titles even when the DB already has a non-empty title so explicit user title intents are not silently lost (for example after auto-title). Includes regression coverage.	2026-04-27 11:31:49 -05:00
Brooklyn Nicholson	3aa86717b6	fix(tui-gateway): harden pending-title retry and user errors Retry persisting queued titles on session.title reads and map title validation failures to a user-facing 4022 code instead of generic 5007.	2026-04-27 11:27:51 -05:00
Brooklyn Nicholson	492c4c6573	fix(tui-gateway): address follow-up Copilot title threads Tighten pending-title flush during session init and treat row lookup failures during title-set no-op detection as RPC errors instead of silently queueing.	2026-04-27 11:15:37 -05:00
Brooklyn Nicholson	3824b03237	fix(tui-gateway): harden session title RPC edge cases Handle session.title read failures without crashing, distinguish no-op title writes from missing session rows, and use a distinct empty-title error code with regression coverage.	2026-04-27 11:05:10 -05:00
Brooklyn Nicholson	42b917c92c	chore: uptick	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	7ccfb97fee	test(cli): assert active-session file lifecycle in launch_tui Validate that the temp active-session file exists while the TUI subprocess runs and is removed after launch cleanup to match mkstemp semantics.	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	7a6128cc4f	fix(tui): harden active-session temp file handling - create HERMES_TUI_ACTIVE_SESSION_FILE with mkstemp instead of a predictable tmp path and always cleanup in finally - add assertions that launch wiring uses a randomized session file path and removes it on exit	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	4b28140912	fix(cli): tighten MRU lookup and session DB cleanup - use a grouped last_active join in search_sessions to avoid per-row correlated max lookups - always close SessionDB in _resolve_last_session via finally and add regression coverage for search failure cleanup	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	653b5ec128	fix(tui): report actual session on exit	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	164e33aa46	fix(cli): resolve -c by true MRU session - order session listing by computed last_active in SessionDB so callers get MRU rows directly - keep _resolve_last_session as a single-row lookup and add regression coverage for >20 session sampling	2026-04-27 08:52:12 -07:00
Brooklyn Nicholson	cdfbd89ea5	fix(tui): keep /title session names in sync Route TUI /title through session.title RPC and queue titles when the session DB row is still initializing, so renamed sessions reliably appear in /resume and browse flows.	2026-04-27 10:51:14 -05:00
kshitijk4poor	730347e38f	feat(skills): expand touchdesigner-mcp with GLSL, post-FX, audio, geometry references (#13664 ) Add 6 new reference files with generic reusable patterns: - glsl.md: uniforms, built-in functions, shader templates, Bayer dither - postfx.md: bloom, CRT scanlines, chromatic aberration, feedback glow - layout-compositor.md: layoutTOP, overTOP grids, panel dividers - operator-tips.md: wireframe rendering, feedback TOP setup - geometry-comp.md: instancing, POP vs SOP rendering, shape morphing - audio-reactive.md: band extraction (audiofilterCHOP), beat detection, MIDI Expand pitfalls.md (#46-63): - Connection syntax, moviefileoutTOP bug, batch frame capture - TOP.save() time advancement, feedback masking, incremental builds - MCP reconnection after project.load(), TOX reverse-engineering - sliderCOMP naming, create() suffix requirement - COMP reparenting (copyOPs), expressionCHOP crash - Strip session-specific names in earlier pitfalls (promo_ -> my_) - Audio device CHOP at FPS=0: active=False is the fix, not volume=0 All content is generic — no session-specific paths, hardware, aesthetics, or param-name-only entries (those belong in td_get_par_info). Bumps version 1.0.0 -> 1.1.0. Salvaged from @kshitijk4poor's original PR #13664; dropped setup.sh and troubleshooting.md changes that reverted subsequent HERMES_HOME and pgrep fixes already on main, and preserved original author frontmatter.	2026-04-27 08:46:36 -07:00
Teknium	628ca99d9b	fix(compression): show main + aux model and provider in feasibility warning (#16619 ) The auto-lowered-threshold warning only named the compression model, making it confusing when the main and aux models are configured with the same slug but end up with different resolved context lengths (e.g. OpenRouter's stepfun/step-3.5-flash catalog value vs. a main-model context_length override). Users couldn't tell whether the warning reflected two different models or a context-resolution mismatch. Now includes both 'model (provider)' labels. The aux provider falls back to the client's base_url hostname when the configured provider is 'auto', so users see where compression is actually being called.	2026-04-27 08:43:24 -07:00
Teknium	460a8ce5d9	chore(release): map hermes-agent-dhabibi bot -> dhabibi	2026-04-27 08:35:50 -07:00
hermes-agent-dhabibi	aa53fb661a	fix(copilot): mark native image requests as vision Co-authored-by: dhabibi <9087935+dhabibi@users.noreply.github.com>	2026-04-27 08:35:50 -07:00
hermes-agent-dhabibi	8402ba150e	fix(copilot): send vision header for Copilot vision requests Thread a vision-request flag through auxiliary provider resolution so Copilot clients can include Copilot-Vision-Request only for vision tasks. This preserves normal text requests while ensuring Copilot vision payloads reach the vision-capable route. Add regression coverage for Copilot vision routing and keep cached text and vision clients separate so a text client without the header is not reused for vision. Co-authored-by: dhabibi <9087935+dhabibi@users.noreply.github.com>	2026-04-27 08:35:50 -07:00
brooklyn!	512c610058	Merge pull request #16605 from NousResearch/bb/fix-tui-docker-ink-build fix(docker): prebuild TUI assets in image	2026-04-27 10:17:58 -05:00
Brooklyn Nicholson	b479205396	fix(docker): tighten TUI build contract	2026-04-27 10:15:00 -05:00
Austin Pickett	60f2415a4a	Merge pull request #16600 from NousResearch/austin/fix/model-provider fix(models): consolidate provider and model into /model command	2026-04-27 08:14:27 -07:00
Austin Pickett	082acc75b0	fix(review): address copilot review	2026-04-27 11:06:28 -04:00
Brooklyn Nicholson	4424a0e0f7	fix(docker): prebuild TUI assets in image	2026-04-27 10:05:07 -05:00
kshitij	98d75dea5a	perf(tui): lazily seed virtual history heights (#16523 )	2026-04-27 07:55:45 -07:00
Teknium	9b55365f6f	fix(gateway,cron): close ephemeral agents + reap stale aux clients (salvage #13979 ) (#16598 ) * fix: clean gateway auxiliary client caches on teardown * fix(gateway): recover from stale pid files and close cron agents Two issues were keeping the gateway from surviving long runs: 1. `_cleanup_invalid_pid_path` delegated to `remove_pid_file`, which refuses to unlink when the file's pid differs from our own. That safety check exists for the --replace atexit handoff, but it also applied to stale-record cleanup, so after a crashy exit the pid file was orphaned: `write_pid_file()`'s O_EXCL create then failed with `FileExistsError`, and systemd looped on "PID file race lost to another gateway instance". Unlink unconditionally from this helper since the caller has already verified the record is dead. 2. The cron scheduler never closed the ephemeral `AIAgent` it creates per tick, and never swept the process-global auxiliary-client cache. Over days of 10-minute ticks this leaked subprocesses and async httpx transports until the gateway hit EMFILE. Release the agent and call `cleanup_stale_async_clients()` in `run_job`'s outer `finally`, matching the gateway's own per-turn cleanup. * chore(release): map bloodcarter@gmail.com -> bloodcarter --------- Co-authored-by: bloodcarter <bloodcarter@gmail.com>	2026-04-27 07:41:42 -07:00
Austin Pickett	a0b62e0c5a	fix(models): consolidate provider and model into /model command	2026-04-27 10:38:36 -04:00
Teknium	ac0325c257	diagnostic(cli): log slow bracketed-paste handler (>500ms) for #16263 (#16575 ) When a paste takes longer than 500ms to process on the prompt_toolkit event-loop thread, emit a logger.warning with elapsed time, byte size, line count, and sys.platform. Gives us concrete repro data for the recurring 'CLI freezes after paste on macOS' class of reports (issue #16263, plus sibling reports across Claude Code / Cursor / Lightroom against macOS Tahoe 26). Pure diagnostic — no behavior change. Two time.perf_counter() calls and one conditional per paste event. Log line only fires when the handler is actually slow, so normal pastes add no log noise.	2026-04-27 06:44:36 -07:00
Teknium	817633bc5d	feat(backup): exclude SQLite WAL/SHM/journal sidecars (#16576 ) The backup takes a consistent snapshot of each .db via sqlite3.backup(), so shipping the live .db-wal / .db-shm / .db-journal alongside pairs the fresh snapshot with stale sidecar state and produces a torn restore on first open. Sidecars are transient and SQLite regenerates them on next connection anyway. This also trims multi-MB of junk from every zip — state.db-wal alone was ~9 MB here, doubled by the fact the WAL is the live write-ahead log, not data.	2026-04-27 06:43:52 -07:00
Teknium	9692ce2072	chore(release): map andrewho.sf@gmail.com -> andrewhosf Release-notes contributor attribution for the salvaged PR #13734 fix.	2026-04-27 06:42:32 -07:00
Teknium	008860a23f	fix(approval): close remaining prompt_toolkit deadlock vectors (#15216 ) PR #13734 fixed the concurrent-tool-executor vector (ThreadPoolExecutor workers didn't inherit the CLI's TLS approval callback). Two vectors remained that could still land in the deadlocking input() fallback: 1. _spawn_background_review spawns a raw threading.Thread with no approval callback installed, so any dangerous-command guard the review agent trips falls back to input() -> deadlock against the parent's prompt_toolkit TUI (same class as delegate_task subagents, fixed in `023b1bff1` / #15491). Install a _bg_review_auto_deny callback at thread start, clear on finally. 2. prompt_dangerous_approval's fallback unconditionally spawned a daemon thread calling input() when approval_callback was None. That fallback can never succeed under prompt_toolkit because the user's Enter goes to pt's raw-mode stdin capture. Detect an active pt Application via get_app_or_none() and fail closed (deny + log) instead, so future threads that forget to install a callback degrade gracefully instead of hanging 60s invisibly. Regression guards: - tests/run_agent/test_background_review.py verifies the review worker thread sees a callable auto-deny callback mid-run and that the slot is cleared in the finally block. - tests/tools/test_approval.py TestFailClosedUnderPromptToolkit verifies prompt_dangerous_approval returns 'deny' fast under a mocked pt Application, and that a real callback still wins over the guard.	2026-04-27 06:42:32 -07:00

... 5 6 7 8 9 ...

6567 commits