hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-21 10:22:18 +00:00

Author	SHA1	Message	Date
kshitijk4poor	69716a2e6f	docs(compression): fix stale 'discarded' wording on in_place config flag Review nit (yoniebans): the config.py comment still said compaction is 'lossy: the pre-compaction transcript is discarded, matching Claude Code / Codex' — leftover from the original destructive design. The shipped behavior is soft-archive: lossy for the LIVE context (what the model reloads), but the pre-compaction turns are kept on disk (active=0, compacted=1), searchable via session_search and recoverable. Comment now says so. Comment-only; no behavior change.	2026-06-20 10:57:07 -07:00
kshitijk4poor	854d75723f	fix(compression): keep compaction-archived turns discoverable in session_search Follow-up to the soft-archive durability fix. Reusing the rewind/undo active=0 flag for compaction-archived turns inherited the wrong search semantics: undo rows are intentionally HIDDEN from session_search (the user took them back), but compaction-archived turns must stay DISCOVERABLE — that is the whole point of Teknium's "searchable / recoverable" requirement. As built, search_messages defaulted to WHERE active=1, so after in-place compaction the pre-compaction turns were in the FTS index but filtered out of the default search. (The earlier "searchable" claim only held for a raw FTS query / include_inactive=True, not the actual session_search tool.) Empirically confirmed the gap: search 'HMAC' returned 2 hits before compaction, 1 after (only the summary's mention) — the originals were hidden. Fix — a `compacted` flag distinct from `active`, giving a 3-way state: - active=1, compacted=0 → live context (normal) - active=0, compacted=1 → compaction-archived: OUT of live context, IN search - active=0, compacted=0 → rewind/undo: OUT of live context, OUT of search Changes: - messages.compacted INTEGER NOT NULL DEFAULT 0 added to SCHEMA_SQL. Declarative _reconcile_columns adds it on existing DBs — no version bump (plain column add). - archive_and_compact: UPDATE … SET active=0, compacted=1 (was active=0 only). - search_messages: default WHERE active=1 → (active=1 OR compacted=1), on BOTH the main FTS5 path and the trigram CJK path. include_inactive=True still returns everything. The short-CJK LIKE fallback already returns all rows (no active filter) — unchanged. - Docstrings on archive_and_compact + search_messages document the 3-way state. Verified: after compaction, session_search default finds the archived originals (ids 1 & 4); rewind/undo rows stay hidden by default (recoverable via include_inactive); live context still excludes both. 322 in-place + hermes_state tests and 46 session_search tests green; ruff clean. Mutation check: reverting the search WHERE to active-only fails the new searchable test. (Surfaced by the question "is search semantic or only FTS?" — answer: session search is FTS5 keyword/BM25 only, no embeddings over the transcript; semantic retrieval lives in the optional memory-provider layer. Tracing that confirmed the active-only filter gap above.)	2026-06-20 10:57:07 -07:00
kshitijk4poor	4663456996	fix(compression): in-place compaction is non-destructive (soft-archive, not delete) Teknium review: keeping one durable session id must NOT come at the cost of destroying history. The prior in-place implementation used replace_messages, which hard-DELETEs the pre-compaction turns (they also drop out of the FTS index) — same id, but the original conversation is gone with no recovery path and the summary becomes the only record. Rotation today is non-destructive (the old session's full transcript survives under the old id); in-place must match that durability contract, not weaken it. Fix: compact in place by SOFT-ARCHIVING, reusing the existing messages.active flag (the /undo soft-delete mechanic), instead of deleting: - New SessionDB.archive_and_compact(session_id, compacted): in one atomic write, UPDATE messages SET active=0 on the live turns, then insert the compacted set as fresh active=1 rows. Nothing is deleted. - The insert loop is extracted into a shared _insert_message_rows() helper so archive_and_compact and replace_messages don't duplicate the 60-line column/encoding block (extend-don't-duplicate). - Agent in-place branch calls archive_and_compact instead of replace_messages. Durability outcome (proven by test + E2E across repeated compactions): - Live context load (get_messages_as_conversation / get_messages) filters active=1, so a resume reloads ONLY the compacted set — compaction still shrinks the live session. - The pre-compaction turns stay on disk at active=0, recoverable via get_messages(include_inactive=True) / restore_rewound. - They remain FTS-searchable: the messages_fts* triggers index on INSERT and remove on DELETE only — they do NOT key on active, and active=0 is a content-preserving UPDATE. session_search still finds them. - Verified across TWO successive compactions: the 1st compaction's originals are still recoverable + searchable after the 2nd (answers the "no recovery path after the next compaction" concern directly). message_count now reflects the LIVE (active/compacted) count, matching the live load. replace_messages keeps its DELETE semantics (still correct for /retry, /undo) and gains a docstring note pointing compaction at the non-destructive method. Tests: test_in_place_keeps_same_session_id strengthened to assert the 8 seeded originals survive at active=0 alongside the 2 compacted rows AND stay FTS-searchable. Mutation check: swapping archive_and_compact back to a hard DELETE fails the test, so the non-destructive contract is bound. 285 hermes_state + in-place tests green; rotation/persistence/compress-command/cli suites green; ruff clean.	2026-06-20 10:57:07 -07:00
kshitijk4poor	4f9485a95d	refactor(compression): tidy in-place compaction path (simplify pass) Parallel 3-reviewer cleanup of the in-place compaction code. Findings applied: - perf: in-place mode no longer pre-flushes current-turn messages. The flush ran INSERTs that the immediately-following replace_messages(compressed) DELETE+reinsert discarded -- pure wasted writes per compaction. The current-turn tail survives via the compressor's compressed output (protect_last_n), not the flush. Verified no data loss; rotation still pre-flushes (its old session row is preserved, so the flush is real there). - quality: hoist the two shared post-write steps (update_system_prompt + _last_flushed_db_idx = 0) below the if/else -- they ran in both branches against agent.session_id. Removes the easiest divergence bug. - quality: compute the compaction-boundary locals (_old_sid, _is_boundary, _boundary_parent) ONCE instead of recomputing locals().get('old_session_id') and the "_old_sid or agent.session_id or ''" chain three times. - quality: initialize compacted_in_place up front and assign agent._last_compaction_in_place directly, dropping the fragile locals().get('compacted_in_place') reflection. - reuse: parse the in_place config flag with utils.is_truthy_value (the project's canonical truthy coerce) instead of a hand-rolled str().lower() in {...} (agent_init already imports from utils). Dropped as false positives / out of scope: gateway getattr of agent internals (established session_id pattern), dual result-dict carry (mirrors history_offset etc.), stringly-typed "compression" (codebase-wide convention, no constant). Behavior-preserving: 7 in-place tests (incl. 2 new flush-guard tests) + 26 rotation/boundary/persistence/command tests green; mutation check confirms the durable-replace guard still binds (removing replace_messages fails the test); ruff clean. Added test_in_place_skips_redundant_preflush / test_rotation_still_preflushes to guard the perf change.	2026-06-20 10:57:07 -07:00
kshitijk4poor	1fbf48d4ad	fix(compression): make in-place compaction durable + rotation-independent end-to-end Review (Codex + 3-agent parallel) found the first cut of in-place mode was incomplete: it only updated the system prompt, so the persisted transcript stayed 'full history + summary' and the next turn/resume reloaded the full history and immediately re-compacted (a loop), and every downstream layer that keyed off session-id rotation silently no-op'd. The session_id was doing double duty as the 'compaction happened' signal. This wires the whole path so removing rotation is actually complete: Agent (agent/conversation_compression.py): - In-place now DURABLY replaces the transcript: replace_messages(session_id, compressed) on the same row (the canonical store the gateway reloads from), not just update_system_prompt. Resume reloads the compacted set; no loop. - Reset flush identity/cursor (_last_flushed_db_idx=0, _flushed_db_message_ids cleared) so next-turn appends diff against the compacted transcript. - Expose a rotation-independent signal: agent._last_compaction_in_place, and in_place=True on the session:compress event. - Fire the compaction-boundary hooks (context-engine on_session_start, memory manager on_session_switch, reason='compression') in BOTH modes — in-place passes the same id as parent so DAG/buffer state still checkpoints. Without this, memory/context plugins miss every in-place compaction. Gateway auto-compress (gateway/run.py): - Read agent._last_compaction_in_place; set history_offset=0 on rotation OR in-place (both return the compacted set, so slicing past the pre-compaction length would drop everything). Carry compacted_in_place in the result dict. - No extra rewrite needed: the agent shares the gateway's SessionDB, so its replace_messages already updated the canonical store load_transcript reads. Manual /compress (gateway/slash_commands.py): - The throwaway /compress agent has no _session_db, so rewrite_transcript is the durable write. Previously gated behind 'if rotated:' which treated 'id unchanged' as the #44794 data-loss failure case and SKIPPED the rewrite — making /compress a silent no-op in in-place mode. Now rewrites on rotated OR in_place; the data-loss guard still fires only for the genuine no-rotation-AND-not-in-place failure. Hygiene auto-compress already writes _compressed to the same id unconditionally (its agent has no _session_db, can't rotate) — correct for in-place, no change. Tests (tests/run_agent/test_in_place_compaction.py): - Assert the DURABLE transcript IS the compacted set after reload (get_messages_as_conversation == compacted), message_count==2, flush identity reset, and the rotation-independent signal set on in-place / unset on rotation. Rotation regression guard unchanged. Verified: 64 tests green across in-place + rotation/persistence/boundary/ concurrent/failure-sync/command/cli suites; E2E both modes (durable replace, gateway offset=0, rotation preserves old transcript); ruff clean. Still default-off.	2026-06-20 10:57:07 -07:00
kshitijk4poor	47fadc24d7	feat(compression): in-place compaction option that keeps one session id (#38763 ) Context compression today rewrites the message list AND rotates the session id — it ends the session, forks a parent_session_id child, and renumbers the title (name -> name #2). That moving identity key is the root cause of a whole bug cluster: /goal lost (#33618), pending response lost at the split (#14238), orphan sessions (#33907), TUI sid desync (#36777), FTS search gaps + duplicate sidebar entries (#45117), null continuation cwd (#42228), and title-rename dead-ends (#48989). It also forced a large defensive apparatus (compression lock, contextvar/env/ logging triple-sync, orphan finalization, gateway SessionEntry re-propagation, tip projection) whose only job is surviving a mid-conversation id change. Add a compression.in_place config flag (default False during rollout). When True, compaction rewrites the transcript and rebuilds the system prompt but keeps the SAME session_id: no end_session, no child row, no title renumber, no contextvar/logging re-sync, no memory/context-engine session-switch. The conversation keeps one durable id for life, like Claude Code / Codex. Compaction is lossy by design — the pre-compaction transcript is summarized away, not archived. The rotation path is unchanged when the flag is off (moved verbatim into an else branch). Staged rollout: this PR ships the option behind a default-off flag for live validation; a follow-up flips the default and deletes the now-redundant rotation machinery, superseding the 14 open band-aid PRs in this area. - hermes_cli/config.py: add compression.in_place (default False), documented - agent/agent_init.py: resolve the flag -> agent.compression_in_place - agent/conversation_compression.py: branch compress_context() on the flag - tests/run_agent/test_in_place_compaction.py: in-place invariants + rotation regression guard + config default The pre-flush of current-turn messages (#47202) runs in BOTH modes, so no boundary data loss. Prompt-cache invariant preserved: the system-prompt rebuild is the same single sanctioned invalidation that already happens during compaction — no NEW invalidation. Message alternation preserved.	2026-06-20 10:57:07 -07:00
teknium1	37a4dd4982	fix(auth): heal poisoned Nous inference URL on refresh instead of retaining it A nous inference_base_url that fails the host allowlist (e.g. a stale stg-inference-api.nousresearch.com persisted before the allowlist existed) was only replaced 'if refreshed_url:' — so when the validator rejected the URL it left the poisoned value in place. The 'falling back to default' warning fired but never took effect: every subsequent call, including the auxiliary compression call, kept hitting the dead staging endpoint and 401'd. Reset to DEFAULT_NOUS_INFERENCE_URL when validation returns None at both refresh sites in resolve_nous_runtime_credentials, so a poisoned auth.json self-heals on the next refresh. The proxy adapter already did this correctly; this brings the two auth.py sites in line.	2026-06-20 10:53:45 -07:00
teknium1	92d40c2553	chore(release): add IamSanchoPanza to AUTHOR_MAP Author email lacked a numeric-id prefix so the noreply auto-extraction misses it; map it explicitly for PR #43872 salvage.	2026-06-20 10:46:01 -07:00
Sancho	c884ff64ea	fix(agent): keep system-prompt model identity in sync across provider failover The session-stable system prompt embeds Model:/Provider: identity lines, but mid-turn failover (try_activate_fallback) swaps the runtime without touching them, so a fallback model misreports itself as the primary when asked "what model are you?". rewrite_prompt_model_identity() rewrites the last occurrence of each line on _cached_system_prompt when a fallback activates (and back on restore, byte-identical so the primary's prefix cache still hits). The rewrite is never persisted to the session DB. _sync_failover_system_message() patches the in-flight api_messages[0] at all 8 failover sites so the current turn ships the corrected identity. Cache-safe: the fallback's prefix cache is cold on a model switch anyway. Co-authored-by: Hermes Agent <noreply@nousresearch.com>	2026-06-20 10:46:01 -07:00
Teknium	11c6f4c7bc	feat(setup): Blank Slate setup mode — minimal agent, opt in to everything (#36733 ) * feat(setup): Blank Slate setup mode — minimal agent, opt in to everything Adds a third first-time setup option alongside Quick Setup and Full Setup. Blank Slate forces ON only what an agent needs to run — provider & model, the File Operations toolset, and the Terminal toolset — and turns everything else OFF, then walks the user through opting each capability back in. What it does: - platform_toolsets.cli = [file, terminal] (explicit, authoritative list) - agent.disabled_toolsets = every other known toolset (web, browser, code_execution, vision, memory, delegation, cronjob, skills, image_gen, kanban, …). Applied last in the resolver, so it overrides the non-configurable platform-toolset recovery that would otherwise re-add toolsets like kanban — guaranteeing a true blank slate. - Optional config features off: compression, memory + user-profile capture, checkpoints, smart model routing, auto session reset. - Bundled skills default to NONE (reuses the .no-bundled-skills marker); offers to seed the full catalog. - Walks through tools / plugins / MCP / messaging, all opt-in. Proven end-to-end: with the Blank Slate config, model_tools.get_tool_definitions emits exactly 6 schemas — patch, process, read_file, search_files, terminal, write_file. Nothing else reaches the model. Re-enable later via hermes tools / hermes skills opt-in --sync / hermes setup agent. Tests: tests/hermes_cli/test_setup_blank_slate.py (8 tests) pin the writers, the resolver invariant ({file, terminal}), and the 6-schema end-to-end set. Docs: getting-started/quickstart.md documents all three setup modes. * feat(setup): Blank Slate fork — finish minimal, or walk through configs After applying the minimal baseline (provider/model + file + terminal, everything else off), Blank Slate now presents a choice instead of always running the full walkthrough: 1. Start with everything disabled — finish now with the minimal agent. 2. Walk through all configurations — opt in to tools, skills, plugins, MCP, and messaging. Provider/model and terminal are still configured first either way (the agent can't run without them). The finish-now path records the bundled-skill opt-out so future `hermes update` runs don't re-inject skills. The walkthrough body moved to a separate _blank_slate_walkthrough() helper. Tests: TestBlankSlateFork covers both branches (finish-now applies baseline + skill opt-out and skips the walkthrough; walkthrough path invokes it). Docs updated to describe the fork.	2026-06-20 10:45:55 -07:00
teknium	838daca9f4	chore(desktop): format tooltip indentation + author map for #49697 Re-indent the salvaged title= lines to spaces (prettier), and map alelpoan@proton.me in the release author map.	2026-06-20 10:45:14 -07:00
alelpoan	404fe730b7	fix: add tooltips to right sidebar header buttons	2026-06-20 10:45:14 -07:00
Teknium	c329279482	test: retarget source-path refs to migrated plugin paths test_telegram_webhook_secret reads telegram adapter source by path; point it at plugins/platforms/telegram/adapter.py. test_windows_native_support npm-spawn parametrization referenced gateway/platforms/whatsapp.py; point it at plugins/platforms/whatsapp/adapter.py.	2026-06-20 10:26:45 -07:00
Teknium	5600105478	refactor(gateway): migrate slack/dingtalk/whatsapp/matrix/feishu/telegram/wecom/email/sms adapters to bundled plugins Salvage of PR #41284 onto current main. Relocates the last 9 inline messaging adapters (+ satellites: telegram_network, feishu_comment/_rules/meeting_invite, wecom_crypto, wecom_callback) from gateway/platforms/ into self-contained bundled plugins under plugins/platforms/<x>/, discovered via the platform registry. Strips the per-platform core touchpoints from gateway/run.py, gateway/config.py, hermes_cli/gateway.py, hermes_cli/setup.py, and tools/send_message_tool.py. Carries forward the migration fixes (explicit enabled:false honored, get_connected_platforms forces discovery, plugin is_connected via gateway.get_env_value, logs --component gateway matches plugins.platforms.*, matrix hidden on Windows). Additionally ports config keys main added since the PR base: the matrix plugin's _apply_yaml_config now also covers allowed_users, ignore_user_patterns, process_notices, and session_scope (the inline gateway/config.py matrix block gained these in the 1340 commits the PR sat open; they would otherwise have been silently dropped on deletion).	2026-06-20 10:26:45 -07:00
kshitij	2ab09a6c50	Merge pull request #49680 from NousResearch/fix/signal-quote-cache-eviction fix(signal): FIFO-evict the quote-detection timestamp cache (follow-up to #49678)	2026-06-20 21:06:01 +05:30
kshitijk4poor	26d9a3c710	fix(signal): FIFO-evict the quote-detection timestamp cache `_sent_message_timestamps` (the reply-to-own-message quote cache) used a `set` evicted with `set.pop()`, which removes an ARBITRARY element — so once more than the cap (500) outbound timestamps are tracked, a still-recent timestamp could be dropped while older ones survive, missing a genuine reply-to-own-message. Convert it to an OrderedDict with FIFO (oldest-first) eviction, mirroring the recently-hardened echo ring (#31250). This closes the same bug class on the sibling cache. Adds a regression test asserting oldest-first eviction + MRU promotion.	2026-06-20 21:00:46 +05:30
kshitij	85ad7c9b0a	Merge pull request #49678 from NousResearch/salvage/signal-echo-ring fix(signal): salvage echo-ring LRU+TTL hardening (#31250)	2026-06-20 20:56:43 +05:30
kshitijk4poor	e49272fe53	chore(release): map w31rdm4ch1nZ contributor email to GitHub login	2026-06-20 20:51:41 +05:30
kshitijk4poor	2f86283217	test(signal): update echo-discard test for OrderedDict ring The hardened echo ring (#31250) changes _recent_sent_timestamps from a set to an OrderedDict, so the reply-detection-cache regression test from the quote salvage can no longer call .discard(); route it through the new _consume_sent_timestamp() helper, which is the real echo-removal path.	2026-06-20 20:51:01 +05:30
w31rdm4ch1nZ	332f88f6a6	fix(signal): harden recently-sent echo ring with LRU + TTL	2026-06-20 20:50:52 +05:30
kshitij	b88d0007c9	Merge pull request #49583 from NousResearch/salvage/signal-mention-typing fix(signal): salvage self-mention strip + explicit stop-typing RPC (batch of #31217, #40054)	2026-06-20 16:32:30 +05:30
kshitijk4poor	32a97a20af	fix(signal): strip self-mention in all groups, not just require_mention Review follow-up on the salvaged self-mention strip (#31217): the original only stripped the bot's rendered @<number>/@<uuid> self-mention inside the `require_mention=true` branch, so groups with require_mention=false still leaked it into the agent text. Hoist the strip to run for every group message (fixing the whole bug class), and collapse the doubled space a mid-sentence removal leaves while preserving intentional newlines.	2026-06-20 16:27:28 +05:30
kshitijk4poor	ef7e716930	chore(release): map rratmansky contributor email to GitHub login	2026-06-20 16:24:15 +05:30
Kailigithub	40b6ac9ac7	fix(signal): send explicit stop-typing RPC when cancelling indicator	2026-06-20 16:23:41 +05:30
Rick Ratmansky	96b10327b6	fix(signal): strip bot self-mention from group messages before agent dispatch	2026-06-20 16:23:41 +05:30
kshitij	65561e9de6	Merge pull request #49563 from NousResearch/salvage/signal-quote-history Some checks failed Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build and Publish / build-amd64 (push) Waiting to run Details Docker Build and Publish / build-arm64 (push) Waiting to run Details Docker Build and Publish / merge (push) Blocked by required conditions Details Lint (ruff + ty) / ruff + ty diff (push) Waiting to run Details Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run Details Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run Details Tests / test (1) (push) Waiting to run Details Tests / test (2) (push) Waiting to run Details Tests / test (3) (push) Waiting to run Details Tests / test (4) (push) Waiting to run Details Tests / test (5) (push) Waiting to run Details Tests / test (6) (push) Waiting to run Details Tests / save-durations (push) Blocked by required conditions Details Tests / e2e (push) Waiting to run Details Typecheck / typecheck (apps/bootstrap-installer) (push) Waiting to run Details Typecheck / typecheck (apps/desktop) (push) Waiting to run Details Typecheck / typecheck (apps/shared) (push) Waiting to run Details Typecheck / typecheck (ui-tui) (push) Waiting to run Details Typecheck / typecheck (web) (push) Waiting to run Details Typecheck / desktop-build (push) Waiting to run Details OSV-Scanner / Scan lockfiles (push) Has been cancelled Details uv.lock check / uv lock --check (push) Has been cancelled Details fix(signal): salvage quoted-reply context (#46388)	2026-06-20 15:36:02 +05:30
lkz-de	96db7c6883	fix(signal): preserve quoted reply context Carry Signal quote metadata through gateway events so replies to assistant messages include the quoted context without personalizing comments.	2026-06-20 15:16:53 +05:30
kshitij	ff50a88617	Merge pull request #49558 from NousResearch/salvage/env-var-guards-48735	2026-06-20 15:11:54 +05:30
kshitij	834bbae895	Merge pull request #49530 from NousResearch/salvage/signal-trio refactor(signal): salvage AAC voice-note remux + shared markdown formatting (batch of #47766, #46386)	2026-06-20 15:08:23 +05:30
kshitijk4poor	467c879b2e	chore(release): map lkz-de contributor email to GitHub login The contributor-check CI auto-resolves only the +id form of GitHub noreply emails; lkz-de's commits use the legacy plain form (lkz-de@users.noreply.github.com), so add an explicit AUTHOR_MAP entry.	2026-06-20 15:03:29 +05:30
kshitijk4poor	a7dd98c860	fix(env): guard remaining malformed int/float env var casts with utils helpers Widen the env_float() guard from #48735 across the whole bug class: a non-numeric value (e.g. a stale .env "HERMES_API_TIMEOUT=abc" or a typo'd port) raised an unhandled ValueError and crashed adapter/agent init. Converts 22 genuinely-unguarded first-party int/float(os.getenv()) sites to the canonical utils.env_int / utils.env_float helpers (the established house pattern), instead of duplicating per-module helpers or inline try/except: - gateway/config.py: WECOM_CALLBACK_PORT, BLUEBUBBLES_WEBHOOK_PORT - gateway/platforms/email.py: EMAIL_IMAP/SMTP_PORT, EMAIL_POLL_INTERVAL - gateway/platforms/feishu.py: dedup cache + text/media batch settings - gateway/platforms/wecom.py, discord/adapter.py: text batch delays - gateway/platforms/telegram.py: media batch delay, TELEGRAM_WEBHOOK_PORT - gateway/platforms/whatsapp.py: WHATSAPP_NPM_INSTALL_TIMEOUT - hermes_cli/auth.py: CODEX/XAI refresh timeouts - agent/chat_completion_helpers.py: API/stream read/stale timeouts - run_agent.py, agent/auxiliary_client.py: API + nous timeouts Sites already guarded by try/except or local helpers are left untouched. The HERMES_MAX_ITERATIONS sites are already guarded on main via _current_max_iterations(), so they are not included.	2026-06-20 14:54:36 +05:30
xxxigm	7eb9678c54	test(desktop): cover link-title window audio muting Verify createLinkTitleWindow mutes audio (regression guard for #49505) and keeps the hardened offscreen defaults, and register the new test file in the desktop platforms test script.	2026-06-20 14:53:05 +05:30
xxxigm	ae8db1ab53	fix(desktop): mute hidden link-title window so historical links don't autoplay audio Tier-2 link-title resolution loads the URL in an offscreen BrowserWindow to read its <title> when curl can't. That window was never muted, so pages that autoplay media (e.g. YouTube `watch` URLs) leaked ~2s of audio every time a session containing such links was re-rendered. Move the window creation into a dedicated helper that calls `webContents.setAudioMuted(true)` immediately after construction, so the offscreen probe can never emit sound. Fixes #49505	2026-06-20 14:53:05 +05:30
kshitijk4poor	abafba0762	refactor(signal): correct STT-fallback comment, type the markdown wrapper, make AAC test portable Review follow-up on the salvaged AAC + markdown changes: - Fix an inaccurate comment claiming the STT layer has a sniff-and-remux fallback (verified: no such fallback exists; the ffmpeg-absent path caches raw ADTS and STT may reject it). - Type the _markdown_to_signal wrapper as tuple[str, list[str]] to match the shared helper instead of a bare tuple. - Replace the hardcoded /home/pi/... test fixture with a runtime-generated ADTS AAC sample so the remux round-trip actually runs in CI (skips only when ffmpeg is absent) instead of always-skipping.	2026-06-20 14:24:29 +05:30
annguyenNous	06ca1e9980	fix(utils): add env_float helper for safe float env var parsing Mirrors the existing env_int() helper: returns the default when the variable is unset or non-numeric instead of raising ValueError. Used by the follow-up commit to guard malformed float env vars across the gateway. Salvaged from #48735 (@annguyenNous). The PR's api_server.py change is now redundant — main guards HERMES_MAX_ITERATIONS via _current_max_iterations().	2026-06-20 14:00:07 +05:30
jasnoorgill	da34fca2bb	fix(signal): detect ADTS AAC voice notes and remux to MP4 Android Signal delivers voice notes as raw ADTS AAC frames, which share the `0xFF 0xFx` sync word with MPEG-1/2 Layer 3 (MP3). The `_guess_extension` byte-signature test in gateway/platforms/signal.py was matching both, so ADTS AAC was being misclassified as MP3 — saved to disk with the wrong extension and rejected by every major STT API (Groq, OpenAI) because their server-side format sniffers inspect the actual codec, not the file extension. Two changes: 1. Tighten the MP3 vs ADTS disambiguator. ADTS packs `ID`, `layer`, and `protection_absent` into bits 3-0 of byte 1, where `ID=0` and `layer=00` for AAC. Real MP3 has `ID=1` and `layer` in {01, 10, 11}. The mask `0xF6` against target `0xF0` cleanly separates them. 2. Remux raw ADTS AAC to MP4 container at the cache step via `ffmpeg -c:a copy`. Single demux/remux, no re-encode, no quality loss, sub-100ms on a Pi 5. The cached file is a normal `.m4a` that all major STT providers accept. ffmpeg is a transitive dependency of many other Hermes features (TTS, video skills) so this isn't a new install requirement; the remux degrades gracefully to a no-op if ffmpeg is missing. The new helper `_remux_aac_to_m4a` is unit-tested with a real Android voice note from the audio cache that originally triggered the bug, plus synthetic ADTS frames for the byte-level disambiguator and garbage-input graceful failure. Closes the gap that broke transcription for any Android Signal user sending voice messages to Hermes.	2026-06-20 13:48:05 +05:30
lkz-de	905820b59f	fix(signal): share markdown formatting across send paths Route Signal send paths through shared markdown formatting helpers and render markdown bullets consistently as Unicode bullets. Add coverage for Signal formatting and send_message integration.	2026-06-20 13:47:14 +05:30
brooklyn!	15852722d4	feat(desktop): pop the composer out into a draggable floating window (#49488 ) * feat(desktop): pop the composer out into a draggable floating window Gesture-driven: drag the docked composer up to peel it out, drag it back to the bottom-center dock zone (radial glow ramps with proximity) to redock, and double-click the grab area to toggle. Floating composer is compact, grows upward as it wraps, and can be moved by its 5px transparent grab platform (diagonal hatch on hover). Position + popped state persist; secondary windows always start docked. rAF-coalesced drag, persisted only on release. * fix(desktop): keep floating composer radius consistent with docked * fix(desktop): composer popout polish — peel-off placement, panels, chip editing - Peel-off undock drops the floating composer under the cursor (centered horizontally, preserving the vertical grab offset) instead of snapping to the docked corner. - Unify the / · @ · ? completion drawer and the attach (+) menu onto one shared glassy panel primitive (composerPanelCard): smallest theme font, hairline border, nous shadow; floats off the composer, inset from the left. - Directive chips: Backspace removes the chip + its auto-inserted trailing space atomically (no orphaned space), and a phantom trailing block left by contenteditable no longer falsely expands the composer to two rows. - Model picker: scroll area capped at max(150px, 30dvh); footer rows aligned (matching icons, dropped a redundant margin). - Composer focus shifts the border ~15% toward foreground (no fill change); input is cursor-text; trimmed control icon/button sizes.	2026-06-20 02:16:25 -05:00
Brooklyn Nicholson	eed78d6ebb	fix(desktop): composer popout polish — peel-off placement, panels, chip editing - Peel-off undock drops the floating composer under the cursor (centered horizontally, preserving the vertical grab offset) instead of snapping to the docked corner. - Unify the / · @ · ? completion drawer and the attach (+) menu onto one shared glassy panel primitive (composerPanelCard): smallest theme font, hairline border, nous shadow; floats off the composer, inset from the left. - Directive chips: Backspace removes the chip + its auto-inserted trailing space atomically (no orphaned space), and a phantom trailing block left by contenteditable no longer falsely expands the composer to two rows. - Model picker: scroll area capped at max(150px, 30dvh); footer rows aligned (matching icons, dropped a redundant margin). - Composer focus shifts the border ~15% toward foreground (no fill change); input is cursor-text; trimmed control icon/button sizes.	2026-06-20 02:10:38 -05:00
kshitijk4poor	a6f08ff0c8	docs(delegate): clarify subagent model is config-level, not per-call delegate_task has never exposed a per-call model parameter (removed intentionally in `fb0f579b1`). The tool description gave no hint about how subagent model is actually controlled, so users kept expecting a model arg and filing it as a dropped/ignored param (e.g. #49332, #23467). Add one bullet to the dynamically-built tool description stating that children inherit the parent model + fallback chain, and that pinning all subagents to a specific model is done via delegation.provider / delegation.model in config.yaml. No behavior change.	2026-06-20 12:13:39 +05:30
Brooklyn Nicholson	f697c97e02	fix(desktop): keep floating composer radius consistent with docked	2026-06-20 01:36:29 -05:00
Brooklyn Nicholson	236f0597e5	feat(desktop): pop the composer out into a draggable floating window Gesture-driven: drag the docked composer up to peel it out, drag it back to the bottom-center dock zone (radial glow ramps with proximity) to redock, and double-click the grab area to toggle. Floating composer is compact, grows upward as it wraps, and can be moved by its 5px transparent grab platform (diagonal hatch on hover). Position + popped state persist; secondary windows always start docked. rAF-coalesced drag, persisted only on release.	2026-06-20 01:35:30 -05:00
helix4u	c253b07380	fix(model): clear stale endpoint credentials across switches	2026-06-19 19:58:26 -07:00
helix4u	95a3affc2e	fix(model): keep Nous picker from restoring stale custom keys	2026-06-19 19:58:26 -07:00
Harish Kukreja	1b7b4d138a	fix(desktop): handle slash exec dispatch payloads (#49358 )	2026-06-19 21:11:16 -05:00
Gille	857d0244af	fix(tui): handle dispatch payloads from slash exec (#49337 )	2026-06-19 20:05:58 -05:00
Teknium	cf58f1a520	feat(titles): support language-aware title generation (#45296 ) Make auxiliary title prompts match the user language by default, with an optional pinned `auxiliary.title_generation.language` config.	2026-06-19 17:15:52 -07:00
ruangraung	8cf7df867e	fix(plugins): silence raft check_fn log spam for users without raft CLI The raft platform plugin's check_raft_requirements() logged a WARNING every time it returned False. Since check_fn is called on every load_gateway_config() (~every 10s during normal gateway operation), users who don't have the raft CLI installed get their logs flooded with no way to suppress it — hermes plugins disable doesn't work for bundled platform plugins, and platforms.raft.enabled: false doesn't gate the check_fn call. Fix: make check_raft_requirements() a silent predicate (return True/False only, no logging), matching the convention documented and used by other platform adapters (e.g. teams/adapter.py). The caller in gateway/platform_registry.py create_adapter() already emits its own warning when requirements aren't met and an adapter is actually requested — that's the correct place for a user-facing warning (fires once per connect attempt, not once per config load). Fixes #49234	2026-06-19 17:12:58 -07:00
joaomarcos	75ed07ace8	fix(gateway): break the restart loop at the source on session resume When a tool call itself restarts the gateway (docker restart, systemctl restart, and similar), the process is terminated mid-call — before the tool result is persisted and before the orderly drain rewind can run. The transcript tail is left as an assistant(tool_calls) with no matching tool answer. On resume the model re-issues the unanswered call, taking the gateway down again — an infinite loop (#49201). Source fix: _build_gateway_agent_history now strips a trailing assistant(tool_calls) block that has no tool answers (_strip_dangling_tool_call_tail), so there is nothing for the model to re-execute. This complements _strip_interrupted_tool_tails, which only handles the case where a tool result row exists with an interrupt marker. Cognitive backstop: the resume-pending system note now states that any restart command in the history already ran and must not be re-executed or verified, and the empty-message auto-resume startup turn reports recovery and asks for instructions instead of the nonsensical "address the user's NEW message" (there is no new message on that turn). Reimplements the intent of #49243 by @JoaoMarcos44 at the replay layer. Fixes #49201	2026-06-19 16:59:58 -07:00
teknium1	6504f51cd5	chore: add @hakanpak to AUTHOR_MAP for PR #49282 salvage	2026-06-19 16:59:54 -07:00

1 2 3 4 5 ...

12263 commits