hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-27 17:58:07 +00:00

Author	SHA1	Message	Date
Teknium	4474873d2c	feat(cli): persist resolved approval/clarify prompts in scrollback (#44702 ) Some checks failed Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build and Publish / build-amd64 (push) Waiting to run Details Docker Build and Publish / build-arm64 (push) Waiting to run Details Docker Build and Publish / merge (push) Blocked by required conditions Details Lint (ruff + ty) / ruff + ty diff (push) Waiting to run Details Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run Details Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run Details Nix Lockfile Fix / auto-fix-main (push) Waiting to run Details Nix Lockfile Fix / fix (push) Waiting to run Details Nix / nix (macos-latest) (push) Waiting to run Details Nix / nix (ubuntu-latest) (push) Waiting to run Details OSV-Scanner / Scan lockfiles (push) Waiting to run Details Tests / test (1) (push) Waiting to run Details Tests / test (2) (push) Waiting to run Details Tests / test (3) (push) Waiting to run Details Tests / test (4) (push) Waiting to run Details Tests / test (5) (push) Waiting to run Details Tests / test (6) (push) Waiting to run Details Tests / save-durations (push) Blocked by required conditions Details Tests / e2e (push) Waiting to run Details Typecheck / typecheck (apps/bootstrap-installer) (push) Waiting to run Details Typecheck / typecheck (apps/desktop) (push) Waiting to run Details Typecheck / typecheck (apps/shared) (push) Waiting to run Details Typecheck / typecheck (ui-tui) (push) Waiting to run Details Typecheck / typecheck (web) (push) Waiting to run Details uv.lock check / uv lock --check (push) Waiting to run Details Build Skills Index / build-index (push) Has been cancelled Details Build Skills Index / trigger-deploy (push) Has been cancelled Details Modal prompt panels (dangerous-command approval, clarify questions) live in the prompt_toolkit layout and vanish on the next repaint, leaving no trace of the question or the decision in chat history. Emit a dim one-line summary after each prompt resolves: ⚠ Approval: <command> → allowed for session ? Clarify: <question> → <answer> Gated on display.persist_prompts (default true). Detail and outcome are whitespace-collapsed and capped at 120 chars.	2026-06-12 01:14:35 -07:00
Tranquil-Flow	286ecd26d8	fix(agent): strip MEDIA directives from compressor summarizer input (#14665 )	2026-06-12 01:14:28 -07:00
Teknium	74180ebf0b	fix(gateway): classify SimpleX non-image/non-audio files as DOCUMENT SimpleX tagged unknown files application/octet-stream in media_types but classification only handled audio/image, leaving msg_type TEXT — run.py never injected the document context. Same bug class as #12845.	2026-06-12 01:07:50 -07:00
Teknium	f03f161b39	fix(gateway): classify email document attachments as DOCUMENT Email cached document attachments and placed them in media_urls, but msg_type only flipped on image attachments — documents stayed TEXT and run.py's document-context injection (gated on MessageType.DOCUMENT) silently dropped them. Same bug class as Signal #12845. DOCUMENT wins over PHOTO for mixed attachments since image handling keys off per-path mime types while document injection gates strictly on message_type.	2026-06-12 01:07:50 -07:00
Teknium	1e29ab38c7	fix(gateway): classify Signal video attachments + catch-all DOCUMENT fallback Widen the salvaged #12851 fix to match the established classification pattern (WhatsApp/Slack/BlueBubbles/Mattermost): video/* -> VIDEO, and any remaining MIME type falls through to DOCUMENT instead of TEXT, so exotic types still trigger run.py's document-context injection.	2026-06-12 01:07:50 -07:00
Kyle Dunn	8e821cd2f5	test(gateway): verify Signal inbound text attachment sets MessageType.DOCUMENT	2026-06-12 01:07:50 -07:00
Kyle Dunn	ffef9da9b7	test(gateway): verify Signal inbound PDF attachment sets MessageType.DOCUMENT	2026-06-12 01:07:50 -07:00
teknium1	05470aa1b6	feat(messaging): expose action='unreact' in send_message + react dispatch tests Follow-up for salvaged PR #44486: the adapter shipped remove_reaction but the tool only exposed 'react'. Generalize _handle_react(remove=) and add tool-level dispatch tests for react/unreact (missing from the original PR).	2026-06-12 01:07:38 -07:00
underthestars-zhy	a652131c42	fix(photon): stop gateway restarts from orphaning the sidecar on its port A hard gateway exit (crash, SIGKILL, supervisor restart) left the detached Node sidecar running with a token the next gateway run doesn't know, so it could never be told to /shutdown. Every replacement spawn then died on EADDRINUSE, failing each 30→300s reconnect attempt while the orphan kept consuming the inbound gRPC stream. Two layers: - Lifetime binding: the adapter now holds the sidecar's stdin as a pipe, and the sidecar (PHOTON_SIDECAR_WATCH_STDIN=1) shuts down on stdin EOF — fired by the OS on any parent death, including SIGKILL. - Startup reaping: before spawning, the adapter probes the port and terminates a stale listener, but only after verifying its command line is a Photon sidecar; a foreign listener raises a clear error instead of being signalled. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-12 01:07:38 -07:00
underthestars-zhy	573c4e6511	feat(photon): upgrade to spectrum-ts 3.0.0 (pinned) with markdown + reactions Pin spectrum-ts to exactly 3.0.0 (was ^1.18.0 plus an `npm install spectrum-ts@latest` on every setup) so breaking SDK majors can't take down fresh installs silently; `hermes photon setup` now runs `npm ci`. Upgrade procedure documented in the README. Migrate resolveSpace to the v3 namespace API: `im.space.create(phone)` for DMs and `im.space.get(id)` for everything else — group spaces are now rehydratable from their persisted id after a sidecar restart, which v1 could not do. Markdown: replies go out via the v3 `markdown()` builder (iMessage renders natively; other Spectrum platforms degrade to plain text). `PHOTON_MARKDOWN=false` reverts to the stripped plain-text path. Reactions, behind PHOTON_REACTIONS (default off): lifecycle tapbacks (👀 while processing, 👍/👎 on completion) via new sidecar /react and /unreact endpoints with per-target reaction-handle tracking, and user tapbacks on bot-sent messages routed to the agent as synthetic `reaction:added:<emoji>` events. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-12 01:07:38 -07:00
Teknium	c196269d8d	fix(credits): suppress usage gauge when top-up funds exist + add display.credits_notices toggle (#44716 ) The subscription-cap usage gauge (50/75/90% bands) ignored purchased (top-up) credits: a sub user with top-up funds got a sticky warn banner at 90% of their cap — permanently at >=100%, alongside grant_spent — despite being fully able to keep inferencing. The cap is the wrong denominator for an account that can keep spending. - evaluate_credits_notices: purchased_micros > 0 suppresses the usage band (grant_spent already covers the cap-reached + top-up case with the remaining balance). A top-up landing mid-session clears any showing band; spending top-up down to 0 resumes the gauge. - New display.credits_notices config (default true): false silences all credits notices. State capture and /usage are unaffected. Read once per agent (cached) in _emit_credits_notices, fail-open true. - Docs: configuration.md display block.	2026-06-12 01:06:46 -07:00
kshitijk4poor	15439bee47	refactor(memory): reuse _summarize_user_message_for_log instead of forking it The original fix added agent/memory_manager.py:flatten_message_content, but that helper was a near-exact duplicate of agent/codex_responses_adapter.py:_summarize_user_message_for_log — same None/str/list dispatch, same {text,input_text,output_text}/{image_url,input_image} part sets, the identical [N image(s)] marker, and the same str() fallback. The only difference was the join separator (newline for memory vs space for the log/trajectory previews the existing helper already serves), and that helper is already imported into agent/turn_finalizer.py — the same file whose call site the memory fix touches. Parameterize the existing helper with sep=' ' (default preserves every current logging/trajectory caller byte-for-byte) and call it with sep='\n' at the memory boundary; drop the forked flatten_message_content. Repoints the unit tests to the consolidated helper and adds a case locking the default space-join. Single source of truth for multimodal-content flattening; no behavior change for the fix or for existing callers.	2026-06-12 12:49:18 +05:30
Erosika	87893fe4cb	fix(memory): flatten multimodal content before provider sync Multimodal turns carry message content as a list of typed parts ({type: "text"\|"image_url", ...}). _sync_external_memory_for_turn passed that list straight into MemoryManager.sync_all, and providers feed it to regexes — Honcho's sync_turn calls sanitize_context, where re.sub raised 'expected string or bytes-like object, got list'. Every turn with an attached image silently never synced. Flatten to plain text at the boundary: text parts joined, images noted as an [N image(s)] marker so the attachment isn't erased from recall. Fixing here covers all providers instead of patching each plugin. (cherry picked from commit `705bdb6ffe`)	2026-06-12 12:46:28 +05:30
brooklyn!	d810f2b262	Merge pull request #44676 from NousResearch/bb/fix-schema-ref-default fix(tools): strip default from $ref nodes in tool schemas	2026-06-12 01:21:14 -05:00
墨綠BG	81cdbbddc8	🐛 fix(cli): wrap approval preview hints	2026-06-11 23:05:08 -07:00
墨綠BG	d6df38bb6b	🐛 fix(cli): wrap long approval commands in prompt	2026-06-11 23:05:08 -07:00
Teknium	c7bee8f961	refactor(agent): drop unused tail_start param from _derive_auto_focus_topic The parameter was reserved-but-unused (del'd immediately); YAGNI. Test call site updated.	2026-06-11 23:03:52 -07:00
konsisumer	434c684bfa	fix(agent): focus automatic compression on recent user turns	2026-06-11 23:03:52 -07:00
Teknium	db7714d5f1	Merge pull request #44331 from NousResearch/hermes/hermes-6b48295e feat(whatsapp): WhatsApp Business Cloud API adapter (salvage #43921)	2026-06-11 22:48:06 -07:00
Kyssta	a942bfd9cc	fix(gateway): reset _last_flushed_db_idx when reusing cached agent (#44327 ) (#44518 ) Co-authored-by: kyssta-exe <kyssta-exe@users.noreply.github.com>	2026-06-11 22:41:34 -07:00
Brooklyn Nicholson	b2d151abe2	fix(tools): strip default from $ref nodes in tool schemas Fireworks-hosted Kimi rejects tool requests when nullable MCP/Pydantic schemas collapse to {"$ref": "...", "default": null}. Strip that sibling during global schema sanitization so gateway and CLI calls succeed again.	2026-06-12 00:30:51 -05:00
kshitijk4poor	44bd478039	fix(plugins): credit shared hook/middleware/tool names to every plugin list_plugins() attribution diffed registry names against all already-loaded plugins, so when a plugin registered a hook / middleware / tool name an earlier plugin had already used, the shared name was credited to the first plugin only and later plugins under-reported (0 hooks) in hermes plugins list. commands_registered right beside it already attributed correctly by plugin ownership. Snapshot per-registry counts before register() and attribute the entries this plugin's register() actually added (per-registration delta). Add a regression test: two plugins registering the same hook name are each credited with 1 hook.	2026-06-12 10:57:25 +05:30
kshitijk4poor	889a13696b	fix(plugins): clear _plugin_platform_names on force-rediscover discover_and_load(force=True) cleared every per-plugin registry except _plugin_platform_names, which register_platform() populates. A platform plugin disabled between force-rediscovers left a stale name behind, so the set diverged from the real platform_registry / _plugins state and never shrank across repeated force passes. Add the missing clear() and a regression test that seeds every per-plugin registry, forces a rediscover, and asserts they all empty (so a future registry addition can't silently leak across a force pass either).	2026-06-12 10:55:44 +05:30
Veritas-7	82d570165e	fix(slack): ack reaction lifecycle events Register no-op Slack event handlers for inbound reaction_added and reaction_removed events so Slack Bolt does not log unhandled-request warnings for events Hermes does not consume.	2026-06-12 10:54:07 +05:30
Brad Smith	08e8bedae8	fix(gateway): keep plugin action wrapper signature to (ack, body, action) The previous implementation captured loop vars via default arguments:: async def _wrapped(ack, body, action, _cb=_cb, _plugin_name=_plugin_name): slack_bolt's ``kwargs_injection`` introspects each listener's signature via ``inspect.signature`` and passes ``None`` for any parameter name it doesn't recognise (see ``slack_bolt/kwargs_injection/async_utils.py`` ``build_async_required_kwargs``). That clobbered ``_cb`` to ``None`` at dispatch time, so the wrapped plugin handler became ``NoneType`` — ``await _cb(...)`` then raised ``'NoneType' object is not callable`` and no plugin action handler ever fired. Replace the default-arg trick with a small closure factory so the wrapper's public signature is exactly ``(ack, body, action)``. Add a regression test that introspects the wrapped function's signature. Found via real Slack click on a Block Kit button registered through ``ctx.register_slack_action_handler`` — gateway log showed ``[Slack] Plugin 'None' action handler raised: 'NoneType' object is not callable`` despite the registration log line confirming the handler was wired. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-12 10:36:14 +05:30
Brad Smith	62e937bf2b	feat(plugins): expose register_slack_action_handler API Plugins that post Block Kit messages with interactive elements (buttons, overflow menus, datepickers, etc.) had no documented way to receive the resulting click events. The plugin API exposed register_tool, register_hook, register_command, register_platform, and register_context_engine, but nothing for slack_bolt action handlers. The only workaround was to monkey-patch SlackAdapter.connect from inside register(), which is fragile and breaks on every Hermes update. This change adds: * PluginContext.register_slack_action_handler(action_id, callback) — validates inputs and queues the handler on the PluginManager. action_id accepts whatever slack_bolt.App.action() accepts (literal string, compiled re.Pattern, or constraint dict). * PluginManager.get_slack_action_handlers() — accessor used by the Slack adapter at connect time. * SlackAdapter.connect — after wiring its built-in approval and slash-confirm buttons, iterates the plugin-registered handlers and registers each via self._app.action(matcher)(callback). Each callback is wrapped defensively so a misbehaving plugin cannot crash slack_bolt's dispatch loop, with a best-effort ack on exception so Slack stops retrying the click. * Defensive fallback when the plugin layer is unhealthy: a RuntimeError from get_plugin_manager() is logged and swallowed rather than blocking the gateway from starting. * Test coverage in tests/gateway/test_slack_plugin_action_handlers.py for input validation, multi-plugin registration, the connect-time wiring, defensive exception handling, and the plugin-loader- failure fallback path. * Documentation in website/docs/guides/build-a-hermes-plugin.md describing the new API alongside the existing register_command / dispatch_tool documentation. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-12 10:36:14 +05:30
Brooklyn Nicholson	ab06ef8ed6	fix(coding): teach agents terminal env state persists Tell coding agents to activate shell setup once per session instead of re-sourcing it before every command, and pin the existing LocalEnvironment env-snapshot behavior with regression tests.	2026-06-11 19:50:08 -05:00
brooklyn!	afe53708ee	Merge pull request #44545 from NousResearch/hermes-worktree-code fix(coding): don't expose primary worktree path in coding context	2026-06-11 19:35:18 -05:00
Teknium	5affecb443	fix(mcp): capability-gate tools/list so prompt-only MCP servers can connect (#44550 ) Port from anomalyco/opencode#31271: only call tools/list when the server advertises the 'tools' capability in InitializeResult.capabilities. Previously, _discover_tools() unconditionally called session.list_tools() right after initialize. Prompt-only / resource-only servers (which omit the tools capability per the MCP spec) raise McpError(-32601 Method not found), which aborted the connection — burning all 3 initial-connect retries and permanently failing the server even though its prompts and resources were perfectly usable. The 180s keepalive had the same problem: it probed with list_tools(), so even a successfully connected prompt-only server would be torn down on the first keepalive cycle. Changes: - MCPServerTask._advertises_tools(): capability check with a legacy fallback (no captured InitializeResult -> behave as before) - _discover_tools(): skip tools/list for non-tool servers - keepalive: use the universal ping request for non-tool servers - _refresh_tools(): guard against tools/list_changed from non-tool servers E2E verified with a real stdio prompt-only FastMCP-style server: on main it fails all 3 connection attempts with Method-not-found; with this fix it connects, lists prompts, answers ping keepalives, and shuts down cleanly.	2026-06-11 17:34:49 -07:00
ethernet	96cc7ee1e3	fix(coding): don't provide worktree root in context this makes the agent frequently edit files in the wrong worktree. what the agent doesn't know can't hurt it.	2026-06-11 20:27:06 -04:00
brooklyn!	4ddb03390a	fix(desktop): collect + persist API key for custom OpenAI endpoints (#43896 ) The desktop "Local / custom endpoint" onboarding never collected an API key and /api/model/set silently dropped one, so an auth-gated endpoint (e.g. a hosted vLLM behind a key) could never enumerate models — and Settings' "Set up custom endpoint" routed `custom` into a non-existent OAuth flow, booting the user back to the first screen (the reported loop). Backend (web_server.py): - /api/providers/validate accepts an optional api_key and sends it as a Bearer header when probing a custom endpoint's /v1/models. - /api/model/set accepts api_key, persists it to model.api_key (same switch/preserve lifecycle as base_url), and registers a named custom_providers entry via _save_custom_provider — matching the `hermes model` CLI flow so the endpoint shows up as a ready picker row. Desktop: - ApiKeyForm shows an optional API key field for the local/custom option; the key is threaded through saveOnboardingLocalEndpoint → validate + setModelAssignment. - New onboarding `localEndpoint` intent + startManualLocalEndpoint(); the Settings "Set up custom endpoint" button now opens the local-endpoint form (URL + key) instead of the OAuth dead-end. - Added localApiKeyPlaceholder i18n key (en + types + zh). Tests: api_key lifecycle on _apply_main_model_assignment, key persistence + custom_providers registration on /api/model/set, Bearer-header probe; onboarding store forwards + persists the key.	2026-06-12 00:03:55 +00:00
brooklyn!	c6007e5c1a	Merge pull request #44534 from NousResearch/bb/approval-allow-permanent fix(approval): carry allow_permanent to TUI + desktop approval prompts	2026-06-11 18:49:58 -05:00
Brooklyn Nicholson	81436e143e	fix(approval): carry allow_permanent to TUI + desktop approval prompts When a tirith content-security warning is present the approval backend forces allow_permanent=False and silently downgrades an "always" choice to session scope (the persistence loop in check_all_command_guards only honors "always" → permanent when no tirith finding exists). But the gateway notify payload that drives the TUI and the Electron desktop app never carried that flag, so both surfaces always rendered "Always allow" — offering a permanent allow the backend would quietly refuse to persist. Plumb allow_permanent end-to-end: - tools/approval.py: include `allow_permanent: not has_tirith` in the gateway approval_data the notify callback emits as `approval.request`. - ui-tui: thread `allowPermanent` through the event handler, gateway types, and ApprovalReq; ApprovalPrompt drops the "always" option (and renumbers the quick-pick keys) when it's false. - apps/desktop: thread `allow_permanent` through the gateway payload type, the per-session approval store, and the inline ApprovalBar, which now hides the "Always allow…" dropdown item when permanent allow is disallowed — reusing the existing DropdownMenu / confirm-Dialog UI. The desktop/TUI render path for approvals already landed in #38578 (the root cause of approvals not surfacing in the GUI); this completes the salvage of #37856 by carrying allow_permanent across both surfaces. #37856's original thread-local _block() approach is dropped: desktop/TUI approvals resolve via approval.respond → resolve_gateway_approval (the per-session queue), not the _block()/request_id correlation, so a worker-thread callback waiting on _block would never be released by the real UI. Tests: gateway notify payload carries allow_permanent (True without tirith, False with a tirith warning); ui-tui approvalAction reduced option set + event-handler allowPermanent propagation; desktop store round-trip + the ApprovalBar showing/hiding "Always allow". Supersedes #37856 Closes #37812 Co-authored-by: LeonSGP43 <cine.dreamer.one@gmail.com>	2026-06-11 18:23:59 -05:00
Gille	9102d4a588	fix(dashboard): show Windows 11 in host panel (#44511 )	2026-06-11 19:06:29 -04:00
brooklyn!	b1fe2107d6	fix(desktop): keep named-profile desktop backends per-profile (#44510 ) Desktop spawns its dashboard backend with `--profile <name>` and `HERMES_DESKTOP=1`. cmd_dashboard's unified-launch routing treats any named profile as a request for the shared machine dashboard: it re-execs as the default profile (dropping HERMES_HOME) or, when one is already listening, prints "Machine dashboard already running ... Managing profile '<name>'" and exits 0. Either way the desktop-spawned child exits before the app sees a ready backend, so Desktop retries forever — the Windows named-profile boot loop in the post-mortem. Skip the machine-dashboard reroute when HERMES_DESKTOP=1 so desktop pool backends stay per-profile (which is what the pool expects). Carved out of #44478. Co-authored-by: AJ <yspdev@gmail.com>	2026-06-11 22:47:28 +00:00
brooklyn!	73969771a5	fix(desktop): discover MCP tools for dashboard /api/ws backends (#44512 ) The desktop chat surface talks to the dashboard's in-process /api/ws gateway, which builds agents through tui_gateway.server._make_agent. That path only snapshots the existing tool registry — MCP discovery is started by tui_gateway/entry.py (the stdio TUI), which the dashboard process never runs. So a profile's configured MCP servers never connect under the desktop app and sessions show no MCP tools. Start a shared background MCP discovery thread at dashboard startup (via hermes_cli.mcp_startup, bounded so a slow/dead server can't block boot), and have _make_agent briefly join that thread in addition to the existing entry-owned TUI thread before snapshotting tools. Carved out of #44478. Co-authored-by: AJ <yspdev@gmail.com>	2026-06-11 22:45:45 +00:00
Austin Pickett	2ee69d0579	fix(skills): let ClawHub index build walk past the 12s browse budget (#44500 ) The deploy-site skills index crawl was capped at ~3k ClawHub entries because CATALOG_WALK_BUDGET_SECONDS applied to max_items=0 walks too. Only enforce the wall-clock budget for bounded browse requests and pass limit=0 from build_skills_index so CI walks the full catalog. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 18:03:11 -04:00
Austin Pickett	021ed69141	docs: finish Automation Blueprints terminology rebrand (#44470 ) * docs: finish Automation Blueprints terminology rebrand Replace leftover "Automation Templates" wording from the Cron Recipes rebrand, rename the copy-paste cookbook guide to Automation Recipes, and point the marketing gallery link at the blueprints catalog. Co-authored-by: Cursor <cursoragent@cursor.com> * docs: use Automation Blueprints instead of Recipes in guide Rename the cookbook guide from automation-recipes to automation-blueprints so sidebar and copy match the product term. Co-authored-by: Cursor <cursoragent@cursor.com> * docs: rename automation-blueprints-catalog to automation-blueprints Drop the -catalog suffix from the reference page slug and title, and move the copy-paste cookbook to automation-blueprint-examples so the main Automation Blueprints doc is unambiguous. Co-authored-by: Cursor <cursoragent@cursor.com> * Revert "docs: rename automation-blueprints-catalog to automation-blueprints" This reverts commit `605f1eeab5`. --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 17:22:22 -04:00
Teknium	acb2954d82	fix(agent): freeze carveout-era SUMMARY_PREFIX for renormalization The prompt consolidation above retires the carveout-era prefix. Without a frozen copy in _HISTORICAL_SUMMARY_PREFIXES, summaries persisted by pre-upgrade builds would lose detection (_is_context_summary_content) and renormalization (_strip_summary_prefix) — the exact regression class the tuple exists to prevent. Adds contract tests covering every frozen prefix. Refs #41607 #38364 #42812	2026-06-11 13:57:13 -07:00
kyssta-exe	8f8cad7ec5	fix(agent): strengthen compression preamble against stale task execution (#41607 )	2026-06-11 13:57:13 -07:00
konsisumer	d5e2fbf244	fix(agent): frame compaction handoff sections as historical context	2026-06-11 13:57:13 -07:00
teknium1	114e265737	fix(plugins): don't cache a failed discovery sweep as discovered Root-cause hardening for the stranded-empty-registry failure behind 'No web search/extract provider configured': discover_and_load() set _discovered=True before scanning, so a sweep that raised partway was swallowed by callers as a warning and every later call early-returned against an empty registry for the process lifetime. The flag now acts only as a re-entrancy guard and is reset when the sweep raises, so the next call retries discovery.	2026-06-11 12:56:44 -07:00
xxxigm	32a73010bb	test(web): cover keyless default surviving a failed plugin sweep Pins the invariant that _ensure_web_plugins_loaded registers the keyless Parallel default (and the wider bundled set) even when the general plugin discovery raises, that the direct-registration fallback honors plugins.disabled, and that it stays a no-op on the healthy path.	2026-06-11 12:56:44 -07:00
Austin Pickett	c3464ecf45	fix(discord): recover from runtime gateway task exits (#44383 ) * fix(discord): recover from runtime gateway task exits Salvaged from #39416 (AMEOBIUS) — cherry-picked only the task-exit recovery; the original PR was 1081 commits behind with 28 unrelated commits. A post-ready discord.py WebSocket crash left the gateway split-brained: producers stayed active while Discord stopped responding. After this fix the adapter calls _set_fatal_error(retryable=True) + _notify_fatal_error() so the existing GatewayRunner reconnect watcher replaces the dead adapter. Also adds _wait_for_ready_or_bot_exit() so startup failures (SOCKS/proxy errors, invalid tokens) surface fast instead of burning the full ready timeout. Because connect() no longer waits via asyncio.wait_for on that path, test_connect_releases_token_lock_on_timeout is updated to trigger the timeout through the new helper (same lock-release contract). 3 tests pass (2 new runtime-failure tests + the updated timeout test); test_discord_connect.py and test_discord_slash_commands.py green. Co-Authored-By: ameobius <ameobius@local.host> * fix(test): patch _wait_for_ready_or_bot_exit in timeout cancel test connect() no longer uses asyncio.wait_for for the ready handshake, so test_connect_timeout_cancels_bot_task was hanging for 30s in CI. Co-authored-by: Cursor <cursoragent@cursor.com> --------- Co-authored-by: ameobius <ameobius@local.host> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 15:39:01 -04:00
Dineth Hettiarachchi	020ef76cf1	fix(discord): cancel _bot_task on connect() timeout to prevent zombie client When connect() times out waiting for the Discord ready event, the background asyncio.Task running client.start() was not cancelled. discord.py's internal reconnect loop can ignore client.close() while a WebSocket handshake is in flight, so the orphaned task eventually completes and fires on_ready. A later successful reconnect then leaves two live Discord clients in the same process — each with its own on_message handler and MessageDeduplicator instance — so every @mention creates two threads because the per-adapter dedup caches cannot catch cross-client duplicates. Fix: explicitly cancel and await _bot_task in two places: 1. The asyncio.TimeoutError handler inside connect() — catches the case where the adapter's own inner wait_for fires before the gateway's outer timeout. 2. The start of disconnect() — the load-bearing path, always reached via _dispose_unused_adapter regardless of which timeout fired first. Root cause confirmed from production logs: a Jun 8 network outage caused three consecutive connect() timeouts. The first attempt's bot_task completed its handshake 4 minutes later ("Connected as") with no preceding watcher line, then the watcher's real reconnect also connected 90 seconds after that. The two clients ran continuously for 41+ hours, confirmed by the same user message appearing as two separate inbound events in two different thread IDs 357ms apart. Regression tests added to tests/gateway/test_discord_connect.py: - test_connect_timeout_cancels_bot_task: simulates a connect() timeout with a NeverReadyBot and asserts _bot_task is None afterward - test_disconnect_cancels_running_bot_task: injects a live zombie task, calls disconnect(), and asserts the task is cancelled and the attribute cleared	2026-06-11 12:09:18 -07:00
Teknium	13650ab7f8	fix(gateway): audio attachment note no longer steers the agent into punting Sibling site of the PDF/DOCX note fixed in PR #44175: the audio file attachment context note led with "Ask the user what they'd like you to do with it", steering the model into asking instead of transcribing. Rewritten to instruct the agent to transcribe/process the file itself when the request involves its content, only asking when intent is genuinely unclear. Contract assertion added to the existing audio attachment note test.	2026-06-11 11:58:19 -07:00
xxxigm	4e9be3ee32	test(gateway): cover document context note for PDF/DOCX vs text Pin the contract for _build_document_context_note: text documents confirm the inlined content and record the path; binary documents (PDF/DOCX/XLSX/octet- stream) tell the agent to extract the text itself and never instruct it to ask the user to paste the contents.	2026-06-11 11:58:19 -07:00
Austin Pickett	ce99a81123	fix(dashboard): suppress unicode-animations postinstall during npm ci Set CI=1 in _run_npm_install_deterministic so the package's /dev/tty postinstall demo is skipped during hermes dashboard web UI builds. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 11:49:08 -07:00
brooklyn!	a4f179c509	fix(agent): steer GPT/Codex family to V4A for single-file edits too (#44411 ) The coding-posture brief told GPT/Codex models to use patch mode='patch' (V4A) for structured/multi-file changes but mode='replace' "for a single small swap". That second nudge points those models at a format their first-party harness never taught them. Verified against openai/codex (current main): apply_patch is the ONLY file editor in codex-rs — zero occurrences of str_replace/old_string anywhere in the repo; the grammar (core/src/tools/handlers/apply_patch.lark) is exactly the V4A dialect our patch_parser implements; the shipped model prompts (gpt_5_codex, gpt-5.2-codex, gpt-5.1-codex-max + instruction templates) explicitly say to use apply_patch "for single file edits"; and the tool is gated per model via ModelInfo.apply_patch_tool_type, i.e. OpenAI ships V4A-for-everything as model metadata. The GPT-family line now steers to mode='patch' for all edits, single-file included. The replace-family line (Claude + open-weight) is unchanged — Claude Code's FileEdit is old_string/new_string/replace_all exact string replacement (confirmed from Anthropic's shipped sdk-tools.d.ts, the only file editor in its tool union), matching our mode='replace'.	2026-06-11 17:52:52 +00:00
Teknium	cb29e8a82e	refactor(cron): rebrand Cron Recipes -> Automation Blueprints Product rename across every surface: module/file names (blueprint_catalog, tools/blueprints, blueprint_cmd), slash command /cron-recipe -> /blueprint (alias /bp), dashboard API /api/cron/blueprints, desktop deep-link hermes://blueprint/<key>, docs catalog page + extract script, and the skill frontmatter block metadata.hermes.blueprint. No behavior change.	2026-06-11 10:49:47 -07:00

1 2 3 4 5 ...

5354 commits