hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-18 09:51:59 +00:00

Author	SHA1	Message	Date
Wolfram Ravenwolf	9137b86a52	fix(skills): ignore support docs in skill discovery Support files under references/, templates/, assets/, and scripts/ are progressive-disclosure data loaded through skill_view(..., file_path=...). They should not be treated as standalone skills during discovery or collision checks. This prevents archived skill packages or support markdown files inside a real skill from shadowing active skills with the same name while still allowing top-level categories named scripts/templates/assets/references. Tests cover: - pruning nested SKILL.md files inside skill support directories - preserving support-named top-level categories - avoiding skill_view collisions from support markdown - keeping archived package SKILL.md files accessible only through file_path	2026-06-16 13:08:34 -07:00
Wolfram Ravenwolf	e76e7b5073	feat(hooks): session:compress event_callback for MemPalace sync	2026-06-16 11:45:36 -07:00
teknium	6ebc449915	fix(prompt): isolate truncation warnings per context Follow-up to salvaged PR #41619: replace the module-global _truncation_warnings list with a contextvars.ContextVar so concurrent gateway-session prompt builds can't drain or clear each other's pending warnings (cross-session leak). Adds a context-isolation test.	2026-06-16 11:28:35 -07:00
Wolfram Ravenwolf	f6a42b1acf	feat(prompt): make context-file truncation limit configurable PROBLEM: Automatic context files such as SOUL.md and AGENTS.md were capped by a hardcoded CONTEXT_FILE_MAX_CHARS value. Amy's local fork had raised that constant from 20K to 25K so a larger SOUL.md would not be silently truncated, but the hardcoded 25K value changed upstream default behavior and made the patch less generally useful. SOLUTION: Restore the upstream-compatible 20K default, add a context_file_max_chars config setting for users who intentionally keep larger identity/project-context files, keep chat-visible truncation warnings, and document the new setting. Tests cover the default, config override, explicit max_chars precedence, and the warning text.	2026-06-16 11:28:35 -07:00
Teknium	c2c55c4443	fix(memory): strip skill scaffolding for all providers, not just openviking Generalizes #32663 (@ehz0ah). The slash-skill scaffolding pollution affected every auto-syncing memory provider — mem0, hindsight, retaindb, byterover, honcho, supermemory all store/embed the raw user turn, so a /skill invocation poisoned their stores with the full skill body, not just openviking. - Lift the contributor's parser into agent/skill_commands.py as the canonical extract_user_instruction_from_skill_message(), co-located with the message builders so the markers can't drift. - Strip once in MemoryManager.{prefetch_all,queue_prefetch_all,sync_all} — fixes the whole provider fan-out, bare /skill turns are skipped entirely. - OpenViking's _derive_openviking_user_text() now delegates to the shared helper as defense-in-depth (no duplicated marker literals). - Marker-drift regression now asserts against the canonical skill_commands constants; add manager-level coverage proving every provider gets clean text.	2026-06-16 10:37:37 -07:00
brooklyn!	c6e99ab375	Merge pull request #46959 from NousResearch/bb/composer-model-selector feat(desktop): composer model selector, per-model presets & external-provider disconnect	2026-06-16 09:55:57 -05:00
Brooklyn Nicholson	7d938cc5c9	fix(desktop): keep live model switch metadata truthful A live config.set model switch already moved the next API call to the new model, but the conversation could still restore an old sessions.system_prompt snapshot whose Model/Provider lines named the previous runtime. That made "what model are you?" answer from stale metadata even while inference ran on the new model. After a live switch we now refresh the stored system prompt and append a real system-history pivot (not a fake user turn) so the transcript itself records the new model/provider. Restore also rejects already-stale prompt snapshots when their Model/Provider lines disagree with the runtime, so existing bad sessions self-heal.	2026-06-16 09:50:17 -05:00
Teknium	4858942c55	fix(auxiliary): honor main fallback chain for auto tasks (#47235 )	2026-06-16 06:23:24 -07:00
Wolfram Ravenwolf	4cf9d80fba	feat(display): verbose skill change notifications with content previews When display.memory_notifications is set to 'verbose', skill_manage notifications now show meaningful change details instead of just the generic tool message. Before (verbose mode): 💾 📝 Patched SKILL.md in skill 'gogcli' (1 replacement). After (verbose mode): 💾 📝 Skill 'gogcli' patched: "old pitfall text..." → "new pitfall text..." Changes: - skill_manager_tool.py: _patch_skill() now includes old/new string previews (truncated to 200 chars) in the result via '_change' key. _create_skill() and _edit_skill() include skill description from frontmatter for verbose create/edit notifications. - run_agent.py: Background review notification builder now reads the '_change' dict from skill tool results and formats descriptive notifications per action type (patch → old→new diff, create/edit → description preview). Falls back to generic message when _change data is unavailable (backwards compatible). This is especially useful when subagents patch skills, since neither the user nor the parent agent can see what the subagent changed.	2026-06-16 05:45:40 -07:00
Wolfram Ravenwolf	20b1f4f3fb	feat(memory): configurable background memory update notifications Background memory reviews now support three notification modes, configured via display.memory_notifications in config.yaml: off — no chat notification (still logged to stdout/HA log) on — generic '💾 Memory updated' (default, unchanged behavior) verbose — content preview with action indicators: 💾 Memory ➕ Hermes Repo liegt unter /config/amy/hermes-agent/... 💾 Memory ✏️ Updated repo path from claude-code to hermes-agent... 💾 Memory ➖ old entry about claude-code path... Previews are truncated to 120 chars for adds/replaces, 60 for removes. Each action gets its own line in verbose mode for readability. Files: run_agent.py, gateway/run.py	2026-06-16 05:45:40 -07:00
Teknium	3e7e9b24d4	fix: harden salvaged session and browser improvements Polish salvaged contributor work before PR review: - read browser inactivity timeout from config with documented fallback - skip redundant v10 trigram backfill before v11 FTS rebuild - show delegate_task goals safely in progress previews - show gateway status model/context without redundant token wording - wire gateway /sessions to shared session-listing helpers - map Ravenwolf author emails for release attribution Co-authored-by: Wolfram Ravenwolf <github.com@wolfram.ravenwolf.de> Co-authored-by: Amy Ravenwolf <amy@ravenwolf.de>	2026-06-15 07:46:34 -07:00
Amy Ravenwolf	5035fa9029	feat(display): show delegate_task goals in tool progress notifications Previously, delegate_task in batch mode only showed '3 parallel tasks' without revealing what the tasks actually are. Single-task mode showed the goal via the primary_args fallback, but batch mode had no goal extraction. Changes: - build_tool_preview(): Add dedicated delegate_task handler that extracts individual task goals from both single and batch modes. Batch shows '3 tasks: Goal A \| Goal B \| Goal C'. - _get_cute_tool_message_impl(): Show individual goals in CLI cute messages for batch delegate calls ('3x: Goal A \| Goal B'). - Add 4 tests covering single goal, batch goals, missing goals, and no-goal edge case.	2026-06-15 07:46:34 -07:00
Teknium	49e743985a	fix: route minimax m3 reasoning controls through profile Follow up PR #46609's api.minimax.io reasoning report by moving the behavior out of the broad run_agent host gate and into the MiniMax provider profile. Only MiniMax-M3 on the documented OpenAI-compatible /v1 route gets reasoning_split/thinking/reasoning_effort; Anthropic-format MiniMax and non-M3 models keep their existing wire shapes. Co-authored-by: goku94123 <gooku94123@gmail.com>	2026-06-15 07:08:43 -07:00
Teknium	733472952a	fix: complete cron jobs lock salvage Route curator rollback through the same cross-process cron job lock, make save_jobs lock for legacy direct callers without deadlocking nested mutation paths, and harden the regression test so a second _jobs_lock caller really blocks across processes.	2026-06-15 06:29:00 -07:00
Teknium	aab2e99bae	test: cover request debug dump redaction Keep request dump writes on the shared atomic JSON path, add regression coverage for request body/error/stdout redaction, and map the salvaged contributor email for release attribution.	2026-06-15 05:31:21 -07:00
xtymac	ad58dd51ac	redact secrets in API request debug dumps dump_api_request_debug() masks the provider Authorization header but writes the request `body` (system prompt, tool defs, context-embedded values) and the error message raw via atomic_json_write. This path also fires unconditionally on API errors (not only under HERMES_DUMP_REQUESTS), so any secret surfaced into context (e.g. an integration token) lands in cleartext at request_dump_*.json on every failed call. Run the serialized dump through the existing redact_sensitive_text() scrubber (already used for logs/tool output) before persisting and before the HERMES_DUMP_REQUEST_STDOUT print; preserve atomicity via temp-file + Path.replace. Also add the Notion internal-integration prefix (ntn_) to _PREFIX_PATTERNS so bare values are caught. Per SECURITY.md §3.2 this is a redaction (in-process heuristic) hardening, not a §3.1 vulnerability. Refs #46583.	2026-06-15 05:31:21 -07:00
liuhao1024	2cddc9c895	fix(bedrock): check boto3 version >= 1.34.59 before using converse_stream converse() and converse_stream() were added in boto3 1.34.59. When Hermes is installed editable into system Python (e.g. Ubuntu 24.04 ships 1.34.46), the system boto3 takes precedence and calls to converse_stream fail with AttributeError. Add an early version check in _require_boto3() that raises a clear RuntimeError with upgrade instructions.	2026-06-15 05:25:17 -07:00
Teknium	f3fe99863d	revert(web): remove keyless Parallel search fallback (#46350 ) Remove the free Parallel Search MCP path and restore the keyed Parallel backend behavior from before it was introduced. Also drops the keyless fallback registration/display labeling tests and returns the Parallel SDK pin to the prior version.	2026-06-14 16:47:57 -07:00
mr-r0b0t	bff78a34dc	feat(zai): add GLM-5.2 with verified 1M context window GLM-5.2 ships with a 1M (1,048,576) token context window. Without this entry, Hermes falls through to the generic 'glm' key (202,752 tokens), under-reporting the context bar and prematurely compressing conversations. The 1M limit was verified empirically via needle-in-a-haystack retrieval at 789,240 prompt tokens on api.z.ai/api/coding/paas/v4 — zero errors, zero truncation, correct retrieval at every tested size (25K through 789K). Changes: - agent/model_metadata.py: add 'glm-5.2': 1_048_576 before 'glm' fallback - hermes_cli/models.py: add glm-5.2 to zai curated models - hermes_cli/setup.py: add glm-5.2 to setup wizard zai list - hermes_cli/auth.py: add glm-5.2 to coding plan endpoint probes - plugins/model-providers/zai/__init__.py: add glm-5.2 to fallback_models - tests/agent/test_model_metadata.py: context resolution + vendor-prefix tests	2026-06-14 13:50:36 -07:00
Teknium	4e6d05c6a5	perf(skills): share raw config cache in skill utils (#46149 )	2026-06-14 11:14:58 -07:00
kshitijk4poor	ce19fdb7ce	fix(skills): apply global\|platform disabled union to all resolution sites The platform-disabled fix landed only in agent.skill_utils.get_disabled_skill_names (the system-prompt path). Two sibling resolvers still used the old replace-not-union semantics, so the same skill could be hidden from the <available_skills> prompt yet reported enabled elsewhere: - hermes_cli/skills_config.get_disabled_skills (the 'hermes skills config' UI) returned only the platform list, so a globally-disabled skill showed as enabled (unchecked) on any platform with a platform_disabled entry. - tools/skills_tool._is_skill_disabled (gates whether skill_view loads a skill) ignored the global list when a platform list existed, so a globally-disabled skill could still be loaded on such a platform. Both now union the global list with the platform list, matching get_disabled_skill_names. An explicit empty platform list no longer re-enables a globally-disabled skill — global disables hold on every platform (#46201). Also: fix the now-stale get_disabled_skill_names docstring and drop a stray blank line. Regression tests added for both sites (proven to fail on the old replace semantics).	2026-06-14 22:54:54 +05:30
ibrahim özsaraç	7bbe7024c2	fix: filter platform-disabled skills from <available_skills> prompt (#46201 ) build_skills_system_prompt() already resolved _platform_hint but called get_disabled_skill_names() with no argument, so the resolved platform never reached the filter and the prompt cache_key varied by platform while the disabled set did not. Pass _platform_hint or None. get_disabled_skill_names() also fully ignored the global 'disabled' list once a platform-specific list was found. Return the union (global \| platform) so a globally-disabled skill stays disabled on every platform. Salvaged from #46203 by @iborazzi; the unrelated apps/shared/tsconfig.json ES2023 bump is intentionally dropped (one concern per PR).	2026-06-14 22:52:57 +05:30
Teknium	13a1bd0f83	perf(model-metadata): persist OpenRouter metadata cache (#46114 )	2026-06-14 04:45:46 -07:00
Teknium	723c2331bd	fix: make profile subprocess HOME policy explicit	2026-06-14 03:20:21 -07:00
zccyman	b00060ce54	fix(agent): expose HERMES_REAL_HOME in subprocess envs for profile isolation When profile isolation activates ({HERMES_HOME}/home/ exists), child processes receive HOME={HERMES_HOME}/home/ for tool config isolation (git, ssh, gh). However, scripts using Path.home() to locate ~/.hermes/ would incorrectly resolve to the isolated profile home, breaking helpers that rely on the real user home directory. New get_real_home() helper in hermes_constants resolves the actual user home independently of profile isolation. All four subprocess spawners now inject HERMES_REAL_HOME alongside the profile HOME: - tools/code_execution_tool.py (execute_code) - tools/environments/local.py (terminal background, run_env) - agent/copilot_acp_client.py (Copilot ACP) Child scripts can now use: Path(os.environ.get("HERMES_REAL_HOME", os.environ.get("HOME", ""))) to reliably find the real user home regardless of profile isolation. Closes #25114	2026-06-14 03:20:21 -07:00
helix4u	85e6232a07	fix(providers): support anthropic proxy v1 endpoints	2026-06-14 02:09:16 -07:00
Teknium	81e42335a1	fix(file-safety): relax user-write deny policy (#45947 ) Allow file tools to edit shell startup files, user package-manager configs, and Hermes control files that the user can already modify directly. Keep hard blocks for SSH keys, .env/OAuth token stores, mcp-tokens, pairing files, and system privilege files.	2026-06-14 02:07:32 -07:00
Brooklyn Nicholson	715b691723	fix(desktop): show summarizing indicator during auto-compaction Auto-compression rewrites history mid-turn, which made long threads look like they reset. Re-tag the gateway lifecycle status as compacting and surface it in the desktop thread loading indicators.	2026-06-14 02:28:07 -05:00
kshitijk4poor	10bd01972b	refactor(agent): share the content_policy_blocked result builder + recovery hint The HTTP-200 refusal handler (finish_reason=content_filter) and the exception-path handler (a provider moderation error classified as content_policy_blocked) independently built the same terminal turn result — the same {final_response, messages, api_calls, completed:False, failed:True, error:'content_policy_blocked: ...'} dict — and ended their user-facing message with the same 'Try rephrasing... hermes fallback add' trailer, copied verbatim. The two copies could drift. Funnel both through a shared _content_policy_blocked_result() builder and a shared _CONTENT_POLICY_RECOVERY_HINT constant. Also collapse the HTTP-200 path's two near-identical with/without-explanation templates into one (compute the detail fragment once) and pass reason=FailoverReason.content_policy_blocked .value to the error hook instead of a hand-written string literal, matching the sibling hook call. Behavior-preserving: the provider/refusal lead-in wording stays distinct (a provider safety filter vs the model declining are genuinely different signals), the with-text and exception messages are byte-identical to before, and the no-explanation case only gains a paragraph break for consistency. Surfaced by the simplify-code reuse/quality reviewers. The efficiency reviewer's 'redundant normalize_response' flag was deliberately NOT applied: that branch is cold (refusal-only) and pure-CPU, and reusing the sibling-branch normalized locals would risk a NameError on the codex_responses path (which sets finish_reason without normalizing) — re-normalizing is the robust choice.	2026-06-14 12:19:19 +05:30
kshitijk4poor	12c84d6c77	fix(transports): only treat a refusal as terminal when it is the sole payload A chat-completions response that carries real text or tool calls alongside a `message.refusal` note is a normal, usable turn — the model did work. The prior logic flipped finish_reason to `content_filter` whenever a refusal string was present, so the conversation loop reframed a content-bearing turn as a failed safety refusal (failed=True) and buried the model's actual output inside the "model declined" template, or dropped tool calls entirely. Only promote to a terminal `content_filter` when the refusal is the sole payload (no visible text AND no tool calls). The refusal explanation is still recorded in provider_data in every case for observability. Refusal-only responses (the bug this feature targets) are unaffected and still surface terminally; the empty+refusal, bare content_filter passthrough, and no-refusal common cases are byte-identical to before. Updates the partial-content test to the corrected contract and adds a tool_calls-alongside-refusal regression guard.	2026-06-14 12:12:52 +05:30
SHL0MS	bb46bf8ce4	fix(agent): surface model refusals instead of retrying them as errors A Claude refusal (HTTP 200, stop_reason="refusal", empty content) was laundered into a generic retry loop and surfaced as a misleading "rate limited / invalid response" or "no content after retries" error, burning paid attempts reproducing a deterministic refusal. This hit two distinct paths: - Direct Anthropic (anthropic_messages): validate_response rejected the empty-content refusal before normalize_response mapped refusal -> content_filter, so it fell into the invalid-response retry loop. - Nous Portal / OpenAI-compatible (chat_completions): the portal surfaces a Claude refusal via message.refusal with empty content, which sailed past validation and died in the empty-response retry loop. Fix (one unified content_filter dispatch for all backends): - AnthropicTransport.validate_response: accept empty content when stop_reason == "refusal" so it flows to normalize_response. - ChatCompletionsTransport.normalize_response: promote message.refusal to content + a content_filter finish reason. - conversation_loop: handle finish_reason == "content_filter" - fire the api_request_error hook (content_policy_blocked), try a configured fallback once, else return a clear terminal refusal message. Never retry a deterministic refusal. Supersedes #43084, which fixed only the direct-Anthropic path and could not reach the chat_completions/portal path. Tests: transport-level (validate_response refusal, message.refusal promotion) + end-to-end loop (refusal surfaced, exactly one API call). (cherry picked from commit `01f546f92c`)	2026-06-14 12:10:08 +05:30
brooklyn!	4b5ba112ad	fix: shrink images to reported provider dimension limit (#45979 ) Parse provider-reported image pixel ceilings so many-image Anthropic requests can recover by shrinking Retina screenshots below the stricter limit instead of retrying the same rejected payload.	2026-06-14 01:07:43 -05:00
Teknium	7aaae7acd0	fix(ssl): align guard docs and escape hatch	2026-06-13 21:14:32 -07:00
Teknium	dc90ca4e17	fix(ssl): run CA guard during agent initialization	2026-06-13 21:14:32 -07:00
Teknium	af5b526472	fix(ssl): validate CA bundle paths before provider calls	2026-06-13 21:14:32 -07:00
chromalinx	a218a0f156	fix(agent,gateway,doctor): add SSL CA cert bundle fail-fast guard A stale certifi CA bundle after a partial `hermes update` used to crash the agent on the first outbound HTTPS call with a raw traceback and trap the gateway in a retry loop. This patch: * Adds `agent/errors.py` with a typed `SSLConfigurationError` * Adds `agent/ssl_guard.py` with a `verify_ca_bundle()` pre-flight that asserts the bundle exists, is non-trivial in size, and can build a working SSLContext. On macOS, it falls back to the system trust store when the bundle is empty but the system store is healthy (covers corporate proxies / MDM setups). * Wires the guard into `run_agent.py` and `gateway/run.py` right after the `hermes_bootstrap` import, inside a try/except so a bug in the guard itself can never prevent startup. * Adds a `SSL / CA Certificates` section to `hermes_cli doctor` so users can detect the failure with one command. * Adds unit tests covering the healthy, missing, empty, skip-env, and macOS-fallback paths. * Adds an RCA document describing the failure mode and the recovery path (`pip install -e .`). When the bundle is broken the user sees: \u26a0\ufe0f SSL certificate bundle issue detected. Run: pip install -e . `HERMES_SKIP_SSL_GUARD=1` disables the check for sandboxed environments that ship their own trust store.	2026-06-13 21:14:32 -07:00
Teknium	069bfd6545	fix(agent): keep Codex reasoning replay on Codex path	2026-06-13 14:35:00 -07:00
Teknium	c8e5f34f24	fix(gemini): strip native self prefixes before generateContent (#36141 ) Strip `google/` and `gemini/` self-prefixes before native Gemini generateContent calls, and keep provider-normalization expectations aligned.	2026-06-13 13:47:08 -07:00
briandevans	7d11fa4e9e	fix(codex-responses): let final_answer complete top-level incomplete responses	2026-06-13 13:45:29 -07:00
ITheEqualizer	7c0605bf22	fix(telegram): preserve rich formatting on stream final	2026-06-13 13:44:45 -07:00
achaljhawar	819def44c7	fix(agent): scope Nous tags to Nous auxiliary calls	2026-06-13 13:24:40 -07:00
ashishpatel26	957a8ffa88	fix(bedrock): omit sampling params for restricted Claude models Bedrock Converse rejects non-default sampling parameters for Opus 4.7 and 4.8 with a ValidationException. Reuse the Anthropic-native sampling-param guard in the Bedrock kwargs builder so those models omit temperature/topP while older Claude and non-Claude models keep existing behavior. Includes the stop-sequence regression from the parallel fix to ensure stopSequences still pass through for restricted Opus models. Co-authored-by: Tranquil-Flow <tranquil_flow@protonmail.com>	2026-06-13 10:45:56 -07:00
Tranquil-Flow	4fd9397ae3	fix(codex): drop extra_headers for chatgpt.com backend	2026-06-13 07:13:24 -07:00
Henrik Bentel	eed61a1251	fix(gemini): add role field to systemInstruction	2026-06-13 06:12:52 -07:00
Tranquil-Flow	5acd185f7c	fix(moonshot): handle union type arrays in tool schemas	2026-06-13 05:51:41 -07:00
Teknium	a59d5e37e8	feat(telegram): make rich messages always on (#45584 ) Remove the rich_messages config toggle entirely so Telegram replies always try the Bot API 10.1 rich-message path first, with the existing MarkdownV2 fallback/latch behavior for unsupported endpoints and per-message failures. Restore the Telegram platform hint to encourage rich Markdown tables/task lists/math now that the rich path is the default, and remove the config/docs surface for the old toggle.	2026-06-13 05:45:11 -07:00
Teknium	4b646bc21e	fix(auxiliary): preserve main provider base url (#45587 )	2026-06-13 05:44:18 -07:00
helix4u	2d474e39c7	fix(acp): preserve memory provider tools	2026-06-13 04:51:44 -07:00
Teknium	3803e5fc28	fix(agent): don't treat custom:<name> pools as cross-provider mismatch (#45289 ) Custom endpoints carry two naming conventions for the same provider: the agent's provider attribute is the generic 'custom' label while the pool is keyed 'custom:<normalized-name>'. The defensive guard in recover_with_credential_pool compared them literally, logged 'Credential pool provider mismatch: pool=custom:<name>, agent=custom', and skipped recovery — so 401 refresh and 429 rotation never ran for ANY custom-provider user (seen in the field on a Fireworks setup whose dead key burned full retry cycles every turn with the skip warning on each one). Accept the pair only when the agent's CURRENT base_url resolves to the same pool key via get_custom_provider_pool_key, preserving the guard's original purpose (#33088/#33163): a fallback provider or a different custom endpoint still skips pool mutation.	2026-06-13 02:01:09 -07:00
kyssta-exe	956af7f3c3	fix(agent): add metadata flag to context compression summary messages (#38389 ) Summary messages (standalone insertion and merge-into-tail) now carry a metadata flag so frontends (CLI, Desktop, gateway, TUI) can distinguish them from real assistant/user messages without content-prefix heuristics. Re-applied from PR #38434 onto current main (conflicted with the _SUMMARY_END_MARKER hoist). Key renamed from the PR's 'is_compressed_summary' to '_compressed_summary': the wire sanitizers strip underscore-prefixed message keys, so the flag stays in-process and can never reach strict gateways (Fireworks/Mistral/Kimi reject unknown keys with 'Extra inputs are not permitted').	2026-06-12 16:47:15 -07:00

1 2 3 4 5 ...

1286 commits