hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-09 08:21:50 +00:00

Author	SHA1	Message	Date
Teknium	3a2c03061c	fix(stt,tts): restore mistralai — 2.4.8 is clean, ban lifted (#34841 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(stt,tts): restore mistralai — 2.4.8 is clean, ban lifted PyPI quarantined mistralai on 2026-05-12 after the malicious 2.4.6 release (Mini Shai-Hulud worm). 2.4.6 has since been removed from the registry and clean releases resumed (2.4.7 2026-05-25, 2.4.8 2026-05-28). This rolls back the blanket runtime ban so Voxtral STT + TTS work again, following the restoration checklist the repo left in pyproject.toml. Verified against the real SDK: 2.4.8 keeps the import path the code uses (from mistralai.client import Mistral) and the audio.transcriptions.complete / audio.speech.complete surfaces. Changes: - pyproject.toml: re-add mistral extra pinned to mistralai==2.4.8; left OUT of [all] per the 2026-05-12 lazy-install policy (one quarantined release must not break fresh installs). uv.lock regenerated. - tools/lazy_deps.py: add stt.mistral / tts.mistral entries so the SDK lazy-installs on first use (matches edge / elevenlabs). - tools/transcription_tools.py: restore explicit-provider gate (_HAS_MISTRAL + key) and auto-detect entry (local>groq>openai>mistral>xai); _transcribe_mistral lazy-installs before import. - tools/tts_tool.py: dispatcher routes back to _generate_mistral_tts; _import_mistral_client lazy-installs the SDK. - hermes_cli/tools_config.py, hermes_cli/web_server.py: un-hide Mistral from the TTS provider picker and dashboard STT options. - hermes_cli/security_advisories.py: KEEP the shai-hulud-2026-05 advisory (module policy forbids removal) — it is scoped to 2.4.6 only, so it still warns anyone with the poisoned build cached and never fires on 2.4.8. Summary note updated to reflect the un-quarantine. - tests: revert the disabled-behavior assertions added by the ban commit back to routing/positive expectations; add mistral to the lazy-installable-extras-excluded-from-[all] contract. Reported by @SkYNewZ (#34503). Validation: 189 targeted STT/TTS/lazy_deps/metadata tests pass; E2E with the real mistralai 2.4.8 SDK routes both STT and TTS to mistral.	2026-05-29 13:24:12 -07:00
Teknium	781604ce4c	fix(gateway): unify MEDIA: extraction extension set + close the unknown-ext black hole (#34517 ) (#34844 ) MEDIA:<path> tags for .md/.json/.yaml/.xml/.html and other document extensions were silently dropped. extract_media() carried a narrow extension allowlist that omitted them, while extract_local_files() had a broad one. The dispatch sites then ran an unconditional re.sub(r'MEDIA:\\s*\\S+', '') that stripped the tag from the body even when extract_media had not matched it — so extract_local_files (broad list) ran on text where the path was already gone, and the file was delivered by neither path. - Add MEDIA_DELIVERY_EXTS in gateway/platforms/base.py as the single source of truth; extract_media and extract_local_files both derive their extension set from it (no more drift). - Replace the loose MEDIA cleanup at the non-streaming dispatch site (base.py) and the streaming consumer (stream_consumer.py) with the shared, extension-anchored MEDIA_TAG_CLEANUP_RE. A MEDIA: tag with an unknown extension is left in the body so the bare-path detector can still pick it up instead of being black-holed. - Chain cleaned text through extract_media -> extract_images -> extract_local_files in run.py's post-stream media delivery (it was dropping the cleaned text and rescanning raw text with MEDIA: tags). - Regression tests covering both halves: previously-dropped extensions now extract, and unknown-ext paths survive the cleanup. Consolidates the MEDIA extension-allowlist PR cluster. Co-authored-by: Bartok9 <259807879+Bartok9@users.noreply.github.com> Co-authored-by: banditburai <123342691+banditburai@users.noreply.github.com> Co-authored-by: Kyzcreig <9063726+Kyzcreig@users.noreply.github.com>	2026-05-29 13:24:01 -07:00
teknium1	0dc0c5ea6b	chore: add AUTHOR_MAP entry for sweetcornna Maps the cherry-picked commit's noreply email to the GitHub login so the release attribution / CI author check passes.	2026-05-29 13:22:54 -07:00
Bartok9	3845d86b93	fix(cron): restore jobs.json emptied by config migration on update Config-version migrations have been observed to leave cron/jobs.json valid-but-empty after `hermes update`, silently dropping every scheduled job (#34600). The existing malformed-shape guards in cron/jobs.py don't catch this because {"jobs": []} is valid JSON. Add restore_cron_jobs_if_emptied() as a post-migration safety net: if the live cron/jobs.json now has zero jobs while the pre-update snapshot held one or more, restore the snapshot copy in place and warn loudly. The check is conservative — it only restores on unambiguous evidence of loss (snapshot had jobs, live file readable-and-empty), so a user who genuinely cleared their jobs is never second-guessed and an unreadable live file is left untouched so real corruption still surfaces. Wired into _cmd_update_impl after migrate_config(), reusing the existing pre-update quick snapshot (which already captures cron/jobs.json). Closes #34600	2026-05-29 13:22:54 -07:00
Cornna	d473e7c938	fix(cron): exclude jobs.json registry from disk-cleanup pattern Closes #32164	2026-05-29 13:22:54 -07:00
Teknium	91b174038c	fix(feishu): bound _chat_locks with LRU eviction (#34836 ) The Feishu adapter stored one asyncio.Lock per chat_id in a plain dict with no upper bound, so a long-running gateway that saw many distinct chats grew _chat_locks without limit. Port the LRU-eviction pattern already used by the yuanbao adapter: OrderedDict + move_to_end on access, CHAT_LOCK_MAX_SIZE cap (1000), and eviction that skips currently-held locks (falling back to dropping the LRU entry only if all are held).	2026-05-29 13:18:15 -07:00
teknium1	8055d0f092	test(ntfy): cover echo-tag filter; tag standalone send path Adds tests for the echo-loop fix (outgoing X-Tags header, inbound skip on tagged events, genuine tags pass through) and extends the tag to the out-of-process _standalone_send() path so cron / send_message deliveries to a self-subscribed topic are also skipped. Maps both contributors in release.py AUTHOR_MAP. Co-authored-by: liuhao1024 <sunsky.lau@gmail.com>	2026-05-29 13:17:46 -07:00
annguyenNous	9405cdc8dd	fix(ntfy): prevent echo loop by tagging outgoing messages When publish_topic equals the subscribe topic, the agent's own replies are echoed back by ntfy as incoming messages, creating an infinite reply spiral. Fix: tag outgoing messages with X-Tags: hermes-agent header, and skip incoming messages that carry this tag. This is zero-config — works automatically regardless of topic configuration. Fixes NousResearch/hermes-agent#34447	2026-05-29 13:17:46 -07:00
Bartok9	08c0b22417	fix(gateway): scope tool-result MEDIA scan to current turn The post-run scan that appends tool-emitted MEDIA: tags to the final response iterated every tool/function message in the full conversation and relied solely on path-based dedup against paths reconstructed from the replayable transcript. When that reconstruction does not byte-match the in-memory tool content (timestamp stripping, observed-context withholding, compression rewrites), a stale path emitted several turns earlier is absent from the dedup set and leaks onto a later text-only reply (Telegram 'Sending media group of 1 photo(s)' with no MEDIA directive present). Scope the scan to this turn's new messages by slicing result['messages'] at len(agent_history) (agent_history is passed as conversation_history into run_conversation, so the returned list is history + this turn). Retain path-based dedup as a secondary guard and as the sole guard on the compression-shrink fallback, preserving the #160 behaviour. Closes #34608	2026-05-29 13:13:34 -07:00
teknium1	38c4f8c371	test(gateway): update system-unit cwd assertion to HERMES_HOME anchor test_system_unit_has_no_root_paths asserted the system unit's WorkingDirectory was the remapped checkout path (/home/alice/.hermes/hermes-agent). That is the brittle pin this PR fixes — the system unit now anchors cwd at the target user's HERMES_HOME (/home/alice/.hermes). The test's intent (no root-home leak, target-user paths present) is unchanged and still holds.	2026-05-29 12:36:59 -07:00
teknium1	a1cb5fa2c7	fix(gateway): anchor service WorkingDirectory at HERMES_HOME, not the source checkout The systemd unit (and launchd plist) pinned WorkingDirectory to PROJECT_ROOT (the checkout the unit was generated from). When that checkout is transient — a git worktree, or a clone hermes update later relocates/removes — the path rots. systemd then fails the start at the CHDIR step (status=200/CHDIR) BEFORE Python loads, so the on-boot refresh_systemd_unit_if_needed() self-heal never runs and Restart=always crash-loops forever on a dead directory. Observed in the wild: a gateway that crash-looped 153 times overnight, bot offline until a manual 'hermes gateway restart' regenerated the unit. Anchor cwd at HERMES_HOME instead — it never moves, always exists, and the gateway never needed cwd to be the checkout (ExecStart uses an absolute python + -m hermes_cli.main). Existing broken units now differ from the generated unit and self-heal on the next start/restart/update.	2026-05-29 12:36:59 -07:00
Teknium	45b00bb49a	fix(packaging): ship hermes_cli subpackages in wheel (#34811 ) [tool.setuptools.packages.find] listed 'hermes_cli' without the 'hermes_cli.' wildcard, so the wheel shipped hermes_cli/.py but dropped the dashboard_auth and proxy subpackages. The dashboard died on every install with ModuleNotFoundError: No module named 'hermes_cli.dashboard_auth' (#34701); 'hermes proxy' was equally broken. Add the wildcard, and add a regression test that drives setuptools' own find_packages against the live tree so any future subpackage dropped from the include list fails CI instead of a user's container.	2026-05-29 12:36:09 -07:00
teknium1	8836b3a113	fix(cli): widen Windows .bat wrapper fix to custom-name alias path The profile alias --name path in main.py rewrote the wrapper with a hardcoded #!/bin/sh script right after create_wrapper_script(), clobbering the .bat on Windows and reintroducing the exact bug for custom aliases. create_wrapper_script() now takes an optional target so the alias file is named after the alias while the -p content references the profile — one platform-aware code path, no post-hoc rewrite.	2026-05-29 12:32:47 -07:00
liuhao1024	6312dd8c3a	fix(cli): create .bat wrapper on Windows instead of POSIX shell script On Windows, hermes profile create produced a #!/bin/sh script that the shell cannot execute. Now creates a .bat file with @echo off + %* on Windows, and keeps the POSIX shell script on macOS/Linux. Also fixes check_alias_collision to use 'where' instead of 'which' on Windows, and remove_wrapper_script to find .bat files. Fixes #34708	2026-05-29 12:32:47 -07:00
zapabob	30a0d5bc9e	chore(release): map zapabob author email	2026-05-29 12:32:35 -07:00
zapabob	aa283d1e4f	fix(model): isolate custom provider picker credentials	2026-05-29 12:32:35 -07:00
Teknium	2fc2280e63	fix(cli): clarify panel clips choices off-screen on short terminals (#34808 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(cli): clarify panel clips choices off-screen on short terminals The clarify multiple-choice panel is a height-less Window inside a non-full-screen HSplit. When its content exceeds the viewport, prompt_toolkit distributes height per child and clips the panel's tail — where the choices live — so options render invisible/cut off (issue #34645, reported on macOS Terminal.app). Two budget-accounting bugs let the panel overflow: - the compact-chrome decision ignored the question rows, so full chrome (3 blank separators) was kept even with no room - the '… (question truncated)' marker was not counted against the question's row budget, overshooting by one row at a 1-row budget Fix: reserve one question row in the compact decision, count the truncation marker against the budget, and drop the question entirely when the choices alone already exceed the viewport (choices are the must-see content for a selection).	2026-05-29 12:32:31 -07:00
Teknium	27a2c4f36f	fix(mcp): stop reporting false OAuth success when no token was obtained (#34807 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(mcp): stop reporting false OAuth success when no token was obtained `hermes mcp login` reported "Authenticated — N tool(s) available" for servers that serve tools/list without auth (e.g. Google's official Drive MCP server) even when the OAuth flow never completed — dynamic client registration 400'd because the provider doesn't support RFC 7591, so no token was ever acquired. Every real tool call then hung until timeout with no indication of why. Login now verifies a token actually landed on disk after the probe. When it didn't, it warns that authentication didn't complete and shows the config needed to supply a pre-registered client_id/client_secret (the existing, already-supported workaround for DCR-less providers). Adds a docs pitfall for Google Drive / Atlassian-style providers. Fixes #34775	2026-05-29 12:32:19 -07:00
Teknium	1cb850b674	fix(api_server): emit per-turn transcript on run.completed (#34703 ) (#34804 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix(api_server): emit per-turn transcript on run.completed (#34703) WebUI clients lost intermediate (pre-tool-call) assistant text after switching session pages mid-stream. The session-chat SSE stream delivers all assistant text as assistant.delta events under one message_id interleaved with tool.* events, then a single assistant.completed carrying only the final reply — so a client accumulating deltas into one buffer cannot reconstruct intermediate text segments that preceded tool calls, and they vanish from the live view (state.db persists them correctly). run.completed now carries the authoritative per-turn transcript (assistant + tool messages for this turn, in client-safe shape) so any SSE consumer can reconcile its live view against ground truth without a separate GET /messages round-trip. Purely additive — clients that ignore the field are unaffected.	2026-05-29 12:27:49 -07:00
Teknium	b6ed3913d2	feat(skills): categorize tap skills from skills.sh.json grouping sidecar A GitHub tap can ship a repo-root skills.sh.json (the published skills.sh schema) declaring category groupings. The Skills Hub now reads it at index time and uses each grouping title as the skill's category label, instead of the tag-derived guess. Generic: any tap that ships the file gets real categorization — NVIDIA's groupings (Inference AI, Decision Optimization, GPU Development, etc.) flow through automatically. - GitHubSource: _get_skillsh_groupings() fetches+caches the sidecar per repo; _parse_skillsh_groupings() flattens it to {skill_name: title}; _list_skills_in_repo() stamps meta.extra['category']; _meta_to_dict now serializes extra so the category survives the index cache round-trip. - extract-skills.py: prefers extra['category'] over the tag heuristic and exempts sidecar categories from the small-category to Other collapse. - Docs + 12 tests.	2026-05-29 12:24:39 -07:00
Teknium	4de8009ce4	feat(skills): integrate NVIDIA/skills as a trusted skills hub tap NVIDIA/skills is now a default trusted tap in the Hermes Skills Hub — discoverable, browsable, searchable, and auto-updating through the same pipeline that already serves OpenAI, Anthropic, and HuggingFace skills. Rebased onto current main.	2026-05-29 12:24:39 -07:00
Teknium	1596bb287e	fix(dashboard): chat tab works in gated (OAuth) mode (#34793 ) The Chat/TUI dashboard tab showed a false "Session token unavailable" error and never rendered the terminal whenever the dashboard ran in gated mode (OAuth auth gate active, --insecure not set), even though the user was fully authenticated and every other tab worked. Two checks in ChatPage.tsx gated purely on window.__HERMES_SESSION_TOKEN__, which the server intentionally omits in gated mode (web_server.py only injects __HERMES_AUTH_REQUIRED__=true there; the SPA is expected to use cookie auth + a single-use WS ticket). buildWsAuthParam() already resolves WS auth correctly for both modes, but the early bail prevented the effect from ever reaching it. Both checks now also honor __HERMES_AUTH_REQUIRED__: the banner no longer fires and the xterm/WS effect no longer bails in gated mode. Reported-by: wbrione <wbrione@users.noreply.github.com> Closes #34755	2026-05-29 12:19:51 -07:00
Teknium	90b3c54de9	fix: drain thread no longer crashes on fd-less stdout streams (#34789 ) * docs(code-execution): document HERMES_* env narrowing + passthrough workaround The execute_code sandbox-child env scrub (`108397726`, #27303) deliberately dropped the broad HERMES_ prefix passthrough, keeping only an operational 4-var allowlist (HERMES_HOME/PROFILE/CONFIG/ENV). A script that relied on a non-secret HERMES_* var (HERMES_BASE_URL, HERMES_KANBAN_DB, HERMES__WEBHOOK, or a plugin-defined one) now sees it unset in the child. Document the behavior change and the two recovery routes (terminal.env_passthrough in config.yaml, or required_environment_variables in skill frontmatter), plus the debug log line that surfaces the drop for diagnosis. fix: drain thread no longer crashes on fd-less stdout streams The _wait_for_process drain thread called proc.stdout.fileno() unconditionally. ProcessHandle implementations whose stdout is not backed by a real OS fd (iterator-style in-memory streams, mock procs) raised 'list_iterator' object has no attribute 'fileno' (or 'fileno() returned a non-integer' from select.select), killing the daemon thread and silently losing all process output. Resolve the fd defensively at the top of _drain; when stdout has no usable integer fileno, fall back to draining it as an iterable (the legacy 'for line in proc.stdout' contract). The real subprocess / os.pipe-backed select() fast path is unchanged.	2026-05-29 12:16:57 -07:00
teknium1	5641ae6469	chore(release): add AUTHOR_MAP entries for Bucket-1 docs salvage contributors	2026-05-29 12:06:22 -07:00
Twanislas	549a69a925	docs(curator): align 'agent-created' definition with actual provenance semantics The curator docs stated that any skill not bundled/hub-installed was 'agent-created' and subject to curation — including foreground-created skills and hand-written ones. Since PR #19621 (May 2026), the curator requires an explicit marker in .usage.json, which only the background self-improvement review fork sets. Changes: - Rewrite 'What agent-created means' to document the 3-step eligibility check (not bundled + not hub + created_by=agent marker) - Explain that foreground skill_manage(create) does NOT mark skills as agent-created (user-directed by design) - Warn that hand-written skills are NOT curated - Add note in Per-run reports explaining the '(not resolved)' display when no candidates exist (LLM pass skipped, not a config error) - Link to skill_provenance.py for the write-origin ContextVar Ref: PR #19621, tools/skill_provenance.py, tools/skill_manager_tool.py	2026-05-29 12:06:22 -07:00
Aman113114-IITD	3f0d44af8a	docs: replace invalid 'hermes config get <key>' with 'hermes config show' 'hermes config get <key>' is referenced in three guides but is not a valid subcommand. The valid subcommands under 'hermes config' are {show,edit,set,path,env-path,check,migrate}. 'hermes config show' is already used elsewhere in the docs (including 'hermes config show \| grep <pattern>' in the FAQ), so it's the idiomatic replacement. - work-with-skills.md: 'View all skill config' now uses 'hermes config show \| grep ^skills\.config' - migrate-from-openclaw.md: session-policy check now reads the value from 'hermes config show' - configuring-models.md: 'inspect what the CLI will actually use' now uses 'hermes config show \| grep ^model\.' Refs #30195	2026-05-29 12:06:22 -07:00
HKPA	eff4626747	fix(docs): add baseUrl prefix to SVG image paths in sessions and CLI pages Fixes #24809 The docs site uses baseUrl='/docs/' but the <img> tags in sessions.md and cli.md referenced images at /img/docs/... which resolves to a 404. The static files are served at /docs/img/docs/... instead. Before: <img src="/img/docs/session-recap.svg"> → 404 After: <img src="/docs/img/docs/session-recap.svg"> → 200 Also fixes cli-layout.svg which had the same issue.	2026-05-29 12:06:22 -07:00
aqilaziz	175885218e	fix(docs): align fallback provider config examples Use the current top-level fallback_providers list in fallback docs and keep fallback_model documented only as the legacy compatibility shape. Also align cron and delegation fallback coverage with current runtime behavior. Closes #19691 Co-authored-by: Codex <codex@openai.com>	2026-05-29 12:06:22 -07:00
helix4u	119390a2a1	docs(config): deprecate MESSAGING_CWD guidance	2026-05-29 12:06:22 -07:00
helix4u	3625dbb844	docs(security): update redaction skill source	2026-05-29 12:06:22 -07:00
helix4u	aef04b2b53	docs(security): fix secret redaction default docs	2026-05-29 12:06:22 -07:00
TonyPepe	a2d3cff53f	docs(cli): refine update gateway restart wording	2026-05-29 12:06:22 -07:00
TonyPepe	ee0a9bf7c7	docs(cli): align hermes update flags	2026-05-29 12:06:22 -07:00
WadydX	b922e3ff93	docs(prompt): align precedence docs with system prompt runtime - Replace outdated linear ordering in prompt-assembly guide with current stable/context/volatile tier contract from system_prompt.py - Clarify where memory/profile snapshots live versus skills guidance - Document that pre_llm_call context is user-message injection, not cached system-prompt mutation - Update architecture guide wording to reference system_prompt.py + prompt_builder.py tiered assembly Closes #34118	2026-05-29 12:06:22 -07:00
Octavio Turra	053969fd53	Correct URL format for simplex-chat download Fix download link for Linux/macOS binary in documentation.	2026-05-29 12:06:22 -07:00
alelpoan	988cf1743b	fix(docs): replace channel link with actual playlist URL in quickstart	2026-05-29 12:06:22 -07:00
kurobaryo	03bdeaa876	docs: fix BROWSERBASE_SESSION_TIMEOUT unit (ms → seconds)	2026-05-29 12:06:22 -07:00
haran2001	d86710528a	docs(google-workspace): fix dead gws CLI link to googleworkspace/cli The Google Workspace skill doc linked to https://github.com/nicholasgasior/gws which returns 404. The actual upstream CLI lives at https://github.com/googleworkspace/cli (the official Google Workspace CLI in Rust, dynamically built from the Google Discovery Service). Closes #28922	2026-05-29 12:06:22 -07:00
Niels Kaspers	6891e05e78	docs: fix session recap image baseUrl	2026-05-29 12:06:22 -07:00
hllqkb	0673638560	fix(docs): correct GitHub org links in memory-providers.md hermes-ai/hermes-agent → NousResearch/hermes-agent (2 occurrences). The old org name leads to 404 pages.	2026-05-29 12:06:22 -07:00
Hashclaw	ae9dfa510e	docs: fix separate typo; hyphenate built-in trust wording - ACL LaTeX template comment: seperate -> separate - CONTRIBUTING and docs site: builtin trust -> built-in trust (prose/table cells) Made-with: Cursor	2026-05-29 12:06:22 -07:00
kshitij	7379f17556	fix(gateway): only fire planned-stop watcher for self-targeting markers + fix Windows consume (#34749 ) * fix(gateway): only fire planned-stop watcher for markers targeting self Salvaged from #34599 — rebased onto current main. The planned-stop watcher now only fires shutdown for a marker that targets the current process, instead of any marker that exists on disk. Fixes the Windows crash loop (#34597) where a stale marker from a previous Gateway instance kills a freshly booted Gateway ~400ms after start with a false "Received UNKNOWN — initiating shutdown". Co-authored-by: Bartok9 <danielrpike9@gmail.com> * fix(gateway): match planned-stop/takeover markers by PID alone when start_time is unavailable Follow-up to the #34599 salvage. The watcher's non-destructive probe (planned_stop_marker_targets_self) already falls back to PID equality when a process start_time is unavailable, but the authoritative consume it gates (_consume_pid_marker_for_self) still required a non-None start_time match. _get_process_start_time reads /proc/<pid>/stat and returns None on macOS and native Windows — the only platform the planned-stop watcher exists for. So on Windows the probe would fire the shutdown handler (PID matches) but the handler's consume_planned_stop_marker_for_self() would return False, and a legitimate 'hermes gateway stop' was still misclassified as an unexpected UNKNOWN exit (exit 1) and revived by the service manager — a residual half of the #34597 crash loop on the legitimate-stop path. Align the consume with the probe: when both start_times are known they must match (PID-reuse guard preserved on Linux); when either is unavailable, fall back to PID equality alone, bounded by the existing short marker TTL. This also fixes the parallel --replace takeover consume on Windows, which shares the same helper. Adds regression tests for the Windows (None start_time) path, the foreign-PID rejection under that fallback, and confirmation the start_time-mismatch guard still rejects when both are known. --------- Co-authored-by: Bartok9 <danielrpike9@gmail.com>	2026-05-29 17:36:58 +00:00
alt-glitch	0563ab0652	fix(test): add fal_client.submit stub to surface matrix test The plugin switched from fal_client.subscribe() to submit()+handle.get(). The test mock only had subscribe, causing CI failures.	2026-05-29 22:26:24 +05:30
alt-glitch	e46e4bcf47	fix(video_gen): parse duration suffix in success_response int(payload["duration"]) blows up on "4s" (veo3.1 format). Strip non-digit chars before int conversion in the response builder.	2026-05-29 22:26:24 +05:30
alt-glitch	3183b2e28c	fix(video_gen): veo3.1 duration format and 4k resolution FAL veo3.1 API expects duration as "4s"/"6s"/"8s" (with unit suffix), not bare "4"/"6"/"8" like other families. Add per-family duration_suffix field and apply it in _build_payload. Also add "4k" to veo3.1 resolutions per FAL API docs. Note: the managed gateway currently rejects the "4s" format (expects integer duration). Gateway-side fix needed for veo3.1 to work through the Nous subscription path.	2026-05-29 22:26:24 +05:30
alt-glitch	a4c18f65d4	feat(video_gen): wire Nous subscription override into hermes tools UX Add the same managed-gateway UX that image_gen already has: - TOOL_CATEGORIES['video_gen'] gets a 'Nous Subscription' provider row with managed_nous_feature='video_gen' + video_gen_plugin_name='fal' - NousSubscriptionFeatures gains a video_gen property + feature state computation (managed/active/available using the fal-queue gateway) - _GATEWAY_TOOL_LABELS, _GATEWAY_DIRECT_LABELS, _ALL_GATEWAY_KEYS, _get_gateway_direct_credentials, opted_in all include video_gen - apply_nous_managed_defaults and apply_gateway_defaults handle video_gen - _is_toolset_satisfied checks Nous features for video_gen - _is_provider_active detects managed video_gen (use_gateway + fal provider) - _select_plugin_video_gen_provider accepts use_gateway kwarg, propagated from all 4 call sites in _configure_provider when managed_feature is set - hermes setup status shows 'Video Generation (FAL via Nous subscription)' Users on a Nous subscription can now pick 'Nous Subscription' under hermes tools → Video Generation, which sets video_gen.provider=fal + video_gen.use_gateway=true. The FAL plugin's _resolve_managed_fal_video_gateway then routes through the managed queue gateway — no FAL_KEY needed.	2026-05-29 22:26:24 +05:30
alt-glitch	b6294ea9f1	test(video_gen): cover gateway decision matrix gaps and 4xx error path - Add test for 4xx ValueError with actionable remediation message - Add test for is_available() returning True via managed gateway - Add test for prefers_gateway overriding direct FAL_KEY - Add test for is_available() via gateway in plugin test file	2026-05-29 22:26:24 +05:30
alt-glitch	d04b3c193e	feat(video_gen): route FAL video gen through managed Nous gateway Wire plugins/video_gen/fal/__init__.py to use the same _ManagedFalSyncClient pattern that image gen already uses. Changes: - Add managed gateway resolution, client caching, and _submit_fal_video_request() that routes between direct FAL_KEY and Nous gateway modes - Update is_available() to return True when either FAL_KEY or the managed gateway is reachable - Update generate() to use submit+get handle pattern instead of fal_client.subscribe() directly - Fix happy-horse endpoint namespace: fal-ai/ → alibaba/ (matches the tool-gateway allowlist from fal-video-gen branch) - Surface actionable error on 4xx gateway rejections Tests: - 4 new tests in test_managed_media_gateways.py (gateway routing, client reuse, direct mode fallback, alibaba namespace) - Updated existing test_fal_plugin.py fixture to use submit/handle pattern and patch _resolve_managed_fal_video_gateway for isolation	2026-05-29 22:26:24 +05:30
kshitijk4poor	5cd0673217	ci: harden supply-chain gate jobs against changes-job failure The scan-gate / dep-bounds-gate jobs use needs.changes; if the changes job itself fails, its dependents would be skipped via a failed dependency (not a conditional skip), leaving the required check unreported — the same "pending forever" failure this PR fixes. Add always() and switch the gate condition from == 'false' to != 'true' so the gate still fires (and reports SUCCESS) when changes fails and its output is empty.	2026-05-29 09:17:01 -07:00
ethernet	6bc309baf2	ci: ensure required checks always report status Remove paths filters from contributor-check and supply-chain-audit workflows. When no matching files changed, the workflows never ran and the required checks (check-attribution, supply chain scan, dep bounds) stayed "pending" forever, blocking merge. Now both workflows always trigger. A path-check step/job determines whether the real work should run; gate jobs with matching names report success when the real job was skipped, so branch protection always gets a check status. Also fixes dep-bounds: the old condition if: contains(github.event.pull_request.changed_files_url, 'pyproject.toml') \|\| true was always true (the \|\| true made it unconditional). Now uses the proper changes.deps output from the shared filter job.	2026-05-29 09:17:01 -07:00

1 2 3 4 5 ...

9986 commits