hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-20 10:11:58 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	af0f4a52fe	feat: cute spinners	2026-04-08 13:45:34 -05:00
Brooklyn Nicholson	b50d81f212	fix: diff colours	2026-04-08 12:11:55 -05:00
Brooklyn Nicholson	a9fa054df9	chore: uptick	2026-04-08 10:35:07 -05:00
Brooklyn Nicholson	31cb23890a	Merge branch 'feat/ink-refactor' of github.com:NousResearch/hermes-agent into feat/ink-refactor	2026-04-08 09:46:46 -05:00
Brooklyn Nicholson	a3cfb1de86	feat: auto install tui deps	2026-04-08 09:46:40 -05:00
Austin Pickett	371efafc46	feat: personality	2026-04-08 00:15:15 -04:00
Austin Pickett	ebd2d83ef2	feat: add skin logo support	2026-04-07 23:59:11 -04:00
Brooklyn Nicholson	af077b2c0d	fix: history up arrow	2026-04-07 20:47:59 -05:00
Brooklyn Nicholson	2d884ff12d	chore: uptick	2026-04-07 20:46:59 -05:00
Brooklyn Nicholson	b397c91d4a	chore: uptick	2026-04-07 20:44:18 -05:00
Brooklyn Nicholson	9c2c9e3a3e	chore: fmt	2026-04-07 20:30:22 -05:00
Brooklyn Nicholson	c3eeb03e26	chore: clean exit	2026-04-07 20:29:31 -05:00
Brooklyn Nicholson	d9d0ac06b9	chore: readme update	2026-04-07 20:24:46 -05:00
Brooklyn Nicholson	29f2610e4b	tui updates for rendering pipeline	2026-04-07 20:11:05 -05:00
Brooklyn Nicholson	dcb97f7465	chore: readme	2026-04-06 18:52:45 -05:00
Brooklyn Nicholson	86308b6de4	chore: better command support	2026-04-06 18:49:40 -05:00
Brooklyn Nicholson	2d349bbf7a	chore: fmt	2026-04-06 18:43:00 -05:00
Brooklyn Nicholson	39878aff00	chore: uptick	2026-04-06 18:40:21 -05:00
Brooklyn Nicholson	afd670a36f	feat: small refactors	2026-04-06 18:38:13 -05:00
Brooklyn Nicholson	e2b3b1c5e4	Merge branch 'main' of github.com:NousResearch/hermes-agent into feat/ink-refactor	2026-04-06 17:56:45 -05:00
Siddharth Balyan	e651e04100	fix(nix): read version, regen uv.lock, fix packages.nix to add hermes_logging (#5651 ) * - read version from pyproject for nix - regen uv.lock - add hermes_logging to packages.nix * fix secret regen w/ sops	2026-04-07 04:21:19 +05:30
Siddharth Balyan	7b129636f0	feat(tools): add Firecrawl cloud browser provider (#5628 ) * feat(tools): add Firecrawl cloud browser provider Adds Firecrawl (https://firecrawl.dev) as a cloud browser provider alongside Browserbase and Browser Use. All browser tools route through Firecrawl's cloud browser via CDP when selected. - tools/browser_providers/firecrawl.py — FirecrawlProvider - tools/browser_tool.py — register in _PROVIDER_REGISTRY - hermes_cli/tools_config.py — add to onboarding provider picker - hermes_cli/setup.py — add to setup summary - hermes_cli/config.py — add FIRECRAWL_BROWSER_TTL config - website/docs/ — browser docs and env var reference Based on #4490 by @developersdigest. Co-Authored-By: Developers Digest <124798203+developersdigest@users.noreply.github.com> * refactor: simplify FirecrawlProvider.emergency_cleanup Use self._headers() and self._api_url() instead of duplicating env-var reads and header construction. * fix: recognize Firecrawl in subscription browser detection _resolve_browser_feature_state() now handles "firecrawl" as a direct browser provider (same pattern as "browser-use"), so hermes setup summary correctly shows "Browser Automation (Firecrawl)" instead of misreporting as "Local browser". Also fixes test_config_version_unchanged assertion (11 → 12). --------- Co-authored-by: Developers Digest <124798203+developersdigest@users.noreply.github.com>	2026-04-07 02:35:26 +05:30
Teknium	150f70f821	feat(skills): add skill config interface + llm-wiki skill (#5635 ) Skills can now declare config.yaml settings via metadata.hermes.config in their SKILL.md frontmatter. Values are stored under skills.config.* namespace, prompted during hermes config migrate, shown in hermes config show, and injected into the skill context at load time. Also adds the llm-wiki skill (Karpathy's LLM Wiki pattern) as the first skill to use the new config interface, declaring wiki.path. Skill config interface (new): - agent/skill_utils.py: extract_skill_config_vars(), discover_all_skill_config_vars(), resolve_skill_config_values(), SKILL_CONFIG_PREFIX - agent/skill_commands.py: _inject_skill_config() injects resolved values into skill messages as [Skill config: ...] block - hermes_cli/config.py: get_missing_skill_config_vars(), skill config prompting in migrate_config(), Skill Settings in show_config() LLM Wiki skill (skills/research/llm-wiki/SKILL.md): - Three-layer architecture (raw sources, wiki pages, schema) - Three operations (ingest, query, lint) - Session orientation, page thresholds, tag taxonomy, update policy, scaling guidance, log rotation, archiving workflow Docs: creating-skills.md, configuration.md, skills.md, skills-catalog.md Closes #5100	2026-04-06 13:49:13 -07:00
Mikita Lisavets	29b5ec2555	fix: clear session-scoped model after session reset	2026-04-06 13:20:01 -07:00
Mikita Lisavets	9afb9a6cb2	fix: clear session-scoped model overrides during session reset	2026-04-06 13:20:01 -07:00
donrhmexe	2c814d7b5d	fix: /model --global writes model.name instead of model.default The canonical config key for model name is model.default (used by setup, auth, runtime_provider, profile list, and CLI startup). But /model --global wrote to model.name in both gateway and CLI paths. This caused: - hermes profile list showing the old model (reads model.default) - Gateway restart reverting to the old model (_resolve_gateway_model reads model.default) - CLI startup using the old model (main.py reads model.default) The only reason it appeared to work in Telegram was the cached agent staying alive with the in-place switch. Fix: change all 3 write/read sites to use model.default.	2026-04-06 13:20:01 -07:00
BongSuCHOI	ad567c9a8f	fix: subagent toolset inheritance when parent enabled_toolsets is None When parent_agent.enabled_toolsets is None (the default, meaning all tools are enabled), subagents incorrectly fell back to DEFAULT_TOOLSETS (['terminal', 'file', 'web']) instead of inheriting the parent's full toolset. Root cause: - Line 188 used 'or' fallback: None or DEFAULT_TOOLSETS evaluates to DEFAULT_TOOLSETS - Line 192 checked truthiness: None is falsy, falling through to else Fix: - Use 'is not None' checks instead of truthiness - When enabled_toolsets is None, derive effective toolsets from parent_agent.valid_tool_names via the tool registry Fixes the bug introduced in `f75b1d21b` and repeated in `e5d14445e` (PR #3269).	2026-04-06 13:20:01 -07:00
donrhmexe	ff655de481	fix: model alias fallback uses authenticated providers instead of hardcoded openrouter/nous When an alias like 'claude' can't be resolved on the current provider, _resolve_alias_fallback() tries other providers. Previously it hardcoded ('openrouter', 'nous') — so '/model claude' on z.ai would resolve to openrouter even if the user doesn't have openrouter credentials but does have anthropic. Now the fallback uses the user's actual authenticated providers (detected via list_authenticated_providers which is backed by the models.dev in-memory cache). If no authenticated providers are found, falls back to the old ('openrouter', 'nous') for backwards compatibility. New helper: get_authenticated_provider_slugs() returns just the slug strings from list_authenticated_providers().	2026-04-06 13:20:01 -07:00
Ayman Kamal	96f85b03cd	fix: handle launchctl kickstart exit code 113 in launchd_start() launchctl kickstart returns exit code 113 ("Could not find service") when the plist exists but the job hasn't been bootstrapped into the runtime domain. The existing recovery path only caught exit code 3 ("unloaded"), causing an unhandled CalledProcessError. Exit code 113 means the same thing practically -- the service definition needs bootstrapping before it can be kicked. Add it to the same recovery path that already handles exit 3, matching the existing pattern in launchd_stop(). Follow-up: add a unit test covering the 113 recovery path.	2026-04-06 13:20:01 -07:00
Dusk1e	1a2f109d8e	Ensure atomic writes for gateway channel directory cache to prevent truncation	2026-04-06 13:20:01 -07:00
Mariano A. Nicolini	af9a9f773c	fix(security): sanitize workdir parameter in terminal tool backends Shell injection via unquoted workdir interpolation in docker, singularity, and SSH backends. When workdir contained shell metacharacters (e.g. ~/;id), arbitrary commands could execute. Changes: - Add shlex.quote() at each interpolation point in docker.py, singularity.py, and ssh.py with tilde-aware quoting (keep ~ unquoted for shell expansion, quote only the subpath) - Add _validate_workdir() allowlist in terminal_tool.py as defense-in-depth before workdir reaches any backend Original work by Mariano A. Nicolini (PR #5620). Salvaged with fixes for tilde expansion (shlex.quote breaks cd ~/path) and replaced incomplete deny-list with strict character allowlist. Co-authored-by: Mariano A. Nicolini <entropidelic@users.noreply.github.com>	2026-04-06 13:19:22 -07:00
Teknium	537a2b8bb8	docs: add WSL2 networking guide for local model servers (#5616 ) Windows users running Hermes in WSL2 with model servers on the Windows host hit 'connection refused' because WSL2's NAT networking means localhost points to the VM, not Windows. Covers: - Mirrored networking mode (Win 11 22H2+) — makes localhost work - NAT mode fallback using the host IP via ip route - Per-server bind address table (Ollama, LM Studio, llama-server, vLLM, SGLang) - Detailed Ollama Windows service config for OLLAMA_HOST - Windows Firewall rules for WSL2 connections - Quick verification steps - Cross-reference from Troubleshooting section	2026-04-06 13:01:18 -07:00
Teknium	261e2ee862	fix: restore Path import in env_passthrough.py (removed by #5526 ) The ContextVar migration removed 'from pathlib import Path' but Path is still used in _load_config_passthrough(). Without this import, config-based env passthrough would raise NameError.	2026-04-06 12:42:16 -07:00
Awsh1	878b1d3d33	fix(cron): harden scheduler against path traversal and env leaks Cherry-picked from PR #5503 by Awsh1. - Validate ALL script paths (absolute, relative, tilde) against scripts_dir boundary - Add API-boundary validation in cronjob_tools.py - Move os.environ injections inside try block so finally cleanup always runs - Comprehensive regression tests for path containment bypass	2026-04-06 12:42:16 -07:00
Dusk1e	7d0953d6ff	security(gateway): isolate env/credential registries using ContextVars	2026-04-06 12:42:16 -07:00
Teknium	da02a4e283	fix: auxiliary client payment fallback — retry with next provider on 402 (#5599 ) When a user runs out of OpenRouter credits and switches to Codex (or any other provider), auxiliary tasks (compression, vision, web_extract) would still try OpenRouter first and fail with 402. Two fixes: 1. Payment fallback in call_llm(): When a resolved provider returns HTTP 402 or a credit-related error, automatically retry with the next available provider in the auto-detection chain. Skips the depleted provider and tries Nous → Custom → Codex → API-key providers. 2. Remove hardcoded OpenRouter fallback: The old code fell back specifically to OpenRouter when auto/custom resolution returned no client. Now falls back to the full auto-detection chain, which handles any available provider — not just OpenRouter. Also extracts _get_provider_chain() as a shared function (replaces inline tuple in _resolve_auto and the new fallback), built at call time so test patches on _try_* functions remain visible. Adds 16 tests covering _is_payment_error(), _get_provider_chain(), _try_payment_fallback(), and call_llm() integration with 402 retry.	2026-04-06 12:41:40 -07:00
Teknium	8ffd44a6f9	feat(discord): register skills as native slash commands via shared gateway logic (#5603 ) Centralize the skill → slash command registration that Telegram already had in commands.py so Discord uses the exact same priority system, filtering, and cap enforcement: 1. Core/built-in commands (never trimmed) 2. Plugin commands (never trimmed) 3. Skill commands (fill remaining slots, alphabetical, only tier trimmed) Changes: hermes_cli/commands.py: - Rename _TG_NAME_LIMIT → _CMD_NAME_LIMIT (32 chars shared by both platforms) - Rename _clamp_telegram_names → _clamp_command_names (generic) - Extract _collect_gateway_skill_entries() — shared plugin + skill collection with platform filtering, name sanitization, description truncation, and cap enforcement - Refactor telegram_menu_commands() to use the shared helper - Add discord_skill_commands() that returns (name, desc, cmd_key) triples - Preserve _sanitize_telegram_name() for Telegram-specific name cleaning gateway/platforms/discord.py: - Call discord_skill_commands() from _register_slash_commands() - Create app_commands.Command per skill entry with cmd_key callback - Respect 100-command global Discord limit - Log warning when skills are skipped due to cap Backward-compat aliases preserved for _TG_NAME_LIMIT and _clamp_telegram_names. Tests: 9 new tests (7 Discord + 2 backward-compat), 98 total pass. Inspired by PR #5498 (sprmn24). Closes #5480.	2026-04-06 12:09:36 -07:00
Julien Talbot	92c19924a9	feat: add xAI prompt caching via x-grok-conv-id header When using xAI's API directly (base_url contains x.ai), send the x-grok-conv-id header set to the Hermes session_id. This routes consecutive requests to the same server, maximizing automatic prompt cache hits. Ref: https://docs.x.ai/developers/advanced-api-usage/prompt-caching	2026-04-06 12:06:33 -07:00
SHL0MS	0afa3a87d4	Merge pull request #5600 from SHL0MS/feat/p5js-skill feat(skills): add p5js creative coding skill	2026-04-06 14:52:27 -04:00
Teknium	3d08a2fa1b	fix: extract MEDIA: tags from cron delivery before sending (#5598 ) The cron scheduler delivery path passed raw text including MEDIA: tags to _send_to_platform(), so media attachments were delivered as literal text instead of actual files. The send function already supports media_files= but the cron path never used it. Now calls BasePlatformAdapter.extract_media() to split media paths from text before sending, matching the gateway's normal message flow. Salvaged from PR #4877 by robert-hoffmann.	2026-04-06 11:42:44 -07:00
kshitijk4poor	5e88eb2ba0	fix(signal): implement send_image_file, send_voice, and send_video for MEDIA: tag delivery The Signal adapter inherited base class defaults for send_image_file(), send_voice(), and send_video() which only sent the file path as text (e.g. '🖼️ Image: /tmp/chart.png') instead of actually delivering the file as a Signal attachment. When agent responses contain MEDIA:/path/to/file tags, the gateway media pipeline extracts them and routes through these methods by file type. Without proper overrides, image/audio/video files were never actually delivered to Signal users. Extract a shared _send_attachment() helper that handles all file validation, size checking, group/DM routing, and RPC dispatch. The four public methods (send_document, send_image_file, send_voice, send_video) now delegate to this helper, following the same pattern used by WhatsApp (_send_media_to_bridge) and Discord (_send_file_attachment). The helper also uses a single stat() call with try/except FileNotFoundError instead of the previous exists() + stat() two-syscall pattern, eliminating a TOCTOU race. As a bonus, send_document() now gains the 100MB size check that was previously missing (inconsistency with send_image). Add 25 tests covering all methods plus MEDIA: tag extraction integration, method-override guards, and send_document's new size check. Fixes #5105	2026-04-06 11:41:34 -07:00
SHL0MS	17e2a27c51	feat(skills): add p5js creative coding skill Production pipeline for interactive and generative visual art using p5.js. Covers 7 modes: generative art, data visualization, interactive experiences, animation/motion graphics, 3D scenes, image processing, and audio-reactive. Includes: - SKILL.md with creative standard, pipeline, and critical implementation notes - 10 reference files covering core API, shapes, visual effects (noise, flow fields, particles, domain warp, attractors, L-systems, circle packing, bloom, reaction-diffusion), animation (easing, springs, state machines, scene transitions), typography, color systems, WebGL/3D/shaders, interaction, and comprehensive export pipeline - Deterministic headless frame capture via Puppeteer (noLoop + redraw) - ffmpeg render pipeline for MP4 video export - Per-clip architecture for multi-scene video production - Interactive viewer template with seed navigation and parameter controls - Performance guidance: FES disable, Math.* hot loops, per-pixel budgets - Addon library coverage: p5.brush, p5.grain, CCapture.js, p5.js-svg - fxhash/Art Blocks generative platform conventions - p5.js 2.0 migration guide (async setup, OKLCH, splineVertex, shader.modify) - 13 documented common mistakes and troubleshooting patterns 17 files, ~5,900 lines.	2026-04-06 14:39:00 -04:00
kshitijk4poor	214e60c951	fix: sanitize Telegram command names to strip invalid characters Telegram Bot API requires command names to contain only lowercase a-z, digits 0-9, and underscores. Skill/plugin names containing characters like +, /, @, or . caused set_my_commands to fail with Bot_command_invalid. Two-layer fix: - scan_skill_commands(): strip non-alphanumeric/non-hyphen chars from cmd_key at source, collapse consecutive hyphens, trim edges, skip names that sanitize to empty string - _sanitize_telegram_name(): centralized helper used by all 3 Telegram name generation sites (core commands, plugin commands, skill commands) with empty-name guard at each call site Closes #5534	2026-04-06 11:27:28 -07:00
ClintonEmok	f77be22c65	Fix #5211 : Preserve dots in OpenCode Go model names OpenCode Go model names with dots (minimax-m2.7, glm-4.5, kimi-k2.5) were being mangled to hyphens (minimax-m2-7), causing HTTP 401 errors. Two code paths were affected: 1. model_normalize.py: opencode-go was incorrectly in DOT_TO_HYPHEN_PROVIDERS 2. run_agent.py: _anthropic_preserve_dots() did not check for opencode-go Fix: - Remove opencode-go from _DOT_TO_HYPHEN_PROVIDERS (dots are correct for Go) - Add opencode-go to _anthropic_preserve_dots() provider check - Add opencode.ai/zen/go to base_url fallback check - Add regression tests in tests/test_model_normalize.py Co-authored-by: jacob3712 <jacob3712@users.noreply.github.com>	2026-04-06 11:25:06 -07:00
Teknium	582dbbbbf7	feat: add grok to TOOL_USE_ENFORCEMENT_MODELS for direct xAI usage (#5595 ) Grok models (x-ai/grok-4.20-beta, grok-code-fast-1) now receive tool-use enforcement guidance, steering them to actually call tools instead of describing intended actions. Matches both OpenRouter (x-ai/grok-*) and direct xAI API usage.	2026-04-06 11:22:07 -07:00
SHL0MS	0bac07ded3	Merge pull request #5588 from SHL0MS/feat/manim-skill-deep-expansion docs(manim-video): add 5 new reference files — design thinking, updaters, paper explainer, decorations, production quality	2026-04-06 13:58:00 -04:00
SHL0MS	a912cd4568	docs(manim-video): add 5 new reference files — design thinking, updaters, paper explainer, decorations, production quality Five new reference files expanding the skill from rendering knowledge into production methodology: animation-design-thinking.md (161 lines): When to animate vs show static, concept decomposition into visual beats, pacing rules, narration sync, equation reveal strategies, architecture diagram patterns, common design mistakes. updaters-and-trackers.md (260 lines): Deep ValueTracker mental model, lambda/time-based/always_redraw updaters, DecimalNumber and Variable live displays, animation-based updaters, 4 complete practical patterns (dot tracing, live area, connected diagram, parameter exploration). paper-explainer.md (255 lines): Full workflow for turning research papers into animations. Audience selection, 5-minute template, pre-code gates (narration, scene list, style contract), equation reveal strategies, architecture diagram building, results animation, domain-specific patterns for ML/physics/ biomedical papers. decorations.md (202 lines): SurroundingRectangle, BackgroundRectangle, Brace, arrows (straight, curved, labeled), DashedLine, Angle/RightAngle, Cross, Underline, color highlighting workflows, annotation lifecycle pattern. production-quality.md (190 lines): Pre-code, pre-render, post-render checklists. Text overlap prevention, spatial layout coordinate budget, max simultaneous elements, animation variety audit, tempo curve, color consistency, data viz minimums. Total skill now: 14 reference files, 2614 lines.	2026-04-06 13:51:36 -04:00
Teknium	cc7136b1ac	fix: update Gemini model catalog + wire models.dev as live model source Follow-up for salvaged PR #5494: - Update model catalog to Gemini 3.x + Gemma 4 (drop deprecated 2.0) - Add list_agentic_models() to models_dev.py with noise filter - Wire models.dev into _model_flow_api_key_provider as primary source (static curated list serves as offline fallback) - Add gemini -> google mapping in PROVIDER_TO_MODELS_DEV - Fix Gemma 4 context lengths to 256K (models.dev values) - Update auxiliary model to gemini-3-flash-preview - Expand tests: 3.x catalog, context lengths, models.dev integration	2026-04-06 10:28:03 -07:00
Teknium	6dfab35501	feat(providers): add Google AI Studio (Gemini) as a first-class provider Cherry-picked from PR #5494 by kshitijk4poor. Adds native Gemini support via Google's OpenAI-compatible endpoint. Zero new dependencies.	2026-04-06 10:28:03 -07:00
SHL0MS	85973e0082	fix(nous): don't use OAuth access_token as inference API key When agent_key is missing from auth state (expired, not yet minted, or mint failed silently), the fallback chain fell through to access_token — an OAuth bearer token for the Nous portal API, not an inference credential. The Nous inference API returns 404 because the OAuth token is not a valid inference key. Remove the access_token fallback so an empty agent_key correctly triggers resolve_nous_runtime_credentials() to mint a fresh key. Closes #5562	2026-04-06 10:04:02 -07:00

1 2 3 4 5 ...

3386 commits