hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-29 01:31:41 +00:00

Author	SHA1	Message	Date
teknium1	363633e2ba	fix: allow self-hosted Firecrawl without API key + add self-hosting docs On top of PR #460: self-hosted Firecrawl instances don't require an API key (USE_DB_AUTHENTICATION=false), so don't force users to set a dummy FIRECRAWL_API_KEY when FIRECRAWL_API_URL is set. Also adds a proper self-hosting section to the configuration docs explaining what you get, what you lose, and how to set it up (Docker stack, tradeoffs vs cloud). Added 2 more tests (URL-only without key, neither-set raises).	2026-03-05 16:44:21 -08:00
teknium1	a41ba57a7a	Merge PR #460 : feat(tools): add support for self-hosted firecrawl Authored by caentzminger. Adds optional FIRECRAWL_API_URL env var to point the Firecrawl client at a self-hosted instance instead of the cloud API.	2026-03-05 16:41:30 -08:00
teknium1	884c8ea70a	chore: add openai/gpt-5.4 to OpenRouter preferred models list	2026-03-05 16:13:45 -08:00
teknium1	c886333d32	feat: smart context length probing with persistent caching + banner display Replaces the unsafe 128K fallback for unknown models with a descending probe strategy (2M → 1M → 512K → 200K → 128K → 64K → 32K). When a context-length error occurs, the agent steps down tiers and retries. The discovered limit is cached per model+provider combo in ~/.hermes/context_length_cache.yaml so subsequent sessions skip probing. Also parses API error messages to extract the actual context limit (e.g. 'maximum context length is 32768 tokens') for instant resolution. The CLI banner now displays the context window size next to the model name (e.g. 'claude-opus-4 · 200K context · Nous Research'). Changes: - agent/model_metadata.py: CONTEXT_PROBE_TIERS, persistent cache (save/load/get), parse_context_limit_from_error(), get_next_probe_tier() - agent/context_compressor.py: accepts base_url, passes to metadata - run_agent.py: step-down logic in context error handler, caches on success - cli.py + hermes_cli/banner.py: context length in welcome banner - tests: 22 new tests for probing, parsing, and caching Addresses #132. PR #319's approach (8K default) rejected — too conservative.	2026-03-05 16:09:57 -08:00
teknium1	55b173dd03	refactor: move shutil import to module level Cleanup on top of PR #305 — replace two inline 'import shutil as _shutil' with a single module-level import.	2026-03-05 15:57:05 -08:00
dmahan93	9079a27814	fix: prompt box and response box span full terminal width on wide screens - Replace hardcoded '─' * 200 horizontal rules with Window(char='─') so prompt_toolkit fills the entire terminal width automatically - Use shutil.get_terminal_size().columns instead of Rich Console.width for response box, separator line, and input height calculation (more reliable inside patch_stdout context)	2026-03-05 15:57:05 -08:00
caentzminger	d7d10b14cd	feat(tools): add support for self-hosted firecrawl Adds optional FIRECRAWL_API_URL environment variable to support self-hosted Firecrawl deployments alongside the cloud service. - Add FIRECRAWL_API_URL to optional env vars in hermes_cli/config.py - Update _get_firecrawl_client() in tools/web_tools.py to accept custom API URL - Add tests for client initialization with/without URL - Document new env var in installation and config guides	2026-03-05 16:16:18 -06:00
shitcoinsherpa	81986022b7	Add explicit encoding="utf-8" to all config/data file open() calls On Windows, open() defaults to the system locale encoding (cp1252, cp1254, etc.) rather than UTF-8. This breaks any file containing non-ASCII characters, and also causes crashes when writing JSON with ensure_ascii=False. This adds encoding="utf-8" to open() calls in: - gateway/run.py (config.yaml reads/writes throughout) - gateway/config.py (gateway.json and config.yaml) - hermes_cli/config.py (config.yaml load/save) - hermes_cli/main.py (session export with ensure_ascii=False) - hermes_cli/status.py (jobs.json and sessions.json)	2026-03-05 17:16:04 -05:00
shitcoinsherpa	dcba291d45	Use pywinpty instead of ptyprocess on Windows for PTY support ptyprocess depends on Unix-only APIs (fork, openpty) and cannot work on Windows at all. pywinpty provides a compatible PtyProcess interface using the Windows ConPTY API. This conditionally imports winpty.PtyProcess on Windows and ptyprocess.PtyProcess on Unix. The pyproject.toml pty extra now uses platform markers so the correct package is installed automatically.	2026-03-05 17:16:04 -05:00
shitcoinsherpa	48e65631f6	Fix auth store file lock for Windows (msvcrt) with reentrancy support fcntl is not available on Windows. This adds msvcrt.locking as a fallback for cross-process advisory locking on Windows. msvcrt.locking is not reentrant within the same thread, unlike fcntl.flock. This matters because resolve_codex_runtime_credentials holds the lock and then calls _save_codex_tokens, which tries to acquire it again. Without reentrancy tracking, this deadlocks on Windows after a 15-second timeout. Uses threading.local() to track lock depth per thread, allowing nested acquisitions to pass through without re-acquiring the underlying lock. Also handles msvcrt-specific requirements: file must be opened in r+ mode (not a+), must have at least 1 byte of content, and the file pointer must be at position 0 before locking.	2026-03-05 17:16:03 -05:00
rovle	a6499b6107	fix(daytona): use shell timeout wrapper instead of broken SDK exec timeout The Daytona SDK's process.exec(timeout=N) parameter is not enforced — the server-side timeout never fires and the SDK has no client-side fallback, causing commands to hang indefinitely. Fix: wrap commands with timeout N sh -c '...' (coreutils) which reliably kills the process and returns exit code 124. Added shlex.quote for proper shell escaping and a secondary deadline (timeout + 10s) that force-stops the sandbox if the shell timeout somehow fails. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 13:12:41 -08:00
0xbyt4	14a11d24b4	fix: handle None args in build_tool_preview When an LLM returns null/empty tool call arguments, json.loads() produces None. build_tool_preview then crashes with "argument of type 'NoneType' is not iterable" on the `in` check. Return None early when args is falsy.	2026-03-05 23:09:11 +03:00
rovle	74a36b0729	docs: add Daytona to backend lists in docs Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:55:41 -08:00
rovle	efc7a7b957	fix(daytona): don't guess /root on cwd probe failure, keep constructor default; update tests to reflect this Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:49:35 -08:00
rovle	4f1464b3af	fix(daytona): default disk to 10GB to match platform limit Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:37:30 -08:00
rovle	3a41079fac	fix(daytona): add optional dependency group to pyproject.toml Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:13:12 -08:00
rovle	5279540bb4	fix(daytona): add missing config mappings in gateway, CLI defaults, and config display Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:12:50 -08:00
rovle	577da79a47	fix(daytona): make disk cap visible and use SDK enum for sandbox state - Replace logger.warning with warnings.warn for the disk cap so users actually see it (logger was suppressed by CLI's log level config) - Use SandboxState enum instead of string literals in _ensure_sandbox_ready Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:03:39 -08:00
rovle	1faa9648d3	chore(daytona): cap the disk size to current maximum on daytona sandboxes Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:43:41 -08:00
PercyDikec	ad57bf1e4b	fix(cli): use correct dict key for codex auth file path in status output	2026-03-05 21:27:12 +03:00
rovle	d5efb82c7c	test(daytona): add unit and integration tests for Daytona backend Unit tests cover cwd resolution, sandbox persistence/resume, cleanup, command execution, resource conversion, interrupt handling, retry exhaustion, and sandbox readiness checks. Integration tests verify basic commands, filesystem ops, session persistence, and task isolation against a live Daytona API. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:26:22 -08:00
PercyDikec	36214d14db	fix(cli): use correct visibility filter string in codex API model fetch	2026-03-05 21:12:53 +03:00
rovle	ea2f7ef2f6	docs(config): add Daytona disk limit hint and fix default cwd in example Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:22 -08:00
rovle	435530018b	fix(daytona): resolve cwd by detecting home directory inside the sandbox	2026-03-05 10:02:22 -08:00
rovle	df61054a84	feat(cli): add Daytona to setup wizard, doctor, and status display Add Daytona as a backend choice in the interactive setup wizard with SDK installation and API key prompts. Show Daytona image in status output and validate API key + SDK in doctor checks. Add OPTION 6 example in cli-config.yaml.example. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:22 -08:00
rovle	690b8bb563	feat(cli): add Daytona config mapping and env var sync Wire TERMINAL_DAYTONA_IMAGE through cli.py env_mappings and hermes_cli/config.py so `hermes config set` propagates correctly.	2026-03-05 10:02:21 -08:00
rovle	c43451a50b	feat(terminal): integrate Daytona backend into tool pipeline Add Daytona to image selection, container_config guards, environment factory, requirements check, and diagnostics in terminal_tool.py and file_tools.py. Also add to sandboxed-backend approval bypass. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:21 -08:00
rovle	1e312c6582	feat(environments): add Daytona cloud sandbox backend New execution backend using the Daytona Python SDK. Supports persistent sandboxes via stop/start lifecycle, interrupt handling, and automatic retry on transient errors. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:21 -08:00
PercyDikec	e36c8cd49a	fix: add missing re.DOTALL flag to DeepSeek V3 tool call parser	2026-03-05 20:32:38 +03:00
PercyDikec	16cb6d1a6e	fix(gateway): return response from /retry handler instead of discarding it	2026-03-05 19:59:54 +03:00
Teknium	21d61bdd71	Merge pull request #307 from batuhankocyigit/patch-1 fix: correct typo 'Grup' -> 'Group' in test section headers	2026-03-05 08:54:05 -08:00
teknium1	ad9c26afb8	Merge PR #293 : fix: eliminate shell noise from terminal output and fix test failures Authored by 0xbyt4. Wraps commands with unique fence markers to isolate real output from shell init/exit noise (oh-my-zsh, macOS session restore, etc.). Falls back to expanded pattern-based cleaning. Also fixes BSD find fallback and test module shadowing.	2026-03-05 08:48:26 -08:00
JackTheGit	71c0cd00e5	docs: fix spelling of 'publicly'	2026-03-05 16:46:21 +00:00
teknium1	83f99d8203	Merge PR #438 : fix: add missing empty-content guard after think-block stripping in retry path Authored by PercyDikec. Fixes #437. The retry path in _handle_max_iterations was missing the second if final_response: guard after stripping <think> blocks, which could result in an empty assistant message being appended to history instead of using the fallback message.	2026-03-05 08:37:49 -08:00
teknium1	6b37d38dee	Merge PR #292 : feat(whatsapp): native media attachments for images, videos and documents Authored by satelerd. Adds native WhatsApp media sending for images, videos, and documents via MEDIA: tags. Also includes conflict resolution with edit_message feature, Telegram hint fix (only advertise supported media types), and import cleanup.	2026-03-05 08:35:13 -08:00
PercyDikec	938499ddfb	fix: add missing empty-content guard after think-block stripping in retry path	2026-03-05 18:57:59 +03:00
teknium1	d92266d7c0	ci: pin tests to Python 3.11 only The installer hardcodes PYTHON_VERSION=3.11 and creates the venv with that version. No point testing 3.12 — halves CI time.	2026-03-05 07:55:01 -08:00
teknium1	a352b5c193	docs: remove legacy docs/ directory — all content migrated to website Removed 10 markdown files (~4,200 lines) that have been fully migrated, restructured, and accuracy-audited on the docs site at hermes-agent.nousresearch.com/docs/ Left docs/README.md as a pointer to the website. Updated CONTRIBUTING.md file tree reference.	2026-03-05 07:37:06 -08:00
teknium1	82f7483999	docs: simplify README from 1776 to 121 lines All detailed documentation now lives at hermes-agent.nousresearch.com/docs/. README retains: banner, badges, value proposition, feature highlights, one-line install, getting started commands, docs site link table, quick contributor setup, community links, and license. Removed: 1600+ lines of inline docs covering config, messaging setup, tools, skills, MCP, terminal backends, memory, cron, hooks, security, TTS, browser, batch processing, RL training, manual installation, env vars reference, file structure, and troubleshooting.	2026-03-05 07:33:07 -08:00
teknium1	56dc9277d7	ci: add test workflow for PRs and main branch Run pytest on Python 3.11 + 3.12 for every PR and push to main. - Uses uv for fast dependency installation - Excludes integration tests (need real API keys/services) - Blanks API keys as safety net against accidental real API calls - Concurrency: cancels in-progress runs when new commits are pushed - 10 minute timeout (tests take ~77s) - fail-fast disabled so both Python versions run independently GitHub's default 'require approval for first-time contributors' means maintainers approve CI before it runs on new contributors' PRs, preventing abuse of CI resources.	2026-03-05 07:29:16 -08:00
teknium1	d50e9bcef7	docs: add 11 new pages + expand 4 existing pages (26 → 37 total) New pages (sourced from actual codebase): - Security: command approval, DM pairing, container isolation, production checklist - Session Management: resume, export, prune, search, per-platform tracking - Context Files: AGENTS.md project context, discovery, size limits, security - Personality: SOUL.md, 14 built-in personalities, custom definitions - Browser Automation: Browserbase setup, 10 browser tools, stealth mode - Image Generation: FLUX 2 Pro via FAL, aspect ratios, auto-upscaling - Provider Routing: OpenRouter sort/only/ignore/order config - Honcho: AI-native memory integration, setup, peer config - Home Assistant: HASS setup, 4 HA tools, WebSocket gateway - Batch Processing: trajectory generation, dataset format, checkpointing - RL Training: Atropos/Tinker integration, environments, workflow Expanded pages: - code-execution: 51 → 195 lines (examples, limits, security, comparison table) - delegation: 60 → 216 lines (context tips, batch mode, model override) - cron: 88 → 273 lines (real-world examples, delivery options, expression cheat sheet) - memory: 98 → 249 lines (best practices, capacity management, examples)	2026-03-05 07:28:41 -08:00
teknium1	c4e520fd6e	docs: add documentation & housekeeping checklist to PR template Add a second checklist section covering common oversights seen in PRs: - Update relevant docs (README, docs/, docstrings) - Update cli-config.yaml.example when adding config keys - Update CONTRIBUTING.md/AGENTS.md for architecture changes - Consider cross-platform impact (Windows/macOS) - Update tool schemas when changing tool behavior Each item has an 'or N/A' option so contributors aren't blocked on items that don't apply to their change.	2026-03-05 07:23:52 -08:00
teknium1	30ff395924	feat: add issue and PR templates Add structured GitHub templates based on analysis of 200+ closed PRs and 50+ closed issues to improve submission quality: Issue templates (YAML form-based): - Bug Report: requires reproduction steps, expected/actual behavior, OS/Python/Hermes version. Optional root cause analysis field. - Feature Request: requires problem/use case, links to skill-vs-tool guidance in CONTRIBUTING.md to reduce misguided tool PRs. - Setup/Installation Help: requires install method, hermes doctor output, error logs, steps already tried. - Template chooser config with links to Discord, docs, contributing guide. PR template: - Type of change selector (bug/feature/security/docs/tests/refactor/skill) - Mandatory issue reference, changes list, testing steps - Checklist: conventional commits, no duplicates, focused changes, tests pass, tests added, platform tested - Dedicated 'New Skills' section asking if skill is broadly useful and properly formatted/tested Key problems these templates address: - Bug reports with no reproduction steps or environment info - Duplicate/racing PRs (multiple people fixing same issue) - Stale branches with 85+ unrelated file changes - Junk skill PRs that should go to Skills Hub instead of bundled - Missing tests on bug fix PRs - No issue references on PRs	2026-03-05 07:22:39 -08:00
teknium1	f55025952d	docs: reorder sidebar — Quickstart before Installation	2026-03-05 07:15:35 -08:00
teknium1	1bc45ee8fe	docs: simplify installer description for getting started page	2026-03-05 07:14:13 -08:00
teknium1	19016497ef	docs: fix all remaining minor accuracy issues - updating.md: Note that 'hermes update' auto-handles config migration - cli.md: Add summary_model to compression config, fix display config (add personality/compact), remove unverified pastes/ claim - configuration.md: Add 5 missing config sections (stt, human_delay, code_execution, delegation, clarify), fix display defaults, fix reasoning_effort default to empty/unset - messaging/index.md: Add GATEWAY_ALLOWED_USERS to security section - skills.md: Add category field to skills_list return value - mcp.md: Document auto-registered utility tools (resources/prompts) - architecture.md: Fix file_tools.py reference, base_url default to None, synchronous agent loop pseudocode - cli-commands.md: Fix hermes logout description - environment-variables.md: Add HERMES_QUIET, HERMES_EXEC_ASK, BROWSER_INACTIVITY_TIMEOUT, GATEWAY_ALLOWED_USERS Verification scan: 27/27 checks passed, zero issues remaining.	2026-03-05 07:00:51 -08:00
teknium1	d578d06f59	docs: comprehensive accuracy audit fixes (35+ corrections) CRITICAL fixes: - Installation: Remove false prerequisites (installer auto-installs everything except git) - Tools: Remove non-existent 'web_crawl' tool from tools table - Memory: Remove non-existent 'read' action (only add/replace/remove exist) - Code execution: Fix 'search' to 'search_files' in sandbox tools list - CLI commands: Fix --model/--provider/--toolsets/--verbose as chat subcommand flags IMPORTANT fixes: - Installation: Add missing installer features (Node.js, ripgrep, ffmpeg, skills seeding) - Installation: Add 6 missing package extras to table (mcp, honcho, tts-premium, etc) - Installation: Fix mkdir to include all directories the installer creates - Quickstart: Add OpenAI Codex to provider table - CLI: Fix all 'hermes --flag' to 'hermes chat --flag' across all docs - Configuration: Remove non-existent --max-turns CLI flag - Tools: Fix 'search' to 'search_files', add missing 'process' tool - Skills: Remove skills_categories() (not a registered tool) - Cron: Remove unsupported 'daily at 9am' schedule format - TTS: Fix output directory to ~/.hermes/audio_cache/ - Delegation: Clarify depth limit wording - Architecture: Fix default model, chat() signature, file names - Contributing: Fix Python requirement from 3.11+ to 3.10+ - CLI reference: Add missing commands (login, tools, sessions subcommands) - Env vars: Fix TERMINAL_DOCKER_IMAGE default, add HERMES_MODEL	2026-03-05 06:50:22 -08:00
Farukest	e25ad79d5d	fix: use _max_tokens_param in max-iterations retry path The retry summary in _handle_max_iterations hardcodes max_tokens instead of calling _max_tokens_param(). For direct OpenAI API users (gpt-4o, o-series), the correct parameter name is max_completion_tokens. The first attempt at line 2697 already uses _max_tokens_param correctly but the retry path at line 2743 was missed.	2026-03-05 17:49:37 +03:00
teknium1	f2624a1426	docs: remove Windows support references, recommend WSL2 - Installation: Remove PowerShell/CMD install commands, add WSL2 warning - Quickstart: Replace PowerShell block with WSL2 tip - Contributing: Update cross-platform section to clarify Windows unsupported - Index: Update install description to say WSL2 instead of Windows	2026-03-05 06:36:18 -08:00
jackx707	15561ec425	feat: add WebResearchEnv RL environment for multi-step web research	2026-03-05 14:34:36 +00:00
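
Note on c886333d32 (context length probing): the step-down behaviour described in that commit message can be sketched roughly as follows. This is a minimal illustration, not the code in agent/model_metadata.py. The names CONTEXT_PROBE_TIERS, get_next_probe_tier() and parse_context_limit_from_error() come from the commit message, but their bodies here are assumptions, as are the exact token counts behind the "2M ... 32K" labels and the hypothetical helper step_down().

```python
import re
from typing import Optional

# Assumed token counts: the commit message only gives the labels 2M ... 32K.
CONTEXT_PROBE_TIERS = [2_000_000, 1_000_000, 512_000, 200_000, 128_000, 64_000, 32_000]


def get_next_probe_tier(current: int) -> Optional[int]:
    """Return the largest tier strictly below `current`, or None if none is left."""
    for tier in CONTEXT_PROBE_TIERS:  # tiers are listed in descending order
        if tier < current:
            return tier
    return None


def parse_context_limit_from_error(message: str) -> Optional[int]:
    """Extract a limit from text like 'maximum context length is 32768 tokens'."""
    match = re.search(r"maximum context length is (\d+) tokens", message, re.IGNORECASE)
    return int(match.group(1)) if match else None


def step_down(current: int, error_message: str) -> Optional[int]:
    """Hypothetical helper: prefer the limit stated in the error, else the next tier."""
    parsed = parse_context_limit_from_error(error_message)
    if parsed is not None and parsed < current:
        return parsed
    return get_next_probe_tier(current)
```

A caller would retry the request with the value step_down() returns, then cache the first value that succeeds per model+provider pair, as the commit message describes.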


5886 commits
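
Note on a6499b6107 (Daytona timeout wrapper): the shell wrapping that commit describes, `timeout N sh -c '...'` with shlex.quote for escaping, might look like the sketch below. The function names wrap_with_timeout() and timed_out() are hypothetical and do not come from the repository; only the wrapper shape, the use of shlex.quote, and exit code 124 are taken from the commit message.

```python
import shlex

# Exit status coreutils `timeout` returns when the time limit is reached.
TIMEOUT_EXIT_CODE = 124


def wrap_with_timeout(command: str, seconds: int) -> str:
    """Wrap `command` so coreutils `timeout` kills it after `seconds` (hypothetical helper)."""
    # shlex.quote keeps the whole command a single argument to `sh -c`.
    return f"timeout {int(seconds)} sh -c {shlex.quote(command)}"


def timed_out(exit_code: int) -> bool:
    """Report whether an exit code indicates the shell timeout fired."""
    return exit_code == TIMEOUT_EXIT_CODE
```

The secondary deadline mentioned in the commit (timeout + 10s, then force-stopping the sandbox) is not shown, since it depends on the Daytona SDK.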