hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-27 01:11:40 +00:00

Author	SHA1	Message	Date
teknium1	9dac85b069	fix: uv pip install fails outside venv in setup wizard uv pip install requires a virtual environment by default. When hermes is installed system-wide or via pipx, the setup wizard's SDK installs (daytona, swe-rex[modal], tinker-atropos) fail with 'No virtual environment found'. Fix by passing --python sys.executable to uv, which targets the correct Python regardless of venv state. Also show the actual error message on install failure so users can debug.	2026-03-06 21:55:33 -08:00
teknium1	99bd69baa8	Merge feat/modular-setup-wizard: modular setup wizard with section subcommands and tool-first UX - 5 standalone sections: hermes setup [model|terminal|gateway|tools|agent] - Returning user menu with section shortcuts - Tool-first UX: category -> provider -> API key flow - Unified hermes tools / hermes setup tools - Fixed dict-format model config display bug Closes #567	2026-03-06 21:12:30 -08:00
teknium1	a62a137a4f	fix: handle dict-format model config in setup wizard display config['model'] can be a dict (old format: {default, base_url, provider}) or a string (new format). The setup wizard was showing the raw dict in 'Keep current' and 'Model set to' messages. Now extracts the model name from either format.	2026-03-06 21:11:40 -08:00
teknium1	82b18e8ac2	feat: unify hermes tools and hermes setup tools into single flow Both 'hermes tools' and 'hermes setup tools' now use the same unified flow in tools_config.py: 1. Select platform (CLI, Telegram, Discord, etc.) 2. Toggle all 18 toolsets on/off in checklist 3. Newly enabled tools that need API keys → provider-aware config (e.g., TTS shows Edge/OpenAI/ElevenLabs picker) 4. Already-configured tools that stay enabled → silent, no prompts 5. Menu option: 'Reconfigure an existing tool' for updating providers or API keys on tools that are already set up Key changes: - Move TOOL_CATEGORIES, provider config, and post-setup hooks from setup.py to tools_config.py - Replace flat _check_and_prompt_requirements() with provider-aware _configure_toolset() that uses TOOL_CATEGORIES - Add _reconfigure_tool() flow for updating existing configs - setup.py's setup_tools() now delegates to tools_command() - tools_command() menu adds 'Reconfigure' option alongside platforms - Only prompt for API keys on tools that are NEWLY toggled on AND don't already have keys configured No breaking changes. All 2013 tests pass.	2026-03-06 21:02:00 -08:00
teknium1	0111c9848d	fix: remove ANSI codes and em dashes from menu labels simple_term_menu miscalculates string widths when labels contain ANSI escape codes (from color()) or em dashes, causing duplicated and garbled lines on arrow key navigation. Replace color() status indicators with plain text [configured]/[active] and em dashes with regular dashes in all prompt_choice/prompt_checklist labels.	2026-03-06 21:02:00 -08:00
teknium1	ab9cadfeee	feat: modular setup wizard with section subcommands and tool-first UX Restructure the monolithic hermes setup wizard into independently-runnable sections with a category-first tool configuration experience. Changes: - Break setup into 5 sections: model, terminal, gateway, tools, agent - Each section is a standalone function, runnable individually via 'hermes setup model', 'hermes setup terminal', etc. - Returning users get a menu: Quick Setup / Full Setup / individual sections - First-time users get a guided walkthrough of all sections Tool Configuration UX overhaul: - Replace flat API key checklist with category-first approach - Show tool types (TTS, Web Search, Image Gen, etc.) as top-level items - Within each category, let users pick a provider: - TTS: Microsoft Edge (Free), OpenAI, ElevenLabs - Web: Firecrawl Cloud, Firecrawl Self-Hosted - Image Gen: FAL.ai - Browser: Browserbase - Smart Home: Home Assistant - RL Training: Tinker/Atropos - GitHub: Personal Access Token - Shows configured status on each tool and provider - Only prompts for API keys after provider selection Also: - Add section argument to setup argparse parser in main.py - Update summary to show new section commands - Add self-hosted Firecrawl and Home Assistant to tool setup - All 2013 tests pass	2026-03-06 21:02:00 -08:00
PercyDikec	8bf28e1441	fix(setup): prevent OpenRouter model list fallback for Nous provider When `fetch_nous_models()` fails silently during setup, the model selection falls through to the OpenRouter static list. Users then pick models in OpenRouter format (e.g. `anthropic/claude-opus-4.6`) which the Nous inference API rejects with a 400 "missing model" error. Add an explicit `elif selected_provider == "nous"` branch that prompts for manual model entry instead of falling through to the generic OpenRouter fallback.	2026-03-07 07:16:22 +03:00
teknium1	ce28f847ce	fix: update OpenRouter model names for yc-bench config Use anthropic/claude-sonnet-4.6 (OpenRouter format) instead of anthropic/claude-sonnet-4-20250514 (direct API format).	2026-03-06 19:58:56 -08:00
stablegenius49	5609117882	fix(doctor): recognize OPENAI_API_KEY custom endpoint config	2026-03-06 19:47:09 -08:00
teknium1	b4fbb6fe10	feat: add YC-Bench long-horizon agent benchmark environment Adds eval-only benchmark for YC-Bench (collinear-ai/yc-bench), a deterministic long-horizon benchmark where the agent acts as CEO of an AI startup over a simulated 1-3 year run. Key design decisions verified against the official yc-bench repo: - Uses 'sim init' (NOT 'yc-bench run') to avoid starting a competing built-in agent loop - Correct DB table names: 'companies' and 'sim_events' - Correct 4 domains: research, inference, data_environment, training - Penalty values are preset-dependent (not hardcoded in system prompt) - Sequential evaluation (each run is 100-500 turns) - Follows TerminalBench2 patterns: KeyboardInterrupt handling, cleanup_all_environments(), tqdm logging handler, streaming JSONL yc-bench added as optional dependency: pip install hermes-agent[yc-bench] Closes #340	2026-03-06 19:25:56 -08:00
teknium1	82d7e9429e	chore: add GLM/Kimi/MiniMax models to insights pricing (zero cost) These direct providers don't return cost in API responses and their per-token pricing isn't readily available externally. Treat as local models with zero cost so they appear in /insights without fake estimates.	2026-03-06 19:12:14 -08:00
teknium1	e2821effb5	feat: add direct API-key providers as auxiliary client fallbacks When the user only has a z.ai/Kimi/MiniMax API key (no OpenRouter key), auxiliary tasks (context compression, web summarization, session search) now fall back to the configured direct provider instead of returning None. Resolution chain: OpenRouter -> Nous -> Custom endpoint -> Codex OAuth -> direct API-key providers -> None. Uses cheap/fast models for auxiliary tasks: - zai: glm-4.5-flash - kimi-coding: kimi-k2-turbo-preview - minimax/minimax-cn: MiniMax-M2.5-highspeed Vision auxiliary intentionally NOT modified — vision needs multimodal models (Gemini) that these providers don't serve.	2026-03-06 19:08:54 -08:00
teknium1	9742f11fda	chore: add context lengths for Kimi and MiniMax models Adds DEFAULT_CONTEXT_LENGTHS entries for kimi-k2.5 (262144), kimi-k2-thinking (262144), kimi-k2-turbo-preview (262144), kimi-k2-0905-preview (131072), MiniMax-M2.5/M2.5-highspeed/M2.1 (204800), and glm-4.5/4.5-flash (131072). Avoids unnecessary 2M-token probe on first use with direct providers.	2026-03-06 19:01:38 -08:00
Tyler	53b4b7651a	Add official OpenClaw migration skill for Hermes Agent Introduces a new OpenClaw-to-Hermes migration skill with a Python helper script that handles importing SOUL.md, memories, user profiles, messaging settings, command allowlists, skills, TTS assets, and workspace instructions. Supports two migration presets (user-data / full), three skill conflict modes (skip / overwrite / rename), overflow file export for entries that exceed character limits, and granular include/exclude option filtering. Includes detailed SKILL.md agent instructions covering the clarify-tool interaction protocol, decision-to-command mapping, post-run reporting rules, and path resolution guidance. Adds dynamic panel width calculation to CLI clarify/approval widgets so panels adapt to content and terminal size. Includes 7 new tests covering presets, include/exclude, conflict modes, overflow exports, and skills_guard integration.	2026-03-06 18:57:12 -08:00
teknium1	388dd4789c	feat: add z.ai/GLM, Kimi/Moonshot, MiniMax as first-class providers Adds 4 new direct API-key providers (zai, kimi-coding, minimax, minimax-cn) to the inference provider system. All use standard OpenAI-compatible chat/completions endpoints with Bearer token auth. Core changes: - auth.py: Extended ProviderConfig with api_key_env_vars and base_url_env_var fields. Added providers to PROVIDER_REGISTRY. Added provider aliases (glm, z-ai, zhipu, kimi, moonshot). Added auto-detection of API-key providers in resolve_provider(). Added resolve_api_key_provider_credentials() and get_api_key_provider_status() helpers. - runtime_provider.py: Added generic API-key provider branch in resolve_runtime_provider() — any provider with auth_type='api_key' is automatically handled. - main.py: Added providers to hermes model menu with generic _model_flow_api_key_provider() flow. Updated _has_any_provider_configured() to check all provider env vars. Updated argparse --provider choices. - setup.py: Added providers to setup wizard with API key prompts and curated model lists. - config.py: Added env vars (GLM_API_KEY, KIMI_API_KEY, MINIMAX_API_KEY, etc.) to OPTIONAL_ENV_VARS. - status.py: Added API key display and provider status section. - doctor.py: Added connectivity checks for each provider endpoint. - cli.py: Updated provider docstrings. Docs: Updated README.md, .env.example, cli-config.yaml.example, cli-commands.md, environment-variables.md, configuration.md. Tests: 50 new tests covering registry, aliases, resolution, auto-detection, credential resolution, and runtime provider dispatch. Inspired by PR #33 (numman-ali) which proposed a provider registry approach. Credit to tars90percent (PR #473) and manuelschipper (PR #420) for related provider improvements merged earlier in this changeset.	2026-03-06 18:55:18 -08:00
Teknium	fdebca4573	Merge pull request #571 from NousResearch/rewbs/nous-key-remint-attempt-on-401 fix: implement Nous credential refresh on 401 error for retry logic	2026-03-06 18:52:01 -08:00
teknium1	479dfc096a	Merge PR #473: Update model id in OpenRouter from minimax-m2.1 to minimax-m2.5 Authored by tars90percent. Updates remaining minimax-m2.1 references to minimax-m2.5 in rl_training_tool.py and docs.	2026-03-06 18:43:18 -08:00
teknium1	3c6c11b7c9	Merge PR #420: fix: respect OPENAI_BASE_URL when resolving API key priority Authored by manuelschipper. Adds GLM-4.7 and GLM-5 context lengths (202752) to model_metadata.py. The key priority fix (prefer OPENAI_API_KEY for non-OpenRouter endpoints) was already applied in PR #295; merged the Z.ai mention into the comment.	2026-03-06 18:43:13 -08:00
Robin Fernandes	bc091eb7ef	fix: implement Nous credential refresh on 401 error for retry logic	2026-03-07 13:34:23 +11:00
alireza78a	a857321463	fix(code-execution): close server socket in finally block to prevent fd leak	2026-03-07 05:49:48 +03:30
teknium1	f75b1d21b4	fix: execute_code and delegate_task now respect disabled toolsets When a user disables the web toolset via 'hermes tools', the execute_code schema description still hardcoded web_search/web_extract as available, causing the model to keep trying to use them. Similarly, delegate_task always defaulted to ['terminal', 'file', 'web'] for subagents regardless of the parent's config. Changes: - execute_code schema is now built dynamically via build_execute_code_schema() based on which sandbox tools are actually enabled - model_tools.py rebuilds the execute_code schema at definition time using the intersection of sandbox-allowed and session-enabled tools - delegate_task now inherits the parent agent's enabled_toolsets instead of hardcoding DEFAULT_TOOLSETS when no explicit toolsets are specified - delegate_task description updated to say 'inherits your enabled toolsets' Reported by kotyKD on Discord.	2026-03-06 17:36:14 -08:00
0xbyt4	33cfe1515d	fix: sanitize FTS5 queries and close mirror DB connections Two bugs fixed: 1. search_messages() crashes with OperationalError when user queries contain FTS5 special characters (+, ", (, {, dangling AND/OR, etc). Added _sanitize_fts5_query() to strip dangerous operators and a fallback try-except for edge cases. 2. _append_to_sqlite() in mirror.py creates a new SessionDB per call but never closes it, leaking SQLite connections. Added finally block to ensure db.close() is always called.	2026-03-07 04:24:45 +03:00
teknium1	94053d75a6	fix: custom endpoint no longer leaks OPENROUTER_API_KEY (#560) API key selection is now base_url-aware: when the resolved base_url targets OpenRouter, OPENROUTER_API_KEY takes priority (preserving the #289 fix). When hitting any other endpoint (Z.ai, vLLM, custom, etc.), OPENAI_API_KEY takes priority so the OpenRouter key doesn't leak. Applied in both the runtime provider resolver (the real code path) and the CLI initial default (for consistency). Fixes #560.	2026-03-06 17:16:14 -08:00
teknium1	2a68099675	fix(tests): isolate tests from user ~/.hermes/ config and SOUL.md _make_cli() now patches CLI_CONFIG with clean defaults so test_cli_init tests don't depend on the developer's local config.yaml. test_empty_dir_returns_empty now mocks Path.home() so it doesn't pick up a global SOUL.md. Credit to teyrebaz33 for identifying and fixing these in PR #557. Fixes #555.	2026-03-06 17:10:35 -08:00
0xbyt4	3b43f7267a	fix: count actual tool calls instead of tool-related messages tool_call_count was inaccurate in two ways: 1. Under-counting: an assistant message with N parallel tool calls (e.g. "kill the light and shut off the fan" = 2 ha_call_service) only incremented tool_call_count by 1 instead of N. 2. Over-counting: tool response messages (role=tool) also incremented tool_call_count, double-counting every tool interaction. Combined: 2 parallel tool calls produced tool_call_count=3 (1 from assistant + 2 from tool responses) instead of the correct value of 2. Fix: only count from assistant messages with tool_calls, incrementing by len(tool_calls) to handle parallel calls correctly. Tool response messages no longer affect tool_call_count. This impacts /insights and /usage accuracy for sessions with tool use.	2026-03-07 04:07:52 +03:00
teknium1	6cd3bc6640	Merge PR #563: fix: prevent data loss in skills sync on copy/update failure Authored by 0xbyt4. Two bugs fixed: 1. Failed copytree no longer poisons the manifest (skill gets retried) 2. Failed update no longer destroys user's copy (backup + restore)	2026-03-06 17:01:30 -08:00
0xbyt4	211b55815e	fix: prevent data loss in skills sync on copy/update failure Two bugs in sync_skills(): 1. Failed copytree poisons manifest: when shutil.copytree fails (disk full, permission error), the skill is still recorded in the manifest. On the next sync, the skill appears as "in manifest but not on disk" which is interpreted as "user deliberately deleted it" — the skill is never retried. Fix: only write to manifest on successful copy. 2. Failed update destroys user copy: rmtree deletes the existing skill directory before copytree runs. If copytree then fails, the user's skill is gone with no way to recover. Fix: move to .bak before copying, restore from backup if copytree fails. Both bugs are proven by new regression tests that fail on the old code and pass on the fix.	2026-03-07 03:58:32 +03:00
teknium1	8ae4a6f824	fix: improve handling of empty responses after tool calls - Added fallback mechanism to utilize previous content when the model generates an empty response after tool calls, reducing unnecessary API retries. - Enhanced logging to indicate when prior content is used as a final response. - Updated logic to ensure that genuine empty responses are retried appropriately, maintaining user experience.	2026-03-06 16:54:31 -08:00
teknium1	b98301677a	docs: add /insights to all help menus and documentation - website/docs/reference/cli-commands.md: Added 'hermes insights' terminal command section with --days and --source flags, plus /insights slash command in the Conversation section - website/docs/user-guide/cli.md: Added /insights to slash commands table - website/docs/user-guide/messaging/index.md: Added /insights to gateway chat commands table - website/docs/user-guide/sessions.md: Added cross-reference to hermes insights from the sessions stats section	2026-03-06 16:48:58 -08:00
teknium1	f2fdde5ba4	fix: show user-modified skills count in hermes update output	2026-03-06 16:14:43 -08:00
teknium1	4f56e31dc7	fix: track origin hashes in skills manifest to preserve user modifications Upgrade skills_sync manifest to v2 format (name:origin_hash). The origin hash records the MD5 of the bundled skill at the time it was last synced. On update, the user's copy is compared against the origin hash: - User copy == origin hash → unmodified → safe to update from bundled - User copy != origin hash → user customized → skip (preserve changes) v1 manifests (plain names) are auto-migrated: the user's current hash becomes the baseline, so future syncs can detect modifications. Output now shows user-modified skills: ~ whisper (user-modified, skipping) 27 tests covering all scenarios including v1→v2 migration, user modification detection, update after migration, and origin hash tracking. 2009 tests pass.	2026-03-06 16:13:58 -08:00
Teknium	6d3804770c	Merge pull request #552 from NousResearch/feat/insights feat: /insights command — usage analytics, cost estimation & activity patterns	2026-03-06 16:00:28 -08:00
teknium1	ab0f4126cf	fix: restore all removed bundled skills + fix skills sync system - Restored 21 skills removed in commits `757d012` and `740dd92`: accelerate, audiocraft, code-review, faiss, flash-attention, gguf, grpo-rl-training, guidance, llava, nemo-curator, obliteratus, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, stable-diffusion, tensorrt-llm, torchtitan, trl-fine-tuning, whisper - Rewrote sync_skills() with proper update semantics: * New skills (not in manifest): copied to user dir * Existing skills (in manifest + on disk): updated via hash comparison * User-deleted skills (in manifest, not on disk): respected, not re-added * Stale manifest entries (removed from bundled): cleaned from manifest - Added sync_skills() to CLI startup (cmd_chat) and gateway startup (start_gateway) — previously only ran during 'hermes update' - Updated cmd_update output to show new/updated/cleaned counts - Rewrote tests: 20 tests covering manifest CRUD, dir hashing, fresh install, user deletion respect, update detection, stale cleanup, and name collision handling 75 bundled skills total. 2002 tests pass.	2026-03-06 15:57:30 -08:00
unmodeled-tyler	1755a9e38a	Design agent migration skill for Hermes Agent from OpenClaw | Run successful dry tests with reports	2026-03-06 15:12:45 -08:00
teknium1	585f8528b2	fix: deep review — prefix matching, tool_calls extraction, query perf, serialization Issues found and fixed during deep code path review: 1. CRITICAL: Prefix matching returned wrong prices for dated model names - 'gpt-4o-mini-2024-07-18' matched gpt-4o ($2.50) instead of gpt-4o-mini ($0.15) - Same for o3-mini→o3 (9x), gpt-4.1-mini→gpt-4.1 (5x), gpt-4.1-nano→gpt-4.1 (20x) - Fix: use longest-match-wins strategy instead of first-match - Removed dangerous key.startswith(bare) reverse matching 2. CRITICAL: Top Tools section was empty for CLI sessions - run_agent.py doesn't set tool_name on tool response messages (pre-existing) - Insights now also extracts tool names from tool_calls JSON on assistant messages, which IS populated for all sessions - Uses max() merge strategy to avoid double-counting between sources 3. SELECT * replaced with explicit column list - Skips system_prompt and model_config blobs (can be thousands of chars) - Reduces memory and I/O for large session counts 4. Sets in overview dict converted to sorted lists - models_with_pricing / models_without_pricing were Python sets - Sets aren't JSON-serializable — would crash json.dumps() 5. Negative duration guard - end > start check prevents negative durations from clock drift 6. Model breakdown sort fallback - When all tokens are 0, now sorts by session count instead of arbitrary order 7. Removed unused timedelta import Added 6 new tests: dated model pricing (4), tool_calls JSON extraction, JSON serialization safety. Total: 69 tests.	2026-03-06 14:50:57 -08:00
teknium1	75f523f5c0	fix: unknown/custom models get zero cost instead of fake estimates Custom OAI endpoints, self-hosted models, and local inference should NOT show fabricated cost estimates. Changed default pricing from $3/$12 per million tokens to $0/$0 for unrecognized models. - Added _has_known_pricing() to distinguish commercial vs custom models - Models with known pricing show $ amounts; unknown models show 'N/A' - Overview shows asterisk + note when some models lack pricing data - Gateway format adds '(excludes custom/self-hosted models)' note - Added 7 new tests for custom model cost handling	2026-03-06 14:18:19 -08:00
teknium1	68fbae5692	docs: add Custom & Self-Hosted LLM Providers guide Comprehensive guide for using Hermes Agent with alternative LLM backends: - Ollama (local models, zero config) - vLLM (high-performance GPU inference) - SGLang (RadixAttention, prefix caching) - llama.cpp / llama-server (CPU & Metal inference) - LiteLLM Proxy (multi-provider gateway) - ClawRouter (cost-optimized routing with complexity scoring) - 10+ other compatible providers table (Together, Groq, DeepSeek, etc.) - Choosing the Right Setup decision table - General custom endpoint setup instructions All of these work via the existing OPENAI_BASE_URL + OPENAI_API_KEY custom endpoint support — no code changes needed.	2026-03-06 14:16:06 -08:00
teknium1	80f1dd8d37	docs: add Custom & Self-Hosted LLM Providers guide Comprehensive guide for using Hermes Agent with alternative LLM backends: - Ollama (local models, zero config) - vLLM (high-performance GPU inference) - SGLang (RadixAttention, prefix caching) - llama.cpp / llama-server (CPU & Metal inference) - LiteLLM Proxy (multi-provider gateway) - ClawRouter (cost-optimized routing with complexity scoring) - 10+ other compatible providers table (Together, Groq, DeepSeek, etc.) - Choosing the Right Setup decision table - General custom endpoint setup instructions All of these work via the existing OPENAI_BASE_URL + OPENAI_API_KEY custom endpoint support — no code changes needed.	2026-03-06 14:15:57 -08:00
teknium1	b52b37ae64	feat: add /insights command with usage analytics and cost estimation Inspired by Claude Code's /insights, adapted for Hermes Agent's multi-platform architecture. Analyzes session history from state.db to produce comprehensive usage insights. Features: - Overview stats: sessions, messages, tokens, estimated cost, active time - Model breakdown: per-model sessions, tokens, and cost estimation - Platform breakdown: CLI vs Telegram vs Discord etc. (unique to Hermes) - Tool usage ranking: most-used tools with percentages - Activity patterns: day-of-week chart, peak hours, streaks - Notable sessions: longest, most messages, most tokens, most tool calls - Cost estimation: real pricing data for 25+ models (OpenAI, Anthropic, DeepSeek, Google, Meta) with fuzzy model name matching - Configurable time window: --days flag (default 30) - Source filtering: --source flag to filter by platform Three entry points: - /insights slash command in CLI (supports --days and --source flags) - /insights slash command in gateway (compact markdown format) - hermes insights CLI subcommand (standalone) Includes 56 tests covering pricing helpers, format helpers, empty DB, populated DB with multi-platform data, filtering, formatting, and edge cases.	2026-03-06 14:04:59 -08:00
aydnOktay	566aeaeefa	Make skill file writes atomic	2026-03-07 00:49:10 +03:00
Himess	7a0544ab57	fix: three small inconsistencies across cron, gateway, and daytona 1. cron/jobs.py: respect HERMES_HOME env var for job storage path. scheduler.py already uses os.getenv("HERMES_HOME", ...) but jobs.py hardcodes Path.home() / ".hermes", causing path mismatch when HERMES_HOME is set. 2. gateway/run.py: add Platform.HOMEASSISTANT to default_toolset_map and platform_config_key. The adapter and hermes-homeassistant toolset both exist but the mapping dicts omit it, so HomeAssistant events silently fall back to the Telegram toolset. 3. tools/environments/daytona.py: use time.monotonic() for deadline instead of float subtraction. All other backends (docker, ssh, singularity, local) use monotonic clock for timeout tracking. The accumulator pattern (deadline -= 0.2) drifts because t.join(0.2) + interrupt checks take longer than 0.2s per iteration.	2026-03-06 16:52:17 +03:00
teknium1	d63b363cde	refactor: extract atomic_json_write helper, add 24 checkpoint tests Extract the duplicated temp-file + fsync + os.replace pattern from batch_runner.py (1 instance) and process_registry.py (2 instances) into a shared utils.atomic_json_write() function. Add 12 tests for atomic_json_write covering: valid JSON, parent dir creation, overwrite, crash safety (original preserved on error), no temp file leaks, string paths, unicode, custom indent, concurrent writes. Add 12 tests for batch_runner checkpoint behavior covering: _save_checkpoint (valid JSON, last_updated, overwrite, lock/no-lock, parent dirs, no temp leaks), _load_checkpoint (missing file, existing data, corrupt JSON), and resume logic (preserves prior progress, different run_name starts fresh).	2026-03-06 05:50:12 -08:00
teknium1	c05c60665e	Merge PR #298: Make process_registry checkpoint writes atomic Authored by aydnOktay. Companion to PR #297 (batch_runner). Applies the same atomic write pattern (temp file + fsync + os.replace) to both _write_checkpoint() and recover_from_checkpoint() in process_registry.py. Prevents checkpoint corruption on gateway crashes. Also improves error handling: bare 'pass' replaced with logger.debug(..., exc_info=True) for better debugging.	2026-03-06 05:32:35 -08:00
teknium1	b4873a5de7	fix(setup): Escape skips instead of exiting, add control hints to all prompts Previously pressing Escape in any setup wizard menu called sys.exit(1), killing the entire wizard with no way to recover. Now: - prompt_choice: Escape keeps the current default and moves on (prints 'Skipped (keeping current)'). Shows '↑/↓ Navigate Enter Select Esc Skip Ctrl+C Exit' hint. - prompt_checklist: Escape returns pre-selected items instead of empty list. Shows 'SPACE Toggle ENTER Confirm ESC Skip Ctrl+C Exit'. - prompt_yes_no: now catches KeyboardInterrupt/EOFError properly. - Fallback number prompts also show control hints. Ctrl+C still exits the wizard cleanly.	2026-03-06 05:27:11 -08:00
teknium1	913f8ce0a5	Merge PR #297: Make batch_runner checkpoint incremental and atomic Authored by aydnOktay. Three improvements to batch_runner fault tolerance: 1) Atomic checkpoint writes (temp file + fsync + os.replace) to prevent corruption on crashes — same pattern as auth.py's _save_auth_store(). 2) Incremental checkpoints after each batch result instead of only at end, so interrupted runs can resume with minimal progress loss. 3) Resume loads existing checkpoint state instead of initializing empty, preventing clobber of prior progress. Conflict resolved: kept both the incremental checkpoint logic (PR) and the batch worker error handling (HEAD) in the imap_unordered loop.	2026-03-06 05:16:31 -08:00
Himess	453e0677d6	fix: use regex for search output parsing to handle Windows drive-letter paths The ripgrep/grep output parser uses `split(':', 2)` to extract file:lineno:content from match lines. On Windows, absolute paths contain a drive letter colon (e.g. `C:\Users\foo\bar.py:42:content`), so `split(':', 2)` produces `["C", "\Users\...", "42:content"]`. `int(parts[1])` then raises ValueError and the match is silently dropped. All search results are lost on Windows. Same category as #390 — string-based path parsing that fails on Windows. Replace `split()` with a regex that optionally captures the drive letter prefix: `^([A-Za-z]:)?(.*?):(\d+):(.*)$`. Applied to both `_search_with_rg` and `_search_with_grep`.	2026-03-06 15:54:33 +03:00
teknium1	4a63737227	Merge PR #433: fix(whatsapp): replace Linux-only fuser with cross-platform port cleanup Authored by Farukest. Fixes #432. Extracts _kill_port_process() helper that uses netstat+taskkill on Windows and fuser on Linux. Previously, fuser calls were inline with bare except-pass, so on Windows orphaned bridge processes were never cleaned up — causing 'address already in use' errors on reconnect. Includes 5 tests covering both platforms, port matching edge cases, and exception suppression.	2026-03-06 04:52:25 -08:00
teknium1	3e93db16bd	Merge PR #436: fix: use _max_tokens_param in max-iterations retry path Authored by Farukest. Fixes #435. The retry summary in _handle_max_iterations() hardcoded max_tokens instead of using _max_tokens_param(), which returns max_completion_tokens for direct OpenAI API (required by gpt-4o, o-series). The first attempt already used _max_tokens_param correctly — only the retry path was wrong. Includes 4 tests for _max_tokens_param provider detection.	2026-03-06 04:46:24 -08:00
teknium1	f863a42351	Merge PR #441: fix(gateway): return response from /retry handler instead of discarding it Authored by PercyDikec. Fixes #440. _handle_retry_command called _handle_message(retry_event) but discarded the return value, returning None instead. Since only _process_message_background sends the response via adapter.send(), this meant the agent would run (tool progress was visible) but the final answer was silently dropped on all platforms.	2026-03-06 04:42:54 -08:00
teknium1	dc55f493be	fix: add missing re.DOTALL to DeepSeek V3.1 parser (same bug as V3) The V3.1 parser had the same issue — .*? without re.DOTALL fails to match multi-line JSON arguments. Found during review of PR #444.	2026-03-06 04:41:47 -08:00
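
Note on 453e0677d6 (Windows drive-letter paths): a minimal sketch of the parsing problem that commit describes, not the repository's actual code. The function name parse_match_line is hypothetical, and the pattern assumes the regex in the commit message reads `^([A-Za-z]:)?(.*?):(\d+):(.*)$` (optional drive letter, lazy path, line number, content).

```python
import re

# Assumed pattern: optional drive-letter prefix, lazily matched path,
# numeric line number, then the rest of the line as content.
MATCH_LINE = re.compile(r"^([A-Za-z]:)?(.*?):(\d+):(.*)$")


def parse_match_line(line):
    """Split one 'file:lineno:content' search hit into its three parts.

    Hypothetical helper for illustration. Returns None when the line
    does not look like a match line.
    """
    m = MATCH_LINE.match(line)
    if m is None:
        return None
    drive, rest, lineno, content = m.groups()
    # Re-attach the drive letter (if any) so the caller gets the full path.
    return (drive or "") + rest, int(lineno), content


windows_hit = r"C:\Users\foo\bar.py:42:content"

# The naive approach splits on the drive-letter colon, so the second
# field is a path fragment rather than a line number.
naive_parts = windows_hit.split(":", 2)

parsed = parse_match_line(windows_hit)
```

Because the path group is lazy, colons inside the matched content (for example a slice like `d[1:2]`) stay in the content field rather than being mistaken for separators.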

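Note on d63b363cde, c05c60665e and 913f8ce0a5 (atomic checkpoint writes): the commits name the pattern as temp file + fsync + os.replace. The sketch below shows one way that pattern can look. The signature and defaults are assumptions; the real utils.atomic_json_write() is not shown in this log and may differ.

```python
import json
import os
import tempfile
from pathlib import Path


def atomic_json_write(path, data, indent=2):
    """Write JSON so readers see either the old file or the new one.

    Illustrative sketch only; parameter names and defaults are assumed.
    """
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # Create the temp file in the target directory so os.replace stays
    # on one filesystem.
    fd, tmp_name = tempfile.mkstemp(
        dir=path.parent, prefix=path.name + ".", suffix=".tmp"
    )
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as f:
            json.dump(data, f, indent=indent, ensure_ascii=False)
            f.flush()
            os.fsync(f.fileno())  # push bytes to disk before the rename
        os.replace(tmp_name, path)  # swap the new file into place
    except BaseException:
        # On any failure, remove the temp file and leave the original as-is.
        try:
            os.unlink(tmp_name)
        except OSError:
            pass
        raise
```

If serialization fails partway through, only the temp file is affected, which is the "original preserved on error, no temp file leaks" behavior the test list in d63b363cde describes.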

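Note on 585f8528b2 (prefix matching for dated model names): a small sketch of the longest-match-wins idea from item 1 of that commit. The two prices are the ones quoted in the commit message; the table layout and the names PRICES, first_match_price and longest_match_price are hypothetical and do not reflect the actual insights code.

```python
# Prices quoted in the commit message; the table layout is assumed.
PRICES = {
    "gpt-4o": 2.50,
    "gpt-4o-mini": 0.15,
}


def first_match_price(model, table=PRICES):
    """Old behavior as described: return the first key that is a prefix."""
    for key, price in table.items():
        if model.startswith(key):
            return price
    return None


def longest_match_price(model, table=PRICES):
    """Longest-match-wins: prefer the most specific matching prefix."""
    matches = [key for key in table if model.startswith(key)]
    if not matches:
        return None  # unknown model: no price rather than a guess
    return table[max(matches, key=len)]


dated = "gpt-4o-mini-2024-07-18"
wrong = first_match_price(dated)    # matches "gpt-4o" first
right = longest_match_price(dated)  # matches "gpt-4o-mini"
```

With first-match, the result depends on the order of the table, which is why a dated mini model could be billed at the larger model's rate.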
5319 commits