hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-26 01:01:40 +00:00

Author	SHA1	Message	Date
Teknium	41560192c4	chore(attribution): add AUTHOR_MAP entry for nish3451 Adds the nish3451 noreply email to the AUTHOR_MAP so CI attribution checks pass for the #6100 Telegram DM fallback fix merged in `1a9a2d7f`.	2026-04-18 18:52:41 -07:00
Teknium	aa5f89d3ea	test: add coverage for from_user=None DM fallback Tests the three cases: - DM with from_user=None: user_id falls back to chat.id - Group with from_user=None: user_id stays None (safe default) - DM with from_user present: user_id uses from_user.id (no regression)	2026-04-18 18:18:01 -07:00
Nish	1a9a2d7fe8	fix(gateway/telegram): fall back to chat.id when from_user is None in DMs When `message.from_user` is None — which can happen for forwarded messages, anonymous admin mode in groups, or certain Telegram client edge cases — `_build_message_event` set `source.user_id` to None. This caused: 1. `_is_user_authorized()` to early-return False (`if not user_id: return False`) 2. The access check never compared against `TELEGRAM_ALLOWED_USERS` even when the user actually was in the allowlist 3. The pairing flow fired and generated a code for `user_id=None` 4. The pairing approval saved an entry under the literal string key "null" 5. The user was effectively locked out because their real user_id never matched the "null" key on subsequent messages For DMs (`chat_type == "dm"`), Telegram guarantees `chat.id == user.id` — they are the same numeric ID for private chats. Falling back to `chat.id` when `from_user` is None for DMs restores the expected access-control behavior without weakening it (group/channel chats correctly stay None). Also adds a parallel `user_name` fallback to `chat.full_name` so the display name still works in the same edge case.	2026-04-18 18:18:01 -07:00
Teknium	139a6da67c	fix(skills): touchdesigner-mcp setup.sh — correct pgrep match + suppress stray yaml output Discovered while dogfooding the skill end-to-end: - pgrep -if "TouchDesigner" matched any shell whose command line contained the substring (including the setup script's own invocation under certain wrappers), falsely reporting TD running on machines where it isn't. Switch to pgrep -x (exact process name match, supported on both macOS and Linux) and also check TouchDesignerFTE (the non-commercial variant). - The embedded python3 yaml-writer printed 'added' / 'exists' to stdout as status, which leaked a stray word into the setup output right before the ✔ line. Drop the print()s — the bash-level ✔/✘ is the status indicator.	2026-04-18 17:43:42 -07:00
Teknium	6b31e20894	chore(skills): touchdesigner-mcp follow-ups - Remove orphan skills/creative/touchdesigner/references/pitfalls.md left over from the rename commit (git add-then-edit instead of git mv meant the old file never got deleted). - Honour $HERMES_HOME in setup.sh and SKILL.md setup invocation so profile-aware installs work correctly. - Fix troubleshooting.md config path to use $HERMES_HOME instead of hardcoding ~/.hermes/. - Add touchdesigner-mcp entries to skills-catalog.md and optional-skills-catalog.md for parity with blender-mcp/meme-generation.	2026-04-18 17:43:42 -07:00
Teknium	11ee87e605	chore(attribution): add AUTHOR_MAP entry for kshitijk4poor@gmail.com Covers the non-noreply email used on commit `dd3e6424` (rename of the TouchDesigner skill to touchdesigner-mcp).	2026-04-18 17:43:42 -07:00
kshitijk4poor	6d2fe1d624	feat: rename touchdesigner -> touchdesigner-mcp, move to optional-skills/ - Rename skill to touchdesigner-mcp (matches blender-mcp convention) - Move from skills/creative/ to optional-skills/creative/ - Fix duplicate pitfall numbering (#3 appeared twice) - Update SKILL.md cross-references for renumbered pitfalls - Update setup.sh path for new directory location	2026-04-18 17:43:42 -07:00
kshitijk4poor	6f27390fae	feat: rewrite TouchDesigner skill for twozero MCP (v2.0.0) Major rewrite of the TouchDesigner skill: - Replace custom API handler with twozero MCP (36 native tools) - Add audio-reactive GLSL proven recipe (spectrum chain, pitfalls) - Add recording checklist (FPS>0, non-black, audio cueing) - Expand pitfalls: 38 entries from real sessions (was 20) - Update network-patterns with MCP-native build scripts - Rewrite mcp-tools reference for twozero v2.774+ - Update troubleshooting for MCP-based workflow - Remove obsolete custom_api_handler.py - Generalize Environment section for all users - Remove session-specific Paired Skills section - Bump version to 2.0.0	2026-04-18 17:43:42 -07:00
kshitijk4poor	7a5371b20d	feat: add TouchDesigner integration skill New skill: creative/touchdesigner — control a running TouchDesigner instance via REST API. Build real-time visual networks programmatically. Architecture: Hermes Agent -> HTTP REST (curl) -> TD WebServer DAT -> TD Python env Key features: - Custom API handler (scripts/custom_api_handler.py) that creates a self-contained WebServer DAT + callback in TD. More reliable than the official mcp_webserver_base.tox which frequently fails module imports. - Discovery-first workflow: never hardcode TD parameter names. Always probe the running instance first since names change across versions. - Persistent setup: save the TD project once with the API handler baked in. TD auto-opens the last project on launch, so port 9981 is live with zero manual steps after first-time setup. - Works via curl in execute_code (no MCP dependency required). - Optional MCP server config for touchdesigner-mcp-server npm package. Skill structure (2823 lines total): SKILL.md (209 lines) — setup, workflow, key rules, operator reference references/pitfalls.md (276 lines) — 24 hard-won lessons references/operators.md (239 lines) — all 6 operator families references/network-patterns.md (589 lines) — audio-reactive, generative, video processing, GLSL, instancing, live performance recipes references/mcp-tools.md (501 lines) — 13 MCP tool schemas references/python-api.md (443 lines) — TD Python scripting patterns references/troubleshooting.md (274 lines) — connection diagnostics scripts/custom_api_handler.py (140 lines) — REST API handler for TD scripts/setup.sh (152 lines) — prerequisite checker Tested on TouchDesigner 099 Non-Commercial (macOS/darwin).	2026-04-18 17:43:42 -07:00
Teknium	c49a58a6d0	fix(gateway): mark only still-running sessions resume_pending on drain timeout (#12332 ) Follow-up to #12301. The drain-timeout branch of _stop_impl() was iterating the drain-start snapshot (active_agents) when marking sessions resume_pending. That snapshot can include sessions that finished gracefully during the drain window — marking them would give their next turn a stray 'your previous turn was interrupted by a gateway restart' system note even though the prior turn actually completed cleanly. Iterate self._running_agents at timeout time instead, mirroring _interrupt_running_agents() exactly: - only sessions still blocking the shutdown get marked - pending sentinels (AIAgent construction not yet complete) are skipped Changes: - gateway/run.py: swap active_agents.keys() for filtered self._running_agents.items() iteration in the drain-timeout mark loop. - tests/gateway/test_restart_resume_pending.py: two regression tests — finisher-during-drain not marked, pending sentinel not marked.	2026-04-18 17:40:34 -07:00
Teknium	cb4addacab	fix(gateway): auto-resume sessions after drain-timeout restart (#11852 ) (#12301 ) The shutdown banner promised "send any message after restart to resume where you left off" but the code did the opposite: a drain-timeout restart skipped the .clean_shutdown marker, which made the next startup call suspend_recently_active(), which marked the session suspended, which made get_or_create_session() spawn a fresh session_id with a 'Session automatically reset. Use /resume...' notice — contradicting the banner. Introduce a resume_pending state on SessionEntry that is distinct from suspended. Drain-timeout shutdown flags active sessions resume_pending instead of letting startup-wide suspension destroy them. The next message on the same session_key preserves the session_id, reloads the transcript, and the agent receives a reason-aware restart-resume system note that subsumes the existing tool-tail auto-continue note (PR #9934). Terminal escalation still flows through the existing .restart_failure_counts stuck-loop counter (PR #7536, threshold 3) — no parallel counter on SessionEntry. suspended still wins over resume_pending in get_or_create_session() so genuinely stuck sessions converge to a clean slate. Spec: PR #11852 (BrennerSpear). Implementation follows the spec with the approved correction (reuse .restart_failure_counts rather than adding a resume_attempts field). Changes: - gateway/session.py: SessionEntry.resume_pending/resume_reason/ last_resume_marked_at + to_dict/from_dict; SessionStore .mark_resume_pending()/clear_resume_pending(); get_or_create_session() returns existing entry when resume_pending (suspended still wins); suspend_recently_active() skips resume_pending entries. - gateway/run.py: _stop_impl() drain-timeout branch marks active sessions resume_pending before _interrupt_running_agents(); _run_agent() injects reason-aware restart-resume system note that subsumes the tool-tail case; successful-turn cleanup also clears resume_pending next to _clear_restart_failure_count(); _notify_active_sessions_of_shutdown() softens the restart banner to 'I'll try to resume where you left off' (honest about stuck-loop escalation). - tests/gateway/test_restart_resume_pending.py: 29 new tests covering SessionEntry roundtrip, mark/clear helpers, get_or_create_session precedence (suspended > resume_pending), suspend_recently_active skip, drain-timeout mark reason (restart vs shutdown), system-note injection decision tree (including tool-tail subsumption), banner wording, and stuck-loop escalation override.	2026-04-18 17:32:17 -07:00
brooklyn!	ad99e32371	Merge pull request #12312 from NousResearch/bb/tui-ux-pack feat(tui): UX pack — stable picker keys, /clear confirm, light-theme preset	2026-04-18 18:13:06 -05:00
Brooklyn Nicholson	df5ca5065f	feat(tui): replace /clear double-press gate with a proper confirm overlay The time-window gate felt wrong — users would hit /clear, read the prompt, retype, and consistently blow past the window. Swapping to a real yes/no overlay that blocks input like the existing Approval and Clarify prompts. - add ConfirmReq type + OverlayState.confirm + $isBlocked coverage - ConfirmPrompt component (prompts.tsx): cancel row on top as the default, danger-coloured confirm row on the bottom, Y/N hotkeys, Enter on default = cancel, Esc/Ctrl+C cancel - wire into PromptZone (appOverlays.tsx) - /clear + /new now push onto the overlay instead of arming a timer - HERMES_TUI_NO_CONFIRM=1 still skips the prompt for scripting - drop the destructiveGate + createSlashHandler reset wiring (destructive.ts and its tests removed) Refs #4069.	2026-04-18 18:04:08 -05:00
Brooklyn Nicholson	75377feb07	fix(tui): make /clear confirm window humane (3s → 30s, reset on other slash) The 3s gate was too tight — users reading the prompt and retyping consistently blow past it and get stuck in a loop ("press /clear again within 3s" forever). Fixes: - bump CONFIRM_WINDOW_MS 3_000 → 30_000 - drop the time number from the confirmation message to remove the pressure vibe: "press /clear again to confirm — starts a new session" - reset the gate from createSlashHandler whenever any non-destructive slash command runs, so stale arming from 20s ago can't silently turn the next /clear into an unintended confirm - export the gate + isDestructiveCommand helper for that wiring - add armed() introspection method Follow-up to #4069 / `3366714b`.	2026-04-18 17:55:53 -05:00
Brooklyn Nicholson	20eab355e7	feat(tui): add LIGHT_THEME preset for white/light terminal backgrounds Splits the existing palette into DARK_THEME (current yellow-heavy default) and LIGHT_THEME (darker browns + proper contrast on white). DEFAULT_THEME aliases DARK_THEME, and flips to LIGHT_THEME when HERMES_TUI_LIGHT=1 is set at launch. Skin system (fromSkin) still layers on top of whichever preset is active, so users can keep customizing on top of either palette. Refs #11300.	2026-04-18 17:49:40 -05:00
Brooklyn Nicholson	3366714ba4	feat(tui): double-press confirm on /clear and /new Prevents accidental session loss: the first press prints "press /clear again within 3s to confirm"; a second press inside the window actually starts a new session. Outside the window the gate re-arms. Opt out with HERMES_TUI_NO_CONFIRM=1 for scripted / muscle-memory workflows. Refs #4069.	2026-04-18 17:48:34 -05:00
Brooklyn Nicholson	52124384de	fix(tui): stable React keys in /model picker rows Use provider.slug (and a composite key for model rows) instead of the rendered string, so dupes in the backend response can't collapse two rows into one or trigger key-collision warnings.	2026-04-18 17:47:26 -05:00
brooklyn!	db59c190c1	Merge pull request #12305 from NousResearch/bb/tui-status-git-branch feat(tui): append git branch to cwd label in status bar	2026-04-18 17:27:40 -05:00
brooklyn!	c0edcf2d53	Merge pull request #12306 from NousResearch/bb/tui-model-picker-dedupe-names fix(tui): disambiguate /model picker rows when provider display names collide	2026-04-18 17:27:31 -05:00
Brooklyn Nicholson	4aa52590d8	fix(tui): disambiguate /model picker rows when provider display names collide If the gateway returns two providers that resolve to the same display name (e.g. `kimi-coding` and `kimi-coding-cn` both → "Kimi For Coding"), the picker now appends the slug so users can tell them apart, in both the provider list and the selected-provider header. No-op when names are already unique. Refs #10526 — the Python backend dedupe from #10599 skips one alias, but user-defined providers, canonical overlays, and future regressions can still surface as indistinguishable rows in the picker. This is a client-side safety net on top of that.	2026-04-18 17:22:23 -05:00
Brooklyn Nicholson	ff2aa7ccd7	feat(tui): append git branch to cwd label in status bar Adds useGitBranch hook (async, cached, 15s TTL) and fmtCwdBranch helper so the footer shows `~/repo (main)` instead of just `~/repo`. Degrades silently when git is unavailable or cwd is outside a repo. Partial fix for #12267 (TUI portion; #12277 covers the Python side).	2026-04-18 17:17:05 -05:00
Teknium	0175ff7516	feat(skills): replace xitter with xurl — the official X API CLI (#12303 ) Swap the social-media/xitter skill (third-party wrapper around Infatoshi/x-cli) for a new social-media/xurl skill wrapping xdevplatform/xurl — the official X API CLI from the X developer platform team. Why: - xurl is officially maintained by the X dev platform team - OAuth 2.0 PKCE with auto-refresh + multi-app / multi-user support (vs. xitter's 5-env-var OAuth 1.0a + single account) - Credentials stored in ~/.xurl managed by xurl itself — no manual env var juggling for users - Substantially larger API surface: DMs, follows, blocks, mutes, media upload, streaming, and raw v2 endpoint access - Ships stronger agent-safety guardrails (forbidden-flag list, no --verbose in agent mode, never-read-~/.xurl rule) Adaptation: - Ported the openclaw SKILL.md (which the xdevplatform team seeded) to Hermes frontmatter conventions (prerequisites.commands, platforms, metadata.hermes.tags/homepage) — dropped openclaw-specific metadata - Added a Hermes-oriented one-time user setup section so the agent knows to direct the user to run auth commands themselves, never execute them with inline secrets - Preserved the mandatory secret-safety rules verbatim - Attribution block credits xdevplatform, openclaw, and the Hermes port Docs: updated website/docs/reference/skills-catalog.md to replace the xitter row with xurl.	2026-04-18 15:11:32 -07:00
Teknium	6a3a6a0fb6	Merge pull request #12263 from NousResearch/bb/tui-audit-followup fix(tui): TUI v2 audit follow-up — registry, overlays, paste, reasoning, hyperlinks	2026-04-18 14:40:16 -07:00
helix4u	4e8f60fd11	fix(cli): use display width for wrapped spinner height	2026-04-18 14:34:05 -07:00
Brooklyn Nicholson	fb06bc67de	fix(tui): Ctrl+C with input selection actually preserves input (lift handler to app level) Previous fix in 9dbf1ec6 handled Ctrl+C inside textInput but the APP-level useInputHandlers fires the same keypress in a separate React hook and ran clearIn() regardless. Net effect: the OSC 52 copy succeeded but the input wiped right after, so Brooklyn only noticed the wipe. Lift the selection-aware Ctrl+C to a single place by threading input selection state through a new nanostore (src/app/inputSelectionStore.ts). textInput syncs its derived `selected` range + a clear() callback to the store on every selection change, and the app-level Ctrl+C handler reads the store before its clear/interrupt/die chain: - terminal-level selection (scrollback) → copy, existing behavior - in-input selection present → copy + clear selection, preserve input - input has text, no selection → clearIn(), existing behavior - empty + busy → interrupt turn - empty + idle → die textInput no longer has its own Ctrl+C block; keypress falls through to app-level like it did before 9dbf1ec6.	2026-04-18 16:28:51 -05:00
Brooklyn Nicholson	bfac5d039d	Merge branch 'main' of github.com:NousResearch/hermes-agent into bb/tui-audit-followup	2026-04-18 15:27:40 -05:00
Brooklyn Nicholson	17e95a26b7	fix(tui): render /skills browse as a formatted Panel instead of raw JSON Previous handler dumped the raw skills.manage response into a pager, which was unreadable and hid the pagination metadata. Also silently accepted non-numeric page args. Now: - validates page arg (rejects NaN / <1 with a usage message) - shows "fetching community skills (scans 6 sources, may take ~15s)…" up front so the 10-30s hub fetch isn't a silent hang - renders items as {name · trust, description (truncated 160 chars)} rows in the existing Panel component - footer shows "page X of Y · N skills total · /skills browse N+1 for more" when the server returned pagination metadata Skills hub's remote fetch latency is a separate upstream issue (browse_skills hits 6 sources sequentially) — client-side we just stop misrepresenting it.	2026-04-18 15:22:43 -05:00
Brooklyn Nicholson	7e9a098574	chore: uptick	2026-04-18 15:17:42 -05:00
Brooklyn Nicholson	450ded98db	chore(tui): prettier whitespace on files touched in this branch	2026-04-18 15:13:31 -05:00
Brooklyn Nicholson	93b4080b78	Merge branch 'main' of github.com:NousResearch/hermes-agent into bb/tui-audit-followup # Conflicts: # ui-tui/src/components/markdown.tsx # ui-tui/src/types/hermes-ink.d.ts	2026-04-18 14:52:54 -05:00
helix4u	ca32a2a60b	fix(gemini): restore bearer auth on openai route	2026-04-18 12:52:01 -07:00
helix4u	a7dd6a3449	fix(gemini): hide stale and low-TPM Google models	2026-04-18 12:52:01 -07:00
helix4u	2eab7ee15f	fix(gemini): hide low-TPM Gemma models from exposed lists	2026-04-18 12:52:01 -07:00
LVT382009	f7af90e2da	fix: wire _ephemeral_max_output_tokens into chat_completions and add NVIDIA NIM default Based on #12152 by @LVT382009. Two fixes to run_agent.py: 1. _ephemeral_max_output_tokens consumption in chat_completions path: The error-recovery ephemeral override was only consumed in the anthropic_messages branch of _build_api_kwargs. All chat_completions providers (OpenRouter, NVIDIA NIM, Qwen, Alibaba, custom, etc.) silently ignored it. Now consumed at highest priority, matching the anthropic pattern. 2. NVIDIA NIM max_tokens default (16384): NVIDIA NIM falls back to a very low internal default when max_tokens is omitted, causing models like GLM-4.7 to truncate immediately (thinking tokens exhaust the budget before the response starts). 3. Progressive length-continuation boost: When finish_reason='length' triggers a continuation retry, the output budget now grows progressively (2x base on retry 1, 3x on retry 2, capped at 32768) via _ephemeral_max_output_tokens. Previously the retry loop just re-sent the same token limit on all 3 attempts.	2026-04-18 12:51:30 -07:00
jarvischer	0f778f7768	fix: prevent tool name duplication in streaming accumulator (MiniMax/NVIDIA NIM) Based on #11984 by @maxchernin. Fixes #8259. Some providers (MiniMax M2.7 via NVIDIA NIM) resend the full function name in every streaming chunk instead of only the first. The old accumulator used += which concatenated them into 'read_fileread_file'. Changed to simple assignment (=), matching the OpenAI Node SDK, LiteLLM, and Vercel AI SDK patterns. Function names are atomic identifiers delivered complete — no provider splits them across chunks, so concatenation was never correct semantics.	2026-04-18 12:50:32 -07:00
Brooklyn Nicholson	4caf6c23dd	fix(tui): strip <think>…</think> tags from assistant content and route to reasoning panel Models that emit reasoning inline as <think>/<reasoning>/<thinking>/<thought>/ <REASONING_SCRATCHPAD> tags in the content field (rather than a separate API reasoning channel) had the raw tags + inner content shown twice: once as body text with literal <think> markers, and again in the thinking panel when the reasoning field was populated. Port v1's tag set to lib/reasoning.ts with a splitReasoning(text) helper that returns { reasoning, text }. Applied in three spots: - scheduleStreaming: strips tags from the live streaming view so the user never sees <think> mid-turn. - flushStreamingSegment: when a tool interrupts assistant output mid-turn, the saved segment is the stripped text; extracted reasoning promotes to reasoningText if the API channel hasn't already populated it. - recordMessageComplete: final message text is split, extracted reasoning merges with any existing reasoning (API channel wins on conflicts so we don't double-count when both are present).	2026-04-18 14:46:38 -05:00
Brooklyn Nicholson	37cba82bfc	fix(tui): Ctrl+C on in-input selection copies to clipboard instead of clearing Before: textInput explicitly ignored Ctrl+C so the app-level handler took over — with no knowledge of the TextInput's own selection — and fell through to clearIn() whenever input had text. Selecting part of the composer and pressing Ctrl+C silently nuked everything you typed. Now: Ctrl+C with an active in-input selection writes the selected substring to the clipboard via OSC 52 and clears the selection. The original semantics (Ctrl+C with no selection → app-level interrupt/clear/die chain) are preserved by still returning early in that case.	2026-04-18 14:42:03 -05:00
Teknium	0bebf5b948	chore(attribution): add AUTHOR_MAP entry for Honghua Yang (honghua)	2026-04-18 12:40:56 -07:00
Honghua Yang	3128d9fcd2	fix(context_compressor): keep tool-call arguments JSON valid when shrinking Pass 3 of `_prune_old_tool_results` previously shrunk long `function.arguments` blobs by slicing the raw JSON string at byte 200 and appending the literal text `...[truncated]`. That routinely produced payloads like:: {"path": "/foo.md", "content": "# Long markdown ...[truncated] — an unterminated string with no closing brace. Strict providers (observed on MiniMax) reject this as `invalid function arguments json string` with a non-retryable 400. Because the broken call survives in the session history, every subsequent turn re-sends the same malformed payload and gets the same 400, locking the session into a re-send loop until the call falls out of the window. Fix: parse the arguments first, shrink long string leaves inside the parsed structure, and re-serialise. Non-string values (paths, ints, booleans, lists) pass through intact. Arguments that are not valid JSON to begin with (rare, some backends use non-JSON tool args) are returned unchanged rather than replaced with something neither we nor the provider can parse. Observed in the wild: a `write_file` with ~800 chars of markdown `content` triggered this on a real session against MiniMax-M2.7; every turn after compression got rejected until the session was manually reset. Tests: - 7 direct tests of `_truncate_tool_call_args_json` covering valid-JSON output, non-JSON pass-through, nested structures, non-string leaves, scalar JSON, and Unicode preservation - 1 end-to-end test through `_prune_old_tool_results` Pass 3 that reproduces the exact failure payload shape from the incident Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:40:56 -07:00
Brooklyn Nicholson	5c8b291607	fix(tui): wrap markdown links in Link so Ghostty/iTerm/kitty get real OSC 8 hyperlinks renderLink was discarding the URL entirely — it rendered the label as amber underlined text and dropped the href. Result: Cmd+Click / Ctrl+Click did nothing in any terminal, including Ghostty. Now both markdown links `[label](url)` and bare `https://…` URLs are wrapped in @hermes/ink's Link component, which emits OSC 8 (\\x1b]8;;url\\x07label\\x1b]8;;\\x07) when supportsHyperlinks() returns true. ADDITIONAL_HYPERLINK_TERMINALS already includes ghostty, iTerm2, kitty, alacritty, Hyper. Autolinks that look like bare emails (foo@bar.com) now prepend mailto: in the href so they open the mail client correctly. Also adds a typed declaration for Link in hermes-ink.d.ts.	2026-04-18 14:39:24 -05:00
Brooklyn Nicholson	a7f4d756b7	fix(tui): cap approval prompt command preview at 10 lines Large inline scripts (e.g. Python code_execution bodies) rendered as a single unbounded <Text> block, pushing the Allow/Deny options below the visible viewport. Users had to scroll the terminal to vote. Preview now shows the first 10 lines with truncate-end wrap per line and a dim "… +N more lines" indicator. Full text remains in the transcript above.	2026-04-18 14:36:34 -05:00
Teknium	b73ebfee30	chore(attribution): add AUTHOR_MAP entry for Jim Liu (JimLiu) Maps junminliu@gmail.com → JimLiu for the baoyu-infographic skill port co-author attribution.	2026-04-18 12:32:16 -07:00
Teknium	ade7958f1f	docs: add PORT_NOTES.md for baoyu-infographic Documents what changed from upstream and how to sync future updates.	2026-04-18 12:32:16 -07:00
Teknium	65c0a30a77	feat(skills): add baoyu-infographic skill — 21 layouts × 21 styles Port of baoyu-infographic from JimLiu/baoyu-skills (v1.56.1) adapted for Hermes Agent's tool ecosystem. Adaptations from upstream: - Frontmatter: openclaw metadata → hermes metadata - Usage: slash command syntax → natural language triggers - Removed EXTEND.md config system (not part of Hermes infrastructure) - AskUserQuestion → clarify tool (one question at a time) - Image generation → image_generate tool - Removed Windows-specific paths - Simplified file operations to use Hermes file tools - All 45 reference files (layouts, styles, templates) preserved intact Attribution preserved per agreement with 宝玉 (Jim Liu): - author, version, GitHub homepage URL in frontmatter Co-authored-by: Jim Liu 宝玉 <junminliu@gmail.com>	2026-04-18 12:32:16 -07:00
Siddharth Balyan	a828daa7f8	perf(docker): layer-cache npm/Playwright and skip redundant web rebuild (#12225 ) * perf(docker): layer-cache npm/Playwright and skip redundant web rebuild Copy package manifests before source so npm install + Playwright only re-run when lockfiles change. Use COPY --chown instead of chown -R, set HERMES_WEB_DIST to skip runtime web rebuild, and drop the USER root / chmod dance since entrypoint.sh is already executable in git. * Update Dockerfile	2026-04-18 22:44:31 +05:30
bluefishs	b0bde98b0f	fix(docker): build web/ dashboard assets in image (#12180 ) The Dockerfile installs root-level npm dependencies (for Playwright) and the whatsapp-bridge bundle, but never builds the web/ Vite project. As a result, 'hermes dashboard' starts FastAPI on :9119 but serves a broken SPA because hermes_cli/web_dist/ is empty and requests to /assets/index-<hash>.js 404. Add a build step inside web/ so the Vite output is baked into the image. Reproduce (before): docker build -t hermes-repro -f Dockerfile . docker run --rm -p 9119:9119 hermes-repro hermes dashboard curl -sI http://localhost:9119/assets/ \| head -1 # -> 404 After: /assets/ returns the built asset path.	2026-04-18 22:20:24 +05:30
kshitij	c14b3b5880	fix(kimi): force fixed temperature on kimi-k2.* models (k2.5, thinking, turbo) (#12144 ) * fix(kimi): force fixed temperature on kimi-k2.* models (k2.5, thinking, turbo) The prior override only matched the literal model name "kimi-for-coding", but Moonshot's coding endpoint is hit with real model IDs such as `kimi-k2.5`, `kimi-k2-turbo-preview`, `kimi-k2-thinking`, etc. Those requests bypassed the override and kept the caller's temperature, so Moonshot returns HTTP 400 "invalid temperature: only 0.6 is allowed for this model" (or 1.0 for thinking variants). Match the whole kimi-k2.* family: * kimi-k2-thinking / kimi-k2-thinking-turbo -> 1.0 (thinking mode) * all other kimi-k2.* -> 0.6 (non-thinking / instant mode) Also accept an optional vendor prefix (e.g. `moonshotai/kimi-k2.5`) so aggregator routings are covered. * refactor(kimi): whitelist-match kimi coding models instead of prefix Addresses review feedback on PR #12144. - Replace `startswith("kimi-k2")` with explicit frozensets sourced from Moonshot's kimi-for-coding model list. The prefix match would have also clamped `kimi-k2-instruct` / `kimi-k2-instruct-0905`, which are the separate non-coding K2 family with variable temperature (recommended 0.6 but not enforced — see huggingface.co/moonshotai/Kimi-K2-Instruct). - Confirmed via platform.kimi.ai docs that all five coding models (k2.5, k2-turbo-preview, k2-0905-preview, k2-thinking, k2-thinking-turbo) share the fixed-temperature lock, so the preview-model mapping is no longer an assumption. - Drop the fragile `"thinking" in bare` substring test for a set lookup. - Log a debug line on each override so operators can see when Hermes silently rewrites temperature. - Update class docstring. Extend the negative test to parametrize over kimi-k2-instruct, Kimi-K2-Instruct-0905, and a hypothetical future kimi-k2-experimental name — all must keep the caller's temperature.	2026-04-18 09:35:51 -07:00
kshitijk4poor	656c375855	fix(tui): review follow-up — /retry, /plan, ANSI truncation, caching - /retry: use session['history'] instead of non-existent agent.conversation_history; truncate history at last user message to match CLI retry_last() behavior; add history_lock safety - /plan: pass user instruction (arg) to build_plan_path instead of session_key; add runtime_note so agent knows where to save the plan - ANSI tool results: render full text via <Ansi wrap=truncate-end> instead of slicing raw ANSI through compactPreview (which cuts mid-escape-sequence producing garbled output) - Move _PENDING_INPUT_COMMANDS frozenset to module level - Use get_skill_commands() (cached) instead of scan_skill_commands() (rescans disk) in slash.exec skill interception - Add 3 retry tests: happy path with history truncation verification, empty history error, multipart content extraction - Update test mock target from scan_skill_commands to get_skill_commands	2026-04-18 09:30:48 -07:00
kshitijk4poor	abc95338c2	fix(tui): slash.exec _pending_input commands, tool ANSI, terminal title Additional TUI fixes discovered in the same audit: 1. /plan slash command was silently lost — process_command() queues the plan skill invocation onto _pending_input which nobody reads in the slash worker subprocess. Now intercepted in slash.exec and routed through command.dispatch with a new 'send' dispatch type. Same interception added for /retry, /queue, /steer as safety nets (these already have correct TUI-local handlers in core.ts, but the server-side guard prevents regressions if the local handler is bypassed). 2. Tool results were stripping ANSI escape codes — the messageLine component used stripAnsi() + plain <Text> for tool role messages, losing all color/styling from terminal, search_files, etc. Now uses <Ansi> component (already imported) when ANSI is detected. 3. Terminal tab title now shows model + busy status via useTerminalTitle hook from @hermes/ink (was never used). Users can identify Hermes tabs and see at a glance whether the agent is busy or ready. 4. Added 'send' variant to CommandDispatchResponse type + asCommandDispatch parser + createSlashHandler handler for commands that need to inject a message into the conversation (plan, queue fallback, steer fallback).	2026-04-18 09:30:48 -07:00
kshitijk4poor	2da558ec36	fix(tui): clickable hyperlinks and skill slash command dispatch Two TUI fixes: 1. Hyperlinks are now clickable (Cmd+Click / Ctrl+Click) in terminals that support OSC 8. The markdown renderer was rendering links as plain colored text — now wraps them in the existing <Link> component from @hermes/ink which emits OSC 8 escape sequences. 2. Skill slash commands (e.g. /hermes-agent-dev) now work in the TUI. The slash.exec handler was delegating to the _SlashWorker subprocess which calls cli.process_command(). For skills, process_command() queues the invocation message onto _pending_input — a Queue that nobody reads in the worker subprocess. The skill message was lost. Now slash.exec detects skill commands early and rejects them so the TUI falls through to command.dispatch, which correctly builds and returns the skill payload for the client to send().	2026-04-18 09:30:48 -07:00

1 2 3 4 5 ...

4842 commits