hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-29 01:31:41 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	11b2942f16	fix(tui): anchor inline_diff to the segment where the edit happened Revisits #13729. That PR buffered each `tool.complete`'s inline_diff and merged them into the final assistant message body as a fenced ```diff block. The merge-at-end placement reads as "the agent wrote this after the summary", even when the edit fired mid-turn — which is both misleading and (per blitz feedback) feels like noise tacked onto the end of every task. Segment-anchored placement instead: - On tool.complete with inline_diff, `pushInlineDiffSegment` calls `flushStreamingSegment` first (so any in-progress narration lands as its own segment), then pushes the ```diff block as its own segment into segmentMessages. The diff is now anchored BETWEEN the narration that preceded the edit and whatever the agent streams afterwards, which is where the edit actually happened. - `recordMessageComplete` no longer merges buffered diffs. The only remaining dedupe is "drop diff-only segments whose body the final assistant text narrates verbatim (or whose diff fence the final text already contains)" — same tradeoff as before, kept so an agent that narrates its own diff doesn't render two stacked copies. - Drops `pendingInlineDiffs` and `queueInlineDiff` — buffer + end- merge machinery is gone; segmentMessages is now the only source of truth. Side benefit: Ctrl+C interrupt (`interruptTurn`) iterates segmentMessages, so diff segments are now preserved in the transcript when the user cancels after an edit. Previously the pending buffer was silently dropped on interrupt. Reported by Teknium during blitz usage: "no diffs are ever at the end because it didn't make this file edit after the final message".	2026-04-23 19:02:44 -05:00
Brooklyn Nicholson	eda400d8a5	chore: uptick	2026-04-22 11:32:17 -05:00
Brooklyn Nicholson	7785654ad5	feat(tui): subagent spawn observability overlay Adds a live + post-hoc audit surface for recursive delegate_task fan-out. None of cc/oc/oclaw tackle nested subagent trees inside an Ink overlay; this ships a view-switched dashboard that handles arbitrary depth + width. Python - delegate_tool: every subagent event now carries subagent_id, parent_id, depth, model, tool_count; subagent.complete also ships input/output/ reasoning tokens, cost, api_calls, files_read/files_written, and a tail of tool-call outputs - delegate_tool: new subagent.spawn_requested event + _active_subagents registry so the overlay can kill a branch by id and pause new spawns - tui_gateway: new RPCs delegation.status, delegation.pause, subagent.interrupt, spawn_tree.save/list/load (disk under \$HERMES_HOME/spawn-trees/<session>/<ts>.json) TUI - /agents overlay: full-width list mode (gantt strip + row picker) and Enter-to-drill full-width scrollable detail mode; inverse+amber selection, heat-coloured branch markers, wall-clock gantt with tick ruler, per-branch rollups - Detail pane: collapsible accordions (Budget, Files, Tool calls, Output, Progress, Summary); open-state persists across agents + mode switches via a shared atom - /replay [N\|last\|list\|load <path>] for in-memory + disk history; /replay-diff <a> <b> for side-by-side tree comparison - Status-bar SpawnHud warns as depth/concurrency approaches caps; overlay auto-follows the just-finished turn onto history[1] - Theme: bump DARK dim #B8860B → #CC9B1F for readable secondary text globally; keep LIGHT untouched Tests: +29 new subagentTree unit tests; 215/215 passing.	2026-04-22 10:38:17 -05:00
Brooklyn Nicholson	a8eb13e828	fix(tui): dedupe inline diffs, strip CLI review-diff header After the prior inline-diff fix, the gateway still prepends a literal " ┊ review diff" line to inline_diff (it's terminal chrome written by `_emit_inline_diff`). Wrapping that in a ```diff fence left that header inside the code block. The agent also often narrates its own edit in a second fenced diff, so the assistant message ended up stacking two diff blocks for the same change. - Strip the leading "┊ review diff" header from queued inline diffs before fencing. - Skip appending the fenced diff entirely when the assistant already wrote its own ```diff (or ```patch) fence. Keeps the single-surface diff UX even when the agent is chatty.	2026-04-21 19:21:00 -05:00
Brooklyn Nicholson	9654c9fb10	fix(tui): dedupe inline_diff when assistant already echoes it Avoid duplicate diff rendering in #13729 flow. We now skip queued inline diffs that are already present in final assistant text and dedupe repeated queued diffs by exact content.	2026-04-21 19:06:49 -05:00
Brooklyn Nicholson	31b3b09ea4	fix(tui): render inline diffs inside assistant completion Follow-up for #13729: segment-level system artifacts still looked detached in real flow.\n\nInstead of appending inline_diff as a standalone segment/system row, queue sanitized diffs during tool.complete and append them as a fenced diff block to the assistant completion text on message.complete. This keeps the diff in the same message flow as the assistant response.	2026-04-21 19:02:53 -05:00
Brooklyn Nicholson	bddf0cd61e	fix(tui): keep inline diffs below tool rows and strip ANSI Follow-up on #13729 from blitz screenshot feedback.\n\n- When tool.complete carried inline_diff but no buffered assistant text existed, pending tool rows were still in streamPendingTools, so diff rendered above the tool row section. appendSegmentMessage now emits pending tool rows as a trail segment before appending the diff artifact.\n- Strip ANSI color escapes from inline_diff payloads so we don't render loud red/green terminal palettes in the transcript.	2026-04-21 18:50:42 -05:00
Brooklyn Nicholson	dff1c8fcf1	fix(tui): tool inline_diff renders inline with the active turn Reported during TUI v2 blitz retest: code-review diffs from tool.complete appeared at the top of the current interaction thread, out of sequence with the agent's messages and tool rows below them. Root cause — `sys(inline_diff)` appends to `historyItems`, which sits above the `StreamingAssistant` pane that renders the active turn. Until the turn closed, the diff visually floated above everything else happening in the same turn. Route the diff through `turnController.appendSegmentMessage` instead so it flushes any pending streaming text first, then lands in the segment stream beside assistant output and tool calls. On `message.complete` the segment list is committed to history in emit order (diff → final text), matching what the gateway sent. Adds a regression test that exercises tool.complete → message.complete with an inline_diff payload and asserts both the streaming and final placement.	2026-04-21 18:35:59 -05:00
Brooklyn Nicholson	dd5ead1007	fix(tui): preserve prior segment output on Ctrl+C interrupt interruptTurn only flushed the in-flight streaming chunk (bufRef) to the transcript before calling idle(), which wiped segmentMessages and pendingSegmentTools. Every tool call and commentary line the agent had already emitted in the current turn disappeared the moment the user cancelled, even though that output is exactly what they want to keep when they hit Ctrl+C (quote from the blitz feedback: "everything was fine up until the point where you wanted to push to main"). Append each flushed segment message to the transcript first, then render the in-flight partial with the `[interrupted]` marker and its pendingSegmentTools. Sys-level "interrupted" note still fires when there is nothing to preserve.	2026-04-21 14:48:50 -05:00
Brooklyn Nicholson	4caf6c23dd	fix(tui): strip <think>…</think> tags from assistant content and route to reasoning panel Models that emit reasoning inline as <think>/<reasoning>/<thinking>/<thought>/ <REASONING_SCRATCHPAD> tags in the content field (rather than a separate API reasoning channel) had the raw tags + inner content shown twice: once as body text with literal <think> markers, and again in the thinking panel when the reasoning field was populated. Port v1's tag set to lib/reasoning.ts with a splitReasoning(text) helper that returns { reasoning, text }. Applied in three spots: - scheduleStreaming: strips tags from the live streaming view so the user never sees <think> mid-turn. - flushStreamingSegment: when a tool interrupts assistant output mid-turn, the saved segment is the stripped text; extracted reasoning promotes to reasoningText if the API channel hasn't already populated it. - recordMessageComplete: final message text is split, extracted reasoning merges with any existing reasoning (API channel wins on conflicts so we don't double-count when both are present).	2026-04-18 14:46:38 -05:00
Brooklyn Nicholson	fd6ffc777f	feat(tui): honor display.* flags in turn renderer, status bar, and event handler - turnController gates scheduleStreaming / reasoning recorders on streaming + showReasoning so disabling them keeps the buffer silent until message.complete flushes - createGatewayEventHandler only surfaces inline_diff previews when inlineDiffs is on - StatusRule takes a showCost prop and renders `· $X.XXXX` with the same toFixed(4) formatting as /usage when usage.cost_usd is present - Usage grows cost_usd?: number to match the gateway payload - Existing handler tests flip showReasoning on in beforeEach so reasoning-flow assertions keep their meaning	2026-04-18 09:42:57 -05:00
Brooklyn Nicholson	bedbeebbc8	feat(tui): interleave tool rows into live assistant turns Live turn rendering used to show the streaming assistant text as one blob with tool calls pooled in a separate section below, so the live view drifted from the reload view (which threads tool rows inline via toTranscriptMessages). Model now mirrors reload: - turnStore gains streamSegments (completed assistant chunks, each with any tool rows that landed between its predecessor and itself) and streamPendingTools (tool rows waiting for the next chunk) - turnController.flushStreamingSegment() seals the current bufRef into a segment when a new tool.start fires; pending tools get attached to that next chunk so order matches reload hydration - recordMessageComplete returns finalMessages instead of one payload, so appendMessage gets the same shape for live-ending turns as for reloaded ones - appLayout renders segments before the progress/streaming area, and the streaming message + pending-tools fallback carry whatever tools arrived after the last assistant chunk	2026-04-17 11:33:29 -05:00
Brooklyn Nicholson	5b386ced71	fix(tui): approval flow + input ergonomics + selection perf - tui_gateway: route approvals through gateway callback (HERMES_GATEWAY_SESSION/ HERMES_EXEC_ASK) so dangerous commands emit approval.request instead of silently falling through the CLI input() path and auto-denying - approval UX: dedicated PromptZone between transcript and composer, safer defaults (sel=0, numeric quick-picks, no Esc=deny), activity trail line, outcome footer under the cost row - text input: Ctrl+A select-all, real forward Delete, Ctrl+W always consumed (fixes Ctrl+Backspace at cursor 0 inserting literal w) - hermes-ink selection: swap synchronous onRender() for throttled scheduleRender() on drag, and only notify React subscribers on presence change — no more per-cell paint/subscribe spam - useConfigSync: silence config.get polling failures instead of surfacing 'error: timeout: config.get' in the transcript	2026-04-17 10:37:48 -05:00
Brooklyn Nicholson	68ecdb6e26	refactor(tui): store-driven turn state + slash registry + module split Hoist turn state from a 286-line hook into $turnState atom + turnController singleton. createGatewayEventHandler becomes a typed dispatch over the controller; its ctx shrinks from 30 fields to 5. Event-handler refs and 16 threaded actions are gone. Fold three createSlash*Handler factories into a data-driven SlashCommand[] registry under slash/commands/{core,session,ops}.ts. Aliases are data; findSlashCommand does name+alias lookup. Shared guarded/guardedErr combinator in slash/guarded.ts. Split constants.ts + app/helpers.ts into config/ (timing/limits/env), content/ (faces/placeholders/hotkeys/verbs/charms/fortunes), domain/ (roles/ details/messages/paths/slash/viewport/usage), protocol/ (interpolation/paste). Type every RPC response in gatewayTypes.ts (26 new interfaces); drop all `(r: any)` across slash + main app. Shrink useMainApp from 1216 -> 646 lines by extracting useSessionLifecycle, useSubmission, useConfigSync. Add <Fg> themed primitive and strip ~50 `as any` color casts. Tests: 50 passing. Build + type-check clean.	2026-04-16 12:34:45 -05:00

14 commits