hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-14 09:11:54 +00:00

Author	SHA1	Message	Date
Teknium	197337cc47	fix(gateway): suppress duplicate final stream sends (#45517 )	2026-06-13 03:23:44 -07:00
Teknium	8cf9d8689d	fix(desktop): keep composer usable during reconnect (#45488 ) * feat(cli): add --safe-mode troubleshooting flag Inspired by Claude Code v2.1.169 (June 2026): run Hermes with all customizations disabled to isolate setup problems from product bugs. --safe-mode implies --ignore-user-config and --ignore-rules, and additionally skips plugin discovery (hermes_cli/plugins.py) and MCP server loading (tools/mcp_tool.py) via the internal HERMES_SAFE_MODE env bridge. * fix(desktop): keep composer usable during reconnect	2026-06-13 02:36:09 -07:00
brooklyn!	b62e57b2f4	Merge pull request #45445 from NousResearch/bb/desktop-stick-to-bottom fix(desktop): stabilize thread scrolling and session switching	2026-06-13 04:14:49 -05:00
Teknium	bc060c7c1c	fix(models): remove unavailable claude-fable-5 (#45492 )	2026-06-13 02:03:50 -07:00
Teknium	3803e5fc28	fix(agent): don't treat custom:<name> pools as cross-provider mismatch (#45289 ) Custom endpoints carry two naming conventions for the same provider: the agent's provider attribute is the generic 'custom' label while the pool is keyed 'custom:<normalized-name>'. The defensive guard in recover_with_credential_pool compared them literally, logged 'Credential pool provider mismatch: pool=custom:<name>, agent=custom', and skipped recovery — so 401 refresh and 429 rotation never ran for ANY custom-provider user (seen in the field on a Fireworks setup whose dead key burned full retry cycles every turn with the skip warning on each one). Accept the pair only when the agent's CURRENT base_url resolves to the same pool key via get_custom_provider_pool_key, preserving the guard's original purpose (#33088/#33163): a fallback provider or a different custom endpoint still skips pool mutation.	2026-06-13 02:01:09 -07:00
brooklyn!	bdd3868b57	fix(desktop): keep profile color picker open from the context menu (#45489 ) Some checks are pending Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build and Publish / build-amd64 (push) Waiting to run Details Docker Build and Publish / build-arm64 (push) Waiting to run Details Docker Build and Publish / merge (push) Blocked by required conditions Details Lint (ruff + ty) / ruff + ty diff (push) Waiting to run Details Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run Details Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run Details Nix Lockfile Fix / auto-fix-main (push) Waiting to run Details Nix Lockfile Fix / fix (push) Waiting to run Details Nix / nix (macos-latest) (push) Waiting to run Details Nix / nix (ubuntu-latest) (push) Waiting to run Details OSV-Scanner / Scan lockfiles (push) Waiting to run Details Tests / test (1) (push) Waiting to run Details Tests / test (2) (push) Waiting to run Details Tests / test (3) (push) Waiting to run Details Tests / test (4) (push) Waiting to run Details Tests / test (5) (push) Waiting to run Details Tests / test (6) (push) Waiting to run Details Tests / save-durations (push) Blocked by required conditions Details Tests / e2e (push) Waiting to run Details Typecheck / typecheck (apps/bootstrap-installer) (push) Waiting to run Details Typecheck / typecheck (apps/desktop) (push) Waiting to run Details Typecheck / typecheck (apps/shared) (push) Waiting to run Details Typecheck / typecheck (ui-tui) (push) Waiting to run Details Typecheck / typecheck (web) (push) Waiting to run Details uv.lock check / uv lock --check (push) Waiting to run Details Right-click → Color flashed open then closed: on dismiss the context menu refocuses its trigger, which doubles as the popover anchor, so the picker read it as a focus-outside event and closed itself. Suppress the menu's close auto-focus so the picker survives. Long-press already worked since it bypasses the menu lifecycle.	2026-06-13 04:00:09 -05:00
xxxigm	b6c7ebf028	fix(tui): honor provider_routing config in the desktop/TUI backend (#44953 ) * fix(tui): honor provider_routing config in the desktop/TUI backend The messaging gateway and classic CLI both read `provider_routing` from config.yaml and pass the OpenRouter routing prefs (only / ignore / order / sort / require_parameters / data_collection) into the agent. The tui_gateway backend that powers the desktop app and TUI never did, so it built agents with every routing pref left at its default — OpenRouter then selected providers freely (effectively at random), ignoring the user's config. Load `provider_routing` in `_make_agent` and forward the same six prefs the gateway does, restoring parity across CLI / gateway / desktop. Background subagent kwargs already propagate these from the parent agent, so they now inherit correctly too. * test(tui): cover provider_routing forwarding in _make_agent Asserts the six OpenRouter routing prefs flow from config.yaml into AIAgent, and that an absent provider_routing section forwards None/False (unchanged behavior for users who never configured routing).	2026-06-13 02:58:15 -05:00
Brooklyn Nicholson	b2bc48cd5e	Merge branch 'main' into bb/desktop-stick-to-bottom # Conflicts: # apps/desktop/src/components/assistant-ui/thread.tsx	2026-06-13 02:52:03 -05:00
brooklyn!	9cd3d8a6ac	Merge pull request #45466 from NousResearch/bb/fix-image-generation-placement fix(desktop): keep generated images in the tool slot, not inline	2026-06-13 02:50:59 -05:00
Brooklyn Nicholson	b82d2e549f	fix(desktop): keep the diffusion placeholder circular at any aspect Normalise the radial bloom by the shorter side so portrait/square placeholders aren't squished into an ellipse.	2026-06-13 02:45:34 -05:00
Brooklyn Nicholson	b15dc58064	fix(desktop): keep generated images in the tool slot, not inline The image-generate tool showed a placeholder, then the model echoed a (often different) image inline in its prose — a second, jarring copy in the wrong place, dimmed as tool scaffolding, with a misplaced download button. Now the generated image lives only in the tool slot: - Strip every embedded image/media link from the assistant prose of a message that produced an image (the model frequently restates the remote URL while the result holds the local path), preserving the agent's words. Applied on hydration, live deltas, and completion. - One stable frame sized from the aspect_ratio arg up front, so the diffusion placeholder and the decoded image share the same box and crossfade with no layout shift; the box derives its height from the true ratio on load (no letterboxing). - Exempt generated images from the tool-block dim-until-hover rule. - Extract a shared useImageDownload hook + ImageLightbox so the tool image and markdown images share one implementation.	2026-06-13 02:42:15 -05:00
Brooklyn Nicholson	acd4278c8a	fix(nix): use fetchNpmDeps hash from flake check prefetch-npm-deps returned a different digest than the actual fetchNpmDeps build; use the CI-reported hash.	2026-06-13 02:34:25 -05:00
Brooklyn Nicholson	be6713c536	fix(nix): refresh npm deps hash	2026-06-13 02:16:13 -05:00
Brooklyn Nicholson	77687156b4	fix(desktop): tighten multiline user prompt spacing	2026-06-13 02:16:13 -05:00
Brooklyn Nicholson	45ceee8a32	Merge branch 'main' of github.com:NousResearch/hermes-agent into bb/desktop-stick-to-bottom	2026-06-13 02:08:10 -05:00
brooklyn!	0a7a81835b	Merge pull request #45255 from NousResearch/bb/desktop-stuck-tool-rows fix(desktop): dismiss settled tool rows (persistent, caret-safe)	2026-06-13 02:08:01 -05:00
Brooklyn Nicholson	76b93869d8	fix(desktop): rebuild thread autoscroll on use-stick-to-bottom	2026-06-13 01:57:30 -05:00
brooklyn!	a856276124	Merge pull request #45414 from NousResearch/bb/fix-desktop-queue-drain-strand fix(desktop): stop stranding queued prompts across backend bounces	2026-06-13 00:39:13 -05:00
Gille	1e755ff556	fix(desktop): keep recents sorted unless manually reordered (#45404 )	2026-06-13 00:38:10 -05:00
Brooklyn Nicholson	7f302c91b2	chore: uptick	2026-06-13 00:33:44 -05:00
Brooklyn Nicholson	18916376f1	fix(desktop): never surface "session busy" — retry every submit past it "Session busy" (4009) is the gateway's concurrency guard, not a user-facing error. The queue already covers the deliberate "type while busy" case, so the only leak was a submit racing the settle edge. Generalize the rewind path's busy-retry into a shared `withSessionBusyRetry` and wrap every `prompt.submit` (fresh send, session-resume resubmit, and rewind) so a transient busy is ridden out within a bounded deadline and the call lands silently. The fromQueue swallow stays as a backstop for the pathological >deadline case.	2026-06-13 00:26:34 -05:00
Brooklyn Nicholson	f23a4b7bb3	fix(desktop): keep queued drains quiet on transient "session busy" A queued drain firing on the settle edge can race a not-yet-wound-down turn and get a transient 4009 "session busy". Previously that appended a red "session busy" error bubble (and toast) per attempt. For fromQueue submits, swallow the busy error: release busy, keep the entry queued, and let the composer's bounded auto-drain retry on the next idle.	2026-06-13 00:23:51 -05:00
Brooklyn Nicholson	bf090deed3	fix(desktop): stop stranding queued prompts across backend bounces A prompt typed mid-turn ("ghost bubble") could stick forever and never send when the backend restarted/reconnected during the turn. Two fragile assumptions in the composer queue drain caused it: 1. Drain fired ONLY on an observed busy true→false edge. A remount/ reconnect resets `previousBusyRef` to the current busy value, so the settle edge is swallowed and the queue never drains. Replace `shouldAutoDrainOnSettle` with the edge-independent `shouldAutoDrain` (idle + non-empty), driven on the settle edge, on mount/reconnect, and after a re-key. The drain lock still serializes sends. 2. The queue is keyed by `queueSessionKey \|\| sessionId`. When a backend resume mints a new runtime session id for the same conversation, the entry strands under the dead key. Pass the stable stored id as `queueSessionKey` so the composer can tell runtime churn from a real session switch, and `migrateQueuedPrompts` re-keys pending entries on a runtime-id change only (never on a deliberate switch). Also make the drain resilient to a thrown/rejected onSubmit (e.g. a stale- session 404): the entry stays queued and is retried on the next idle, with a per-entry attempt cap (MAX_AUTO_DRAIN_ATTEMPTS) to avoid spin-loops and a quiet toast once it gives up. A manual send clears the backoff. Tests: composer-queue covers edge-free drain + re-key migration; use-prompt-actions covers rejected-drain-keeps-entry + idle retry sends.	2026-06-13 00:20:51 -05:00
brooklyn!	7d183f6497	fix(desktop): theme the image-gen placeholder instead of a white square (#45354 ) The diffusion placeholder read `--dt-*` tokens via `getComputedStyle().getPropertyValue()`, but those resolve through `var()` chains into `color-mix(in srgb, …)` — returned verbatim and unparseable, so every token fell to a hardcoded light fallback (white card). In dark mode the placeholder rendered as a white square. Resolve each token through a throwaway probe element's `color` so the browser computes it to a concrete color, and teach `parseColor` Chromium's `color(srgb r g b / a)` serialization. Re-resolve on theme repaint via a MutationObserver rather than per animation frame.	2026-06-12 21:45:24 -05:00
brooklyn!	492c402774	perf(desktop): cut GUI streaming & interaction lag (#45343 ) * perf(desktop): isolate streaming re-renders & cut layout thrash During a token stream $messages is replaced ~30x/s. Subscribing the whole chat view to it re-rendered the composer, runtime boundary, and every message on every delta. - Derive coarse facts (empty thread? tail is user?) via nanostores `computed` atoms so per-token flushes don't re-render their consumers. - Move the $messages subscription + runtime wiring into a dedicated ChatRuntimeBoundary; the composer reads $messages imperatively. - Drive message rows off stable useAuiState selectors and a lazy getMessageText getter instead of eagerly materialized text. - Feed ResizeObserver entry sizes into measureClamp / FadeText and dedupe the style writes, killing the read-write-read reflow cascade. * perf(desktop): incremental markdown rendering during streams Re-parsing the full message markdown every reveal frame is O(N^2) over a long answer and dominated stream CPU. - Throttle useSmoothReveal commits to ~1 frame (REVEAL_MIN_COMMIT_MS). - Memoize block parsing with an LRU keyed on source text so only changed blocks re-parse. - Replace Streamdown's full-text parseIncompleteMarkdown with a tail-bounded remend: scan to the last top-level boundary outside fences/math and repair only the trailing open block. New remend-tail.ts is proven render-equivalent to full remend at every streaming prefix (remend-tail.test.ts), minus an intentional, documented divergence on cross-block dangling openers. * perf(desktop): faster session resume & warm AudioContext at idle - Resume: fire the REST transcript prefetch and the session.resume RPC in parallel, and skip the redundant message conversion + reconciliation when the prefetch already hydrated the transcript. - Haptics: web-haptics builds its AudioContext lazily on first trigger, paying the ~850ms CoreAudio spin-up on the first streamStart haptic as the first token paints. Open/close a throwaway context at idle so the real one connects to an already-warm audio service.	2026-06-12 21:22:39 -05:00
Brooklyn Nicholson	d62e9b7592	build(nix): refresh npmDepsHash for the remend dependency Adding remend changed package-lock.json, so the flake's pinned npm deps hash went stale and `nix flake check` failed. Bump it to match.	2026-06-12 21:17:22 -05:00
Brooklyn Nicholson	3cf7d43262	perf(desktop): faster session resume & warm AudioContext at idle - Resume: fire the REST transcript prefetch and the session.resume RPC in parallel, and skip the redundant message conversion + reconciliation when the prefetch already hydrated the transcript. - Haptics: web-haptics builds its AudioContext lazily on first trigger, paying the ~850ms CoreAudio spin-up on the first streamStart haptic as the first token paints. Open/close a throwaway context at idle so the real one connects to an already-warm audio service.	2026-06-12 21:07:40 -05:00
Brooklyn Nicholson	edc36f3a45	perf(desktop): incremental markdown rendering during streams Re-parsing the full message markdown every reveal frame is O(N^2) over a long answer and dominated stream CPU. - Throttle useSmoothReveal commits to ~1 frame (REVEAL_MIN_COMMIT_MS). - Memoize block parsing with an LRU keyed on source text so only changed blocks re-parse. - Replace Streamdown's full-text parseIncompleteMarkdown with a tail-bounded remend: scan to the last top-level boundary outside fences/math and repair only the trailing open block. New remend-tail.ts is proven render-equivalent to full remend at every streaming prefix (remend-tail.test.ts), minus an intentional, documented divergence on cross-block dangling openers.	2026-06-12 21:07:36 -05:00
Brooklyn Nicholson	7c226cc57f	perf(desktop): isolate streaming re-renders & cut layout thrash During a token stream $messages is replaced ~30x/s. Subscribing the whole chat view to it re-rendered the composer, runtime boundary, and every message on every delta. - Derive coarse facts (empty thread? tail is user?) via nanostores `computed` atoms so per-token flushes don't re-render their consumers. - Move the $messages subscription + runtime wiring into a dedicated ChatRuntimeBoundary; the composer reads $messages imperatively. - Drive message rows off stable useAuiState selectors and a lazy getMessageText getter instead of eagerly materialized text. - Feed ResizeObserver entry sizes into measureClamp / FadeText and dedupe the style writes, killing the read-write-read reflow cascade.	2026-06-12 21:07:33 -05:00
brooklyn!	a86b7b314b	Merge pull request #45273 from NousResearch/bb/sidebar-workspace-dedup feat(desktop): worktree-aware sidebar grouping + composer/sidebar UX fixes	2026-06-12 20:03:43 -05:00
Brooklyn Nicholson	d14f6c9563	fix(desktop): stop streaming autoscroll bounce; move attachments below user bubble Streaming auto-follow chased content growth while parked at the bottom, which rubber-banded — the tail pin and the virtualizer's own measurement adjustments fought for scrollTop. Drop it; the one-time new-turn jump already lands a fresh message in view and the viewport stays put after. Attachments rendered inside the editable user bubble and were collapsed via an IntersectionObserver + [data-stuck] CSS hack while the bubble was pinned. Render them as a flow sibling BELOW the sticky bubble instead, so they scroll away behind it naturally — no observer, no collapse. Image refs still render as thumbnails, file refs as chips; no border. Removes the now-unused useStuckToTop hook and its CSS.	2026-06-12 19:58:25 -05:00
Brooklyn Nicholson	a1c6349c1f	Merge branch 'main' of github.com:NousResearch/hermes-agent into bb/sidebar-workspace-dedup	2026-06-12 19:40:24 -05:00
Brooklyn Nicholson	78ce91750e	fix(desktop): crisp terminal text via opaque xterm canvas The terminal looked soft/heavy on every platform because the xterm Terminal was built with allowTransparency: true, which drops the WebGL renderer's opaque fast-path and bakes glyphs as grayscale-alpha coverage for compositing over a see-through canvas. Our surface (--ui-bg-chrome) is opaque and withSurface already paints it, so transparency was pure blur for no benefit — VS Code keeps it off too. Also drop the Medium (500) base weight for normal/bold (400/700) to match VS Code's metrics, and remove the now-unused JetBrains Mono Medium face + woff2.	2026-06-12 19:36:30 -05:00
Brooklyn Nicholson	1a3cd3d436	refactor(desktop): collapse sidebar drag-reorder into one generic ReorderableList Every reorderable surface (repos, worktrees, sessions, pins) now drops in a single ReorderableList that owns its own DndContext, so a drag only ever collides with that list's own items — nesting "just works" without leaking into the lists around or inside it. This replaces the shared DndContext + id-prefix dispatch (parent:/group:) whose closestCenter collisions resolved to a different-typed droppable and silently no-op'd worktree/repo drags. - Delete groupDndId/parentDndId/parse* helpers and the monolithic handleAgentDragEnd/handlePinnedDragEnd; each list persists its new id order via a direct typed write (reorderParents/reorderWorktree/reorderSessions/ reorderPinned). - Sessions inside repos/worktrees are date-ordered and static (no drag), matching the "never reorder on new messages" rule. - Add setPinnedSessionOrder; drop now-unused reorderPinnedSession.	2026-06-12 18:59:54 -05:00
Teknium	9688c1a94f	chore: add Kimi K2.7 code catalog slug (#45283 )	2026-06-12 16:55:40 -07:00
Teknium	7e46533d9f	test: compressed-summary metadata flag set in-process, stripped on wire	2026-06-12 16:47:15 -07:00
kyssta-exe	956af7f3c3	fix(agent): add metadata flag to context compression summary messages (#38389 ) Summary messages (standalone insertion and merge-into-tail) now carry a metadata flag so frontends (CLI, Desktop, gateway, TUI) can distinguish them from real assistant/user messages without content-prefix heuristics. Re-applied from PR #38434 onto current main (conflicted with the _SUMMARY_END_MARKER hoist). Key renamed from the PR's 'is_compressed_summary' to '_compressed_summary': the wire sanitizers strip underscore-prefixed message keys, so the flag stays in-process and can never reach strict gateways (Fireworks/Mistral/Kimi reject unknown keys with 'Extra inputs are not permitted').	2026-06-12 16:47:15 -07:00
helix4u	1899c8f507	fix(skills): run youtube transcript helper through uv	2026-06-12 16:33:46 -07:00
Brooklyn Nicholson	dd12a5403d	refactor(desktop): extract shared WorkspaceHeader for repo + worktree rows The repo and worktree header rows were ~identical after the handle move. Fold them into one WorkspaceHeader (emphasis flag for the repo level) plus a small WorkspaceAddButton, so the toggle/handle/count/+ wiring lives in one place.	2026-06-12 18:30:49 -05:00
Teknium	8905ee6b8a	fix(agent): rewind flush cursor exactly when repair compacts before the cursor Follow-up to the #44837 clamp: a min() clamp only fixes cursor overshoot past the new end of the list. When repair_message_sequence drops/merges messages at indexes below the cursor, the clamp leaves the cursor pointing past unflushed rows and the turn-end flush silently skips them. Extract repair_message_sequence_with_cursor(): snapshot the flushed prefix by object identity before repair, then recompute the cursor as the count of surviving flushed messages. Falls back to the clamp when no snapshot is available. Keeps the safety guard in _flush_messages_to_session_db. Adds targeted tests for overshoot, before-cursor compaction, no-repair, bare-agent, and the flush guard.	2026-06-12 16:29:01 -07:00
kyssta-exe	5d0408d9fe	fix(agent): clamp flush cursor after repair_message_sequence compaction (#44837 )	2026-06-12 16:29:01 -07:00
konsisumer	aec38855b5	fix(agent): preserve recent turns during compression	2026-06-12 16:26:58 -07:00
Brooklyn Nicholson	0595af0ad1	feat(desktop): move workspace/worktree drag handle into the leading icon Mirror the session row: the repo/worktree header's leading glyph (repo mark, or a new git-branch mark for worktrees) swaps to a grabber on hover/drag instead of carrying a separate handle on the right — freeing header width for the label and + button.	2026-06-12 18:26:38 -05:00
Brooklyn Nicholson	e90672696e	feat(desktop): worktree-aware sidebar grouping + composer/sidebar UX fixes Group recents as parent-repo → worktree → sessions using local git metadata (probed over IPC, with a path-name heuristic fallback for remote backends). Single-worktree repos collapse to one level. Sessions order by creation time and never reshuffle on new messages. Also: fuse the status stack to the composer border, restore icon actions in the queue panel, fix sidebar label truncation and drag styling, hide sticky-message attachments while pinned, and bump the terminal font.	2026-06-12 18:18:39 -05:00
brooklyn!	bbf020e709	feat(desktop): follow streaming output at bottom + jump-to-bottom button (#45263 ) Strict sticky-bottom autoscroll for the chat thread: while the viewport is parked at the bottom, the tail follows content growth (streaming tokens, late measurement, Shiki re-highlight) via a useLayoutEffect keyed on the virtualizer's own size signal, pinned in the same pre-paint pass as its scrollToFn so the two never rubber-band. The gate is a single boolean — one upward pixel (scroll/wheel/touch) disarms follow until the user returns to the bottom. Adds a floating jump-to-bottom control that appears once scrolled ~10px away (above the dim threshold so a sub-pixel settle never flashes it), positioned above the composer with respect to the status stack, with a subtle scale + slide in/out animation that honours prefers-reduced-motion. The button bridges to the virtualizer's re-arm + pin path through a small nanostore emitter. Supersedes #43624.	2026-06-12 23:00:11 +00:00
Teknium	135fe90166	fix(profiles): backfill .env for pre-existing profiles on hermes update (#45247 ) Profiles created before #44792 have no .env. Now that the Channels/Keys endpoints are profile-scoped (no os.environ fallback), those profiles would show everything as unconfigured. hermes update now copies the default install's .env into each named profile that lacks one (0600, never overwrites, placeholder fallback when the root has no .env), so existing users keep the credentials they were effectively running with.	2026-06-12 15:42:14 -07:00
xxxigm	68536d4375	test(compressor): regression coverage for assistant-tail anchor + compaction rollup (#29824 ) 21 cases pinning the new ``_ensure_last_assistant_message_in_tail`` anchor and its interaction with the existing tail-cut path: * ``TestFindLastAssistantMessageIdx`` — helper contract: prefers a content-bearing assistant message, skips ``tool_calls``-only stubs, multimodal text-block content counts, falls back to "any assistant" when no content-bearing reply exists, honours ``head_end``, returns -1 when there's none. * ``TestEnsureLastAssistantMessageInTail`` — direct: no-op when already in the tail, walks ``cut_idx`` back when the reply is in the compressed middle, never crosses into the head region, re-aligns through a preceding ``tool_call`` / ``tool_result`` group instead of orphaning it. * ``TestFindTailCutByTokensAnchorsAssistant`` — integration: reporter repro (long tool-output run after the visible reply) now preserves the reply; user and assistant anchors compose in a single tail-cut call; a soft-ceiling-overrunning oversized tool result no longer strands the prior reply. * ``TestCompactionRollupReproduction`` — end-to-end through ``compress()`` with a stubbed ``_generate_summary``: the visible reply text survives either as its own standalone assistant message (normal path) or concatenated onto the merged summary tail (double-collision path the WebUI then re-splits). The standalone-summary case is asserted strictly (exactly one summary row, exactly one separate assistant row carrying the reply) — that's the dominant path and any drift there reintroduces the original bug. * ``TestSourceGuardrail`` — static asserts on ``agent/context_compressor.py``: the helper exists, the anchor is wired into ``_find_tail_cut_by_tokens`` AFTER the user-message anchor (so chaining is monotonic), the content-bearing preference is preserved, and the issue number is referenced so future bisects can find this fix.	2026-06-12 15:41:57 -07:00
xxxigm	2fef3e2df2	fix(webui): split merge-into-tail compaction so reply renders as its own bubble (#29824 ) The compressor has a "double-collision" fallback path: when the chosen ``summary_role`` collides with the first tail message AND the flipped role would collide with the last head message, it can't emit a standalone summary turn (consecutive same-role messages break Anthropic and friends). It instead prepends the summary + end-of-summary marker to the first tail message's content via ``_merge_summary_into_tail``. With the matching anchor from the previous commit, that first tail message is now usually the user's previously-visible assistant reply — so the persisted assistant turn ends up shaped as ``[CONTEXT COMPACTION ...] ... --- END OF CONTEXT SUMMARY --- ... THE ACTUAL REPLY``. Without splitting it, the session viewer renders one big "Context handoff" bubble and the reply text is buried inside the metadata blob — which is exactly the "can't see the last reply" experience #29824 reports, just one layer deeper. Added ``splitCompactionContent`` that detects the merge marker (kept in sync with ``--- END OF CONTEXT SUMMARY — respond to the message below, not the summary above ---`` in ``agent/context_compressor.py``) and ``MessageBubble`` now recurses on the two halves: the prefix half renders as the muted "Context handoff" row, the remainder half renders with the original assistant styling. Pure (non-merged) summary messages hit the no-remainder branch and still render as a single "Context handoff" row, preserving the original behaviour.	2026-06-12 15:41:57 -07:00
xxxigm	691ff7c188	fix(compressor): keep last visible assistant reply out of compaction summary + label handoffs in WebUI (#29824 ) Two-pronged fix for the WebUI "context compaction block in place of last assistant response" regression. Agent layer (the real fix). ``_find_tail_cut_by_tokens`` already had ``_ensure_last_user_message_in_tail`` to keep the most recent user request out of the compressed middle (#10896), but no symmetric anchor for the assistant side. When the conversation has an oversized recent tool result or a long stretch of tool-call/result pairs after the assistant's last visible reply, the token-budget walk can stop with the previously-visible reply on the wrong side of ``cut_idx``. The summariser then rolls it into the single ``[CONTEXT COMPACTION — REFERENCE ONLY]`` block persisted as ``role="user"`` or ``role="assistant"``, and from the operator's perspective the WebUI session viewer (``web/src/pages/SessionsPage.tsx``) and the TUI chat panel both suddenly show the opaque "Context compaction" block in the slot where they were just reading the actual answer: User: "i cant see the output of the last message you sent, i did see it previously, however now see 'context compaction'" Added ``_ensure_last_assistant_message_in_tail`` mirror of the user-side anchor. It looks for the most recent assistant message with non-empty text content (skipping tool-call-only assistant "stubs" which the UI renders as small "calling tool X" indicators rather than a readable bubble) and walks ``cut_idx`` back through the standard ``_align_boundary_backward`` so we don't split a tool_call/result group that immediately precedes it. The two anchors are chained — each only walks ``cut_idx`` backward, so the tail can only grow. Falls back to "most recent assistant of any kind" only when no content-bearing reply exists in the compressible region (fresh multi-step tool sequence with no prior reply) — in that case the agent-side fix is effectively a no-op and the existing user-message anchor carries the load. WebUI layer (clarity). Added ``isCompactionMessage`` detector that recognises the ``[CONTEXT COMPACTION — REFERENCE ONLY]`` (current) and ``[CONTEXT SUMMARY]:`` (legacy) prefixes from ``agent/context_compressor.py``, and a new ``compaction`` entry in ``MessageBubble``'s ``ROLE_STYLES`` map. Compaction blocks now render as muted, italicised system-style rows labelled ``Context handoff`` — clearly metadata, not the assistant's actual reply — so an operator scrolling back through a long session can't mistake the summary for a real answer. Keeping the detected prefixes inline (rather than importing them) because the WebUI bundle has no Python interop. A guardrail comment points readers at the source-of-truth constants in ``agent/context_compressor.py``.	2026-06-12 15:41:57 -07:00
Teknium	7a318aae22	fix(profiles): exclude session history, backups, and snapshots from --clone-all (#45246 ) --clone-all copied the source profile's state.db, sessions/, backups/, state-snapshots/, and checkpoints/ into the new profile. These are per-profile history: a 49GB copy in practice (15GB snapshots + 11GB backup archives + 16GB state.db + 6.4GB sessions), and restoring a copied backup inside the clone would resurrect the SOURCE profile's state. A clone is a fresh workspace; history stays with the source. New _CLONE_ALL_HISTORY_EXCLUDE_ROOT set, applied at root level for ANY source profile (named profiles accumulate the same artifacts), unlike the default-gated infrastructure excludes. Nested same-name dirs still copy. Docs and the post-create CLI message updated to match; profile export / hermes backup remain the full-history paths.	2026-06-12 15:41:50 -07:00

1 2 3 4 5 ...

11551 commits