hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-27 11:22:03 +00:00

Author	SHA1	Message	Date
brooklyn!	6b639bc2b9	Merge pull request #52772 from NousResearch/bb/editor feat(desktop): in-app spot editor for the file preview pane	2026-06-25 20:25:06 -05:00
brooklyn!	41f4dce828	Merge pull request #52756 from NousResearch/bb/delegate-bg-resume-ux feat(delegation): calm "will resume" affordance for background delegate_task	2026-06-25 20:08:06 -05:00
Brooklyn Nicholson	563d347e4d	feat(desktop): show a calm "will resume" notice for background delegate_task When idle with a top-level delegate_task still in flight, render a static, shimmering system-note at the transcript tail instead of a spinner (which reads as "stuck"). Reuses the shared steer / slash-status chrome (centered, 0.6875rem, muted, Codicon) so it sits in the thread like every other meta line, and mirrors the primary child's latest stream line, falling back to generic copy. i18n across en/ja/zh/zh-hant; markdown prose/heading rhythm tuned so a re-entered turn breathes.	2026-06-25 19:57:51 -05:00
Brooklyn Nicholson	6e096a850a	feat(desktop): add $backgroundResume store for parked delegate_task Track top-level delegate_task work that dispatches in the background and re-enters as a fresh turn. $backgroundResume returns {count, activity} for the active session while idle — count of parked tasks plus the primary child's latest stream line (tool/progress/thinking) when readable.	2026-06-25 19:57:45 -05:00
Brooklyn Nicholson	09623b4527	fix(desktop): make the tab modified dot amber with a separating ring Use the app's amber warn color for the unsaved-edits tab dot (was inheriting the label text color) and add a tab-bg ring + soft drop shadow so it stays legible where it overlaps the filename.	2026-06-25 19:55:31 -05:00
Brooklyn Nicholson	c456029b4e	Merge remote-tracking branch 'origin/main' into bb/editor	2026-06-25 19:53:31 -05:00
Brooklyn Nicholson	1f950e189c	feat(desktop): vertical resize for the bottom-row terminal pane Extends the pane store with heightOverride (alongside widthOverride) and a get/set/clear API, and wires the pane shell + desktop controller so the bottom-row terminal pane can be resized on the Y axis with its size persisted.	2026-06-25 19:50:29 -05:00
Brooklyn Nicholson	ff81365988	feat(desktop): in-app spot editor for the file preview pane Adds a CodeMirror 6 spot editor to the right-rail file preview so users can make quick edits in-app without leaving for an IDE. Entering edit mode is a pure in-place swap of the read view — same fixed-height header, same gutter geometry/typography (mirrors SourceView 1:1) so nothing shifts — toggled via the Edit button, a bare `e` when the pane is hovered/focused, or the tab. - Save path is transport-agnostic (writeDesktopFileText): local Electron IPC or a new hardened POST /api/fs/write-text on the dashboard server (path validation, parent-must-exist, regular-files-only, size cap, atomic temp-file + os.replace), behind the existing auth middleware. - Stale-on-disk guard re-reads before writing and offers overwrite vs discard-and-reload instead of clobbering external/agent edits. - VS Code-style modified dot on the tab; ⌘/Ctrl+S and ⌘/Ctrl+Enter save, Esc cancels; GitHub highlight style matched to the read view's Shiki theme. - Typing stays render-free (draft in a ref; dirty flips once at the boundary).	2026-06-25 19:50:25 -05:00
ethernet	df514654ba	desktop: bundle main.cjs for electron fixes simple-git not found	2026-06-25 20:05:20 -04:00
brooklyn!	55af6c447a	Merge pull request #52206 from NousResearch/bb/desktop-tools-curation fix(desktop): hide platform/internal toolsets from the Skills & Tools list	2026-06-25 18:56:04 -05:00
brooklyn!	ffa3d3c811	Merge pull request #49037 from NousResearch/bb/projects-paradigm feat(desktop): first-class projects — sidebar, coding rail, review pane, and agent project tools	2026-06-25 17:49:05 -05:00
Teknium	fd2a35b169	fix: stop reporting cache-hit rate and cost across all UI surfaces (#52717) * fix: stop reporting cache-hit rate and cost across all UI surfaces Cost estimates and cache read/write token reporting are unreliable on providers that don't surface cached_tokens (e.g. ollama-cloud, which doesn't implement prompt_tokens_details.cached_tokens), producing misleading near-zero 'cache hit' readouts and cost figures. Remove cost + cache-hit reporting from every user-facing surface; keep input/output/total token counts (provider-agnostic and accurate) and the Nous account billing UI (real account money, separate from per-conversation estimates). Surfaces: - CLI /usage + model-info: drop cost lines + cache read/write token lines - Gateway /usage + /model: drop cost + cache lines - tui_gateway/server.py: stop emitting cost_usd / cache_read in usage and subagent.complete payloads - TUI (Ink): drop cost from status bar (+ showCost plumbing), /usage panel, thinking rollup, agents overlay (incl. compare view); keep token counts - Desktop Command Center: drop cost stat, per-model cost, actual-cost hint Underlying estimate_usage_cost / format_cost / insights cost columns are left intact but no longer surfaced (display-only change, reversible). * test: update TUI + gateway + CLI tests for removed cost/cache-hit reporting - CLI /usage test asserts cost/cache lines are absent, tokens present - gateway /usage test drops cost + cache asserts; removes cost-included test - TUI subagentTree summary expectation drops the cost segment - useConfigSync + appChrome status-rule tests drop showCost prop/state	2026-06-25 15:21:22 -07:00
Brooklyn Nicholson	19ca295a84	fix(desktop): clarify branch convert actions Open checked-out branches, switch the primary checkout for the default branch, and create linked worktrees only for non-trunk free branches.	2026-06-25 17:19:36 -05:00
Gille	e7d2f0b93c	fix(windows): suppress console flashes and harden gateway restarts	2026-06-25 14:42:38 -07:00
Brooklyn Nicholson	890e890281	chore(desktop): update package lock	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	a391523bcc	i18n(desktop): add project and worktree strings	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	b8d220f268	feat(desktop): wire project settings and shell chrome	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	62af32efe7	feat(desktop): keep active sessions aligned with cwd	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	68680db10d	feat(desktop): add Codex-style review pane	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	7a7f9a5b3d	feat(desktop): add composer coding rail and worktree flow	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	488ae376db	feat(desktop): render backend-authoritative projects sidebar	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	74352a1e61	feat(desktop): add project and coding stores	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	344415892f	feat(desktop): add shared project UI primitives	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	e2b8018729	feat(desktop): add git worktree and review IPC	2026-06-25 16:40:27 -05:00
Brooklyn Nicholson	4cdd1a3230	feat(sessions): record git workspace metadata	2026-06-25 16:40:26 -05:00
brooklyn!	c4ba4770eb	Merge pull request #52704 from NousResearch/bb/desktop-root-boundary-recover fix(desktop): recover root error boundary from transient render races (salvage #41787)	2026-06-25 16:17:18 -05:00
Brooklyn Nicholson	2e3efce66e	fix(desktop): recover the root error boundary from transient render races A stale-index render race in assistant-ui (a just-shrunk thread rendered at an old message index during a session switch / teardown) throws errors like "tapClientLookup: Index N out of bounds", "Cannot read properties of undefined (reading 'type')", or "Tried to unmount a fiber that is already unmounted". These bubble to the root ErrorBoundary and latch the WHOLE desktop app on the "Reload window" fallback even though the next render against fresh state would be fine. Teach the root boundary to treat that small set of known-transient renderer errors as recoverable: log them and schedule a next-tick reset() so React re-renders against current state instead of stranding the user on the fallback. Auto-recovery is BOUNDED -- at most MAX_RECOVERIES (3) attempts within a 5s window -- so a genuinely persistent error can't spin the boundary in a reset -> throw -> reset loop; after the budget is spent the fallback is left up for the user. Manual retry (the button) resets the budget. Only the root boundary auto-recovers; scoped boundaries keep their own fallbacks, and unrecognized errors are never swallowed. Tests: transient race recovers (fallback never sticks), a persistent recoverable error stops at the cap and surfaces the fallback (proving the loop is bounded), and neither a non-root boundary nor an unrecognized root error auto-recovers. Closes #41693. Supersedes #41787 by @izumi0uu, reimplemented with a bounded recovery budget so a non-transient error can't loop forever. Co-authored-by: izumi0uu <izumi0uu@gmail.com>	2026-06-25 16:15:20 -05:00
Brooklyn Nicholson	f7bf740640	fix(desktop): reject cross-wired runtime-id cache on session resume resumeSession's warm-cache fast-path trusted the storedSessionId -> runtimeId -> ClientSessionState mapping without checking the cached state still BELONGS to the session being resumed. A pooled profile backend that gets idle-reaped and respawned (pruneSecondaryGateways) re-mints runtime ids, so a recycled id can resolve to a live-but-DIFFERENT session's cache entry. The only existing guard was a session.usage 404 -- that catches a fully-dead runtime id, but a recycled id still 200s, so the fast-path happily painted the wrong transcript under the current route (open chat A, chat B loads). Fold the belongs-to check into a single takeWarmCache() helper used at BOTH cache reads -- the early transcript-keep decision and the fast-path itself -- so a cross-wired entry can't even briefly flash a stale transcript before the full resume repaints. On a mismatch the helper purges both stale map entries and reports a miss, falling through to a full resume that rebinds a correct runtime id. The full-resume path already guards its final paint with isCurrentResume(), so only the cached fast-path was missing the belongs-to check. Pre-existing bug from the initial desktop app (#20059); not introduced by the session-switch perf work (#49807), which left these lines untouched. Tests: two cases in use-session-actions.test.tsx driven through a harness that owns the two cache maps -- a cross-wired mapping is rejected + purged (the bug), and a correctly-wired cache still serves from memory with no needless refetch (no perf regression). Supersedes #50464 by @professorpalmer, reimplemented to also guard the early transcript-keep read (whole-class fix, not just the fast-path). Co-authored-by: professorpalmer <professorpalmer@users.noreply.github.com>	2026-06-25 16:11:18 -05:00
Teknium	c6575df927	feat(moa): expose MoA presets as selectable virtual models (#46081) * feat(moa): expose MoA presets as selectable virtual models Reconstructed onto current main (PR #46081's base had diverged with no common ancestor, marking the PR dirty so CI never dispatched). MoA is now a virtual provider: each named preset is a selectable model under provider 'moa', and the preset's aggregator is the acting model that answers and calls tools. Reference models fan out in parallel via a bounded ThreadPoolExecutor (the same batch pattern delegate_task uses) — all references dispatched at once, collected when every one finishes, then handed to the aggregator. Output order is preserved, failures and the MoA-recursion guard stay isolated per reference. - Removed the old mixture_of_agents model tool and moa toolset. - Added moa as a virtual provider in the provider/model inventory. - /moa is shortcut behavior over model selection (default preset / named preset / one-shot prompt). - Dashboard + Desktop manage named presets; presets appear in model pickers. - Parallel reference fan-out in agent/moa_loop.py with regression test. * fix(moa): thread moa_config through _run_agent to _run_agent_inner The reconstructed gateway MoA wiring declared moa_config on _run_agent (the profile-scoping wrapper) and used it inside _run_agent_inner, but the wrapper never forwarded it — _run_agent_inner had no such parameter, so the runtime hit NameError: name 'moa_config' is not defined on the compression-failure session sync path. Add moa_config to _run_agent_inner's signature and forward it from both wrapper call sites (multiplex and non-multiplex). Caught by tests/gateway/test_compression_failure_session_sync.py on CI shard test(4).
* fix(moa): classify moa as a virtual provider in the catalog The moa virtual provider has no PROVIDER_REGISTRY/ProviderProfile entry, so provider_catalog() fell through to the default auth_type="api_key" with no env vars — tripping two catalog invariants: - test_provider_catalog: api_key providers must expose a credential env var - test_provider_parity: every hermes-model provider must be desktop-configurable moa already declares auth_type="virtual" in HERMES_OVERLAYS; consult that overlay as an auth_type fallback so the catalog reports moa as virtual (no real credential, no network endpoint). Exempt virtual providers from the desktop parity union check the same way 'custom' is exempt — derived from the catalog, not a hardcoded slug, so future virtual providers are covered too.	2026-06-25 13:52:06 -07:00
brooklyn!	edf35918be	Merge pull request #52620 from NousResearch/bb/desktop-session-switch-perf	2026-06-25 14:19:59 -05:00
David Metcalfe	da73223f4a	fix(desktop): show statusbar item tooltips on hover Statusbar items declared a 'title' string (e.g. YOLO, gateway health, agents, cron, version, context usage) that was populated by use-statusbar-items.tsx but never forwarded to the rendered DOM in StatusbarControls — so every statusbar button/menu/text/link had no hover hint. Wrap the four render branches (menu trigger, text, link, action) in the existing 'Tip' component from components/ui/tooltip.tsx. Tip is self-contained (carries its own Provider), instant (delayDuration=0), themed (bg-foreground/text-background, auto-inverts per theme), and already in use elsewhere in the desktop shell. Renders the child untouched when label is falsy, so items without a title stay zero-cost.	2026-06-25 12:11:17 -07:00
Brooklyn Nicholson	1ca1f9f2c7	refactor(tui_gateway): DRY the deferred-session paths Collapse the duplicated cold-resume / lazy-watch / create scaffolding into shared helpers: _deferred_session_record (the live-session dict minus the agent), _lazy_resume_info (the not-yet-built session.info), _claim_or_reuse_live (lock + double-checked register-or-reuse), and _schedule_agent_build (the pre-warm timer). Net -12 lines, three copies of the ~30-key session dict and the lazy-info block down to one each. No behavior change.	2026-06-25 14:03:03 -05:00
Brooklyn Nicholson	3bf00e459a	perf(desktop): make deferred resume the default, not an opt-in flag Per review: gating the faster path behind a `defer_build` flag that the only caller always sends is pointless. Flip it — `session.resume` now defers the agent build by default for every caller (desktop + Ink TUI); a caller that needs the agent built synchronously passes `eager_build: true` (used by the build-race test). The desktop no longer sends a flag. While verifying the flip, fixed two real parity gaps the deferred path had vs the old eager (`_init_session`) path: - `_enable_gateway_prompts()` was never called on a deferred resume, so approvals/clarify wouldn't route through the gateway prompt callbacks. - `_start_agent_build` never wired `background_review_callback` / `memory_notifications`, so a deferred-built session's self-improvement "💾 …" summary leaked to stdout instead of rendering in-transcript. Wiring it there also fixes it for `session.create` sessions, which build through the same path. ACP is unaffected (it uses its own session_manager, not this RPC); the Ink TUI already consumes the same lazy `info` shape from session.create and upgrades on the later `session.info` event.	2026-06-25 14:03:03 -05:00
Brooklyn Nicholson	c4c590e4a1	perf(desktop): make session switching fast under load Switching sessions in the desktop app could freeze the whole UI for several seconds on heavy, tool-rich chats. Root causes and fixes: - Cold `session.resume` built the AIAgent (MCP discovery, prompt/skill build) before returning, and the desktop awaits that RPC before it paints — so the entire switch blocked on the build. Add an opt-in `defer_build` resume path (the contract `session.create` already uses): return the full display transcript immediately, register an upgradable live session, and pre-warm the agent on a short timer. The persisted runtime identity (model/provider/base_url/api_mode/reasoning/tier) is restored on the deferred build so it can't drop the provider. - Nothing bounded how many in-memory agents accumulate; a user who reconnects often piled up detached sessions for the full 6h TTL. Add a soft LRU cap (`max_live_sessions`, default 16) that evicts the least-recently-active DETACHED sessions (no live client) — never a running, awaiting-input, mid-build, or live-transport one. Reopening re-resumes from disk. - On the prefetch-hit cold-resume path, skip rebuilding a throwaway merged-message array (and its 1000-entry Map) when the prefetch already painted the exact transcript; the downstream sameMessageList guard already drops the publish, so it was pure main-thread cost. The desktop opts into `defer_build` for every non-watch cold resume; the eager path stays for CLI/TUI and existing callers.	2026-06-25 14:03:03 -05:00
Brooklyn Nicholson	6b3ea2cea6	refactor(pets): tighten remix comments and confirm handler	2026-06-25 01:10:56 -05:00
Brooklyn Nicholson	5196575d40	feat(pets): remix a draft into a fresh round Add a hover/focus "Remix" action on each completed draft card in the generation grid. It re-runs generation with the chosen draft fed back in as the reference image, keeping the same prompt and staying on step 2 so the user can explore variations without starting over. Because regenerating is slow and replaces the current drafts, the first remix shows a one-time confirmation; the acknowledgement is persisted so subsequent remixes fire immediately.	2026-06-25 01:09:19 -05:00
Brooklyn Nicholson	f3d6d9bbd3	fix(ui): share compact tool previews across clients Move terminal/execute_code/read_file preview compaction into agent.display so CLI, gateway, and Ink TUI all inherit the same labels that desktop introduced in #52321. The shared preview keeps raw args intact while trimming display-only shell plumbing (`cd`, pipe tails, banner/status echoes) and read_file line ranges. Desktop now prefers backend `context` for live rows and keeps its TypeScript fallback only for hydrated history.	2026-06-25 00:47:14 -05:00
Brooklyn Nicholson	25c31cab62	fix(pets): soften step-1 ETA copy to "several minutes" The fixed "up to 5 minutes" wording undersells the slow quality-first path (OpenAI image via OpenRouter), where a full hatch can run far longer. Use an open-ended "several minutes" instead so the banner stays honest across the fast and slow providers.	2026-06-25 00:35:54 -05:00
Brooklyn Nicholson	7078d9d1e2	fix(pets): raise generation timeouts for the slow quality-first model path The quality-first default (OpenAI image via OpenRouter) is slow, and a full hatch fans out ~8 rows with up to 3 retries each (300s/call) across 2 parallel waves, so the absolute backend worst case is ~30 min. The old ceilings fired mid-run: - per-image HTTP call: 180s -> 300s (a single cold row can exceed 3 min) - drafts RPC: 240s -> 420s (single wave, no retries — 7 min is ample) - hatch RPC: 420s -> 1hr (sits above the ~30 min backend worst case) The hatch ceiling is intentionally well above the realistic max so the frontend never throws "request timed out" before the backend has exhausted its own retries. The background-resumable notification path remains the real UX safety net — the user can close the modal and get pinged on completion.	2026-06-25 00:34:52 -05:00
Brooklyn Nicholson	41f302fa73	fix(desktop): compact tool row titles Make completed desktop tool rows read like useful activity labels instead of raw plumbing: terminal rows use a dispatch-style shell summarizer for agent wrappers, and read_file rows keep the action plus filename and requested line range. The shell cleanup follows condensed-milk-pi's shape: split command compounds on real separators, strip pipe tails inside each segment, clean redirects/env prefixes, then classify setup/banner/status segments. Multi-command probes render as `first command + N commands`; the full command remains available in copy/detail. Read rows now render as `Read package.json` or `Read main.ts L25-34`, using requested positive offset/limit and returned line numbers only as fallback for negative/unknown offsets.	2026-06-25 00:01:11 -05:00
brooklyn!	0c442fa1d3	Merge pull request #52303 from NousResearch/bb/pets-gen-qa feat(pets): quality-first OpenRouter chain, stronger atlas gates, global pet-gen notifications	2026-06-24 23:16:40 -05:00
Brooklyn Nicholson	e92b5c6af8	feat(pets): quality-first OpenRouter model chain + stronger atlas gates + global pet-gen notifications OpenRouter/Nous image gen now runs a quality-first model chain by default: attempt the highest-fidelity OpenAI image model first, then fall back to Gemini 3 Pro Image when it's access-gated/unavailable/times out. An explicit OPENROUTER_IMAGE_MODEL / config model override pins one model with no fallback. Atlas validation rejects malformed model output instead of shipping it: adds a per-state collapse guard (a single sliver/fragment row no longer passes because other rows are healthy), on top of the existing postage-stamp + multi-pose checks. Desktop: pet-gen native notifications are now "global" (not tied to a chat session), so a background generation started from the command center fires an OS notification when the user is away even with no active session. Adds a neutral "This can take up to 5 minutes." banner on step 1, and lets the provider picker auto-size. Tests updated/added for the OpenRouter fallback chain, the collapse guard, and the global notification path.	2026-06-24 23:11:21 -05:00
Brooklyn Nicholson	281b333cc5	test(desktop): cover localized tool title shimmer	2026-06-24 21:59:41 -05:00
Brooklyn Nicholson	f2c45e2c81	fix(desktop): limit pending tool shimmer to action verb Localize tool titles and split pending rows so only the action segment shimmers — paths, commands, and URLs stay static.	2026-06-24 21:59:41 -05:00
brooklyn!	cbe5c5689f	perf(desktop): bound tool-result rendering so big /learn runs don't freeze (#52273) ToolFallback rebuilt the `part` wrapper every render, defeating the buildToolView memo and re-running a full JSON.stringify of the result on every ~33ms stream delta. A /learn over a large directory (many ~100KB tool results) saturated the renderer main thread (hang/throttle) and spiked memory until it OOMd (crash). - Re-derive a stable `part` from the referentially-stable args/result so the view/copy memos hold across deltas. - Clamp every inline-painted payload (detail, stdout/stderr, rawResult, technical trace) to MAX_TOOL_RENDER_CHARS; the row's Copy button still reads the uncapped view.detail for the full output.	2026-06-25 02:52:51 +00:00
xxxigm	4aeaba6922	test(desktop): cover undefined/null attachment holes in ref helpers Regression for the refText crash: attachmentDisplayText and optimisticAttachmentRef must return null (not throw) when handed an undefined/null attachment hole, so the submit path can't reproduce "Cannot read properties of undefined (reading 'refText')".	2026-06-24 18:22:01 -07:00
xxxigm	7e2db0a140	fix(desktop): stop refText crash on undefined composer attachment holes A session switch or draft restore can leave undefined/null holes in the composer attachments array. AttachmentList was guarded against this in #49624, but the sibling submit path was not: submitPromptText maps the same array through attachmentDisplayText/optimisticAttachmentRef and buildContextText (a.kind / a.label / a.refText), so a hole threw "Cannot read properties of undefined (reading 'refText')" — an uncaught renderer error that blanks the chat pane and shows "Desktop app link offline". Close the whole bug class: - attachmentDisplayText / optimisticAttachmentRef no-op on a falsy attachment (shared chokepoint, also protects thread.tsx drop handler). - submitPromptText filters falsy entries from the source array, and buildContextText filters its (possibly post-sync) input before reading fields.	2026-06-24 18:22:01 -07:00
Gille	284be6cc24	Merge pull request #52210 from helix4u/fix/desktop-update-progress-visibility fix(desktop): surface update progress lines	2026-06-24 19:45:05 -05:00
brooklyn!	7157b213f5	Merge pull request #47959 from NousResearch/bb/pets-gen Pet generation: frame-perfect hatch flow, backend picker, CPU-safe chroma, and CI-hardening	2026-06-24 19:41:34 -05:00
brooklyn!	153ad79524	Merge pull request #52201 from NousResearch/bb/desktop-shallow-update-count fix(desktop): don't report a bogus update count for a shallow checkout	2026-06-24 19:34:02 -05:00
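
Commit ff81365988 above describes the hardened write path as path validation, parent-must-exist, regular-files-only, a size cap, and an atomic temp-file + os.replace. A minimal sketch of that sequence follows. It is an illustration under assumptions, not the repository's code: the name `write_text_atomic` and the `MAX_BYTES` value are hypothetical, and the log does not state the real limit.

```python
import os
import tempfile
from pathlib import Path

# Hypothetical size cap; the commit log does not give the real limit.
MAX_BYTES = 2 * 1024 * 1024


def write_text_atomic(path: str, text: str, max_bytes: int = MAX_BYTES) -> None:
    """Write text to path through a sibling temp file, then os.replace it in."""
    target = Path(path)
    data = text.encode("utf-8")

    # Size cap: refuse oversized payloads before touching the disk.
    if len(data) > max_bytes:
        raise ValueError("payload exceeds size cap")
    # Parent must already exist; this sketch never creates directories.
    if not target.parent.is_dir():
        raise FileNotFoundError("parent directory must exist")
    # Regular files only: refuse to replace a directory or special file.
    if target.exists() and not target.is_file():
        raise ValueError("only regular files may be written")

    # The temp file lives in the same directory so os.replace stays on one
    # filesystem, which is what makes the final rename atomic.
    fd, tmp_name = tempfile.mkstemp(dir=target.parent, prefix=".tmp-")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())
        os.replace(tmp_name, target)
    except BaseException:
        # Clean up the temp file if anything failed before the rename.
        try:
            os.unlink(tmp_name)
        except FileNotFoundError:
            pass
        raise
```

A reader of the target path sees either the old contents or the new ones, never a partial write, because the rename is the only step that touches the target.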


590 commits
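
Commit c6575df927 describes the MoA reference fan-out as a bounded ThreadPoolExecutor that dispatches all references at once, collects them when every one finishes, preserves output order, and keeps failures isolated per reference. The sketch below shows one way that pattern can look. The function `fan_out`, its parameters, and the `(ok, text)` result shape are assumptions made for illustration; the actual code in agent/moa_loop.py is not shown in this log.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence, Tuple


def fan_out(
    prompt: str,
    references: Sequence[Callable[[str], str]],
    max_workers: int = 4,  # hypothetical bound on parallelism
) -> List[Tuple[bool, str]]:
    """Run every reference on the prompt in parallel; results keep input order."""

    def run_one(ref: Callable[[str], str]) -> Tuple[bool, str]:
        # Catch per-reference failures so one bad reference cannot abort the batch.
        try:
            return (True, ref(prompt))
        except Exception as exc:
            return (False, f"{type(exc).__name__}: {exc}")

    if not references:
        return []
    workers = min(max_workers, len(references))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map yields results in submission order, not completion order.
        return list(pool.map(run_one, references))
```

The ordered list could then be handed to an aggregator step, which sees a failed reference as a `(False, message)` entry rather than as a raised exception.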