hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-27 11:22:03 +00:00

Author	SHA1	Message	Date
liuhao1024	404b06ac4f	fix(gateway): honor server retry_after in _send_with_retry for Telegram flood control (#46762) When Telegram's sendRichMessage returns a FloodWait/RetryAfter error, _try_send_rich() now extracts the server-provided retry_after value and propagates it through SendResult.retry_after. The base _send_with_retry() layer honors this value instead of using its default short exponential backoff (~2s, ~4s), preventing the retry budget from being exhausted against a server that demands a 25-37s wait. Salvaged from #46774 by @liuhao1024. Telegram adapter path moved from gateway/platforms/telegram.py to plugins/platforms/telegram/adapter.py since the original PR. Closes #46762	2026-06-25 02:43:47 +05:30
kshitij	cedbb4cfa2	Merge pull request #52140 from NousResearch/salvage/47707-tool-schema-validation fix(agent): validate context/memory tool schemas before wrapping (#47707)	2026-06-25 02:36:19 +05:30
kshitij	085096fd59	Merge pull request #52135 from NousResearch/salvage/51826-tirith-mkdtemp-oerror fix(tools): catch mkdtemp OSError in tirith install (#51826)	2026-06-25 02:35:27 +05:30
kshitij	7d2c1f3f84	Merge pull request #52134 from NousResearch/salvage/42449-deepcopy-ctx-engine fix(agent): deepcopy plugin context engine to prevent parent corruption on delegate_task (#42449)	2026-06-25 02:28:37 +05:30
Bartok9	710cd48fb1	fix(agent): validate context/memory tool schemas before wrapping Closes #47707 Context engines and memory providers expose tool schemas via get_tool_schemas(). agent_init.py wrapped each as {"type":"function","function":_schema} without validating that _schema carries a top-level name. A provider returning an entry already in OpenAI tool form ({"type":"function","function":{...}}) was then double-wrapped into a tool whose function has no name. Strict providers (e.g. DeepSeek) reject the entire request with HTTP 400 'tools[N].function: missing field name', so one malformed schema silently disables the whole toolset and breaks every turn. The schema was also never added to valid_tool_names, so even lenient providers could not call it. Add a shared normalize_tool_schema() helper that unwraps an already-wrapped entry and returns None for anything lacking a resolvable string name. Wire it into the agent_init context-engine loop and all three memory_manager surfaces (inject_memory_provider_tools, add_provider routing index, get_all_tool_schemas), so a single bad plugin schema is skipped with a warning instead of poisoning the request. Verification: 209 targeted agent/memory tests pass (incl. 9 new). New tests assert the unwrap + skip-nameless behavior and fail without the fix.	2026-06-25 02:17:29 +05:30
liuhao1024	dbf0797335	fix(tools): catch mkdtemp OSError in tirith install to prevent unbounded retry and temp-dir leak (#51826) When tempfile.mkdtemp() raises OSError (e.g. disk full), the exception propagated past the try/finally block, so _mark_install_failed() was never called. The 24h backoff marker never engaged, causing unbounded retry on every command -- each attempt leaked a tirith-install-* temp directory, eventually filling /tmp completely. Fix: wrap mkdtemp in its own try/except OSError, returning (None, "no_space") so the caller's normal failure path (including _mark_install_failed) executes. Salvaged from #51831 by @liuhao1024. Closes #51826	2026-06-25 02:13:56 +05:30
liuhao1024	8d1f6debfd	fix(agent): deepcopy plugin context engine to prevent parent corruption on delegate_task (#42449 ) When delegate_task spawns a child agent with a different model/provider, the child's init_agent loaded the plugin context-engine GLOBAL singleton by reference (`_selected_engine = _candidate`) and then called update_model() on it with the child's (smaller) context_length. Because parent and child shared the same object, this mutated the PARENT's compressor: e.g. DeepSeek 1M ctx silently dropped to 204800 and the compression threshold from 200K to 40K after any delegate_task with a different model. Deepcopy the singleton before assigning/mutating it (agent_init.py) so the child gets its own instance and the parent's compressor is untouched. Salvaged from #42452 by @liuhao1024 (authorship preserved). Added a source-pin regression test that fails if the production line reverts to the bare alias, plus an end-to-end test driving get_plugin_context_engine() and a StubEngine.update_model() — the original PR's tests exercised copy.deepcopy in isolation but did not guard the actual agent_init code path. Closes #42449. Supersedes #42469, #42474 (same one-line fix, no test).	2026-06-25 02:13:26 +05:30
kshitij	77d2b50751	Merge pull request #52118 from NousResearch/salvage/36776-ddgs-timeout fix(ddgs): bound DuckDuckGo search with a wall-clock timeout (#36776)	2026-06-25 01:56:26 +05:30
kshitij	4d589b1e13	Merge pull request #52121 from NousResearch/salvage/43466-strip-cronjob-toolset fix(delegate): strip cronjob toolset from delegated children (#43466)	2026-06-25 01:54:37 +05:30
uzunkuyruk	489b85ee1e	fix(ddgs): bound DuckDuckGo search with a wall-clock timeout (#36776) A single ddgs (DuckDuckGo) search could hang indefinitely and block the shared agent loop, and therefore every platform (CLI, Telegram, Matrix...). The DDGS constructor's timeout only bounds individual HTTP requests; ddgs's multi-engine retry loop has no overall cap, so a slow/rate-limited response could spin for 20+ minutes with no output and no error. Run the synchronous ddgs call in a single-worker ThreadPoolExecutor and cap it with future.result(timeout=_SEARCH_TIMEOUT_SECS), where _SEARCH_TIMEOUT_SECS is 30. On timeout, return a clear failure ("DuckDuckGo search timed out ... try a different provider") instead of blocking; the pool is shut down with cancel_futures so a hung worker is never awaited. Salvaged from #37422 by @uzunkuyruk (authorship preserved). Re-applied on current main (the PR's provider.py base had diverged). Added a load-bearing timeout regression test, mutation-verified to fail without the cap (the original PR only updated the fake's constructor and had no timeout-behavior test). Closes #36776.	2026-06-25 01:45:06 +05:30
kshitijk4poor	e25b56fc64	chore: AUTHOR_MAP entry for riyas22 (PR #43687 salvage)	2026-06-25 01:39:11 +05:30
Riyasudeen Farook	1e4df599ec	fix(delegate): strip cronjob toolset from delegated children (#43466) _strip_blocked_tools used a hardcoded set missing 'cronjob'. Children on gateway platforms could inherit the cronjob toolset, scheduling persistent jobs that outlive the delegation despite DELEGATE_BLOCKED_TOOLS. Fix: derive the strip set from DELEGATE_BLOCKED_TOOLS at runtime so the two lists can never drift. Add 'cronjob' to DELEGATE_BLOCKED_TOOLS for documentation consistency. Two regression tests lock the invariant. Salvaged from #43687 by @riyas22. Adapted test to current main (no 'messaging' toolset exists -- send_message is intentionally not registered as an agent tool). Closes #43466	2026-06-25 01:37:25 +05:30
kshitij	7a79a4447c	Merge pull request #52116 from NousResearch/fix/46994-session-load-bool-iterable fix(gateway): skip non-dict entries in session loading (#46994)	2026-06-25 01:33:36 +05:30
kshitij	8f0a12ce09	Merge pull request #52114 from NousResearch/salvage/27405-preflight-fewbig fix(agent): trigger preflight compression on few-but-huge sessions (#27405)	2026-06-25 01:27:07 +05:30
kshitijk4poor	9c994377ed	fix(gateway): skip non-dict entries in session loading (#46994) Corrupted sessions.json entries (e.g. a bare bool where a dict is expected) caused a TypeError on `'origin' in data`, which escaped the (ValueError, KeyError) inner except and aborted loading ALL remaining sessions, not just the corrupted one. Two-layer fix: (1) loop level: isinstance(entry_data, dict) guard before from_dict; (2) from_dict: isinstance(data['origin'], dict) instead of bare truthiness. TypeError was also added to the inner except as defense-in-depth. Closes #46994	2026-06-25 01:26:13 +05:30
texhy	aacc6bb0a8	fix(agent): trigger preflight compression on few-but-huge sessions (#27405 ) The preflight-compression gate only ran the (expensive) token estimate when the message COUNT exceeded protect_first_n + protect_last_n + 1. A session with a handful of very large messages never tripped the count condition, so compression was never attempted and the turn eventually hit a hard context-overflow error. Add _should_run_preflight_estimate() with OR semantics: run the estimate when either the message count exceeds the protected ranges (the historical gate) OR a cheap char-based estimate already crosses the configured threshold. The downstream estimate_request_tokens_rough() stays authoritative — this is only a hint that decides whether to pay for the full estimate. Salvaged from #27435 by @texhy (authorship preserved). Re-applied on current main: the preflight gate moved from conversation_loop.py to turn_context.py since the PR was opened, so the helper + gate are placed there; the test imports the real MINIMUM_CONTEXT_LENGTH instead of a hardcoded literal. Closes #27405.	2026-06-25 01:20:23 +05:30
kshitij	ed1fdb5b61	Merge pull request #52112 from NousResearch/revert/52053-minimum-context-floor revert(plugins): revert minimum context floor configurable (#52053)	2026-06-25 01:11:53 +05:30
kshitijk4poor	e0272cfef2	Revert "fix(compression): make minimum context floor configurable (#31600)" This reverts commit `cae1ee44a7`.	2026-06-25 01:04:44 +05:30
kshitij	59acaa972f	Merge pull request #52053 from NousResearch/salvage/31600-minimum-context-length-configurable fix(compression): make minimum context floor configurable (#31600)	2026-06-25 01:02:52 +05:30
kshitij	6800fd6608	Merge pull request #52091 from NousResearch/salvage/42874-memory-drift-guard-add fix(memory): skip drift guard for add (append-only) action (#42874)	2026-06-25 00:58:39 +05:30
Tranquil-Flow	cae1ee44a7	fix(compression): make minimum context floor configurable (#31600 ) Add compression.minimum_context_floor config key that allows users to lower the compression threshold floor below the hardcoded 64K default, preventing infinite tool-call loops on models whose structured output degrades well before 64K tokens. - agent/model_metadata.py: add get_configurable_minimum_context() helper with 16K hard safety limit - agent/context_compressor.py: accept minimum_context_floor param, thread it through _compute_threshold_tokens - agent/conversation_compression.py: use compressor's floor for aux model context validation - agent/agent_init.py: read compression.minimum_context_floor from config and pass to ContextCompressor - gateway/run.py: cache-busting includes new key Salvaged from #31686 by @Tranquil-Flow onto current main. Resolves conflicts with in-place compaction (#38763) and max_tokens threshold computation (#43547) that landed after the original PR. Closes #31600	2026-06-25 00:56:04 +05:30
liuhao1024	25e2312230	fix(memory): skip drift guard for add (append-only) action (#42874) The drift guard (introduced for #26045) correctly protects replace/remove from clobbering un-roundtrippable content, but it also fires on the add path. Since add only appends and never overwrites, the guard is unnecessary and causes false positives when prior add() calls in the same session shift the byte count of the on-disk file. Add skip_drift parameter to _reload_target() and pass True from add(). Replace/remove continue to use the drift guard unchanged. Salvaged from #42880 by @liuhao1024. Closes #42874	2026-06-25 00:51:12 +05:30
Jeffrey Quesnelle	b13e2fd694	Merge pull request #52044 from NousResearch/fix/install-venv-kill-venv-processes fix(install): kill venv-resident gateway before recreating venv on Windows	2026-06-24 15:16:58 -04:00
kshitij	9214aa7dde	Merge pull request #52090 from NousResearch/salvage/35994-reset-deadlock fix(gateway): offload agent cleanup off the event loop in /new reset (#35994)	2026-06-25 00:34:21 +05:30
kshitijk4poor	0225480369	fix(gateway): offload agent cleanup off the event loop in /new reset (#35994 ) The /new (and /reset) confirmation-button callback runs the slash-confirm handler on the asyncio event loop (see _request_slash_confirm). That handler calls _handle_reset_command, which invoked the SYNCHRONOUS, potentially long-blocking _cleanup_agent_resources inline: agent.close() tears down terminal sandboxes, browser daemons and background processes (subprocess waits), and shutdown_memory_provider() can make a network call. A slow teardown wedged the entire event loop, so the bot went silent and stopped processing all messages until a manual restart. Offload _cleanup_agent_resources via the existing contextvar-preserving _run_in_executor_with_context helper, bounded by asyncio.wait_for with a named _RESET_CLEANUP_TIMEOUT_S (30s). The loop is never blocked; on timeout the reset proceeds and the worker thread is left to finish on its own (it cannot be cancelled). The text /new path is unaffected (already off-loop). Tests (tests/gateway/test_35994_reset_button_deadlock.py): the loop keeps ticking while close() blocks in its worker thread; a cleanup that raises is swallowed (warning logged) and the reset still rotates the session; a cleanup that times out degrades gracefully. All three are mutation-verified to fail without their respective production branch.	2026-06-25 00:27:22 +05:30
kshitij	de281bcebc	Merge pull request #52084 from NousResearch/salvage/31884-silent-drop-after-stop fix(gateway): surface retry hint instead of silently dropping turn after /stop (#31884)	2026-06-25 00:06:32 +05:30
kshitij	5b065e32ed	Merge pull request #51051 from NousResearch/salvage/cron-provider-pin fix(cron): fail closed when an unpinned job provider drifts from creation snapshot (#44585)	2026-06-25 00:05:52 +05:30
brooklyn!	a130b62678	Merge pull request #52086 from NousResearch/bb/salvage-desktop-window-state feat(desktop): remember window size/position/maximized across launches (salvage #39154)	2026-06-24 13:35:46 -05:00
Brooklyn Nicholson	2de7549fe0	feat(desktop): remember window size/position/maximized across launches (salvage #39154 ) The desktop window opened at a hardcoded 1220×800 every launch, discarding whatever size and position the user left it at (#39101) — on macOS the dock reopen was the most visible case, but every restart reset it. A small window-state.json under userData (same pattern as connection.json / updates.json) records the window's normal bounds plus its maximized flag, written debounced on resize/move/maximize and flushed on close, applied on the next createWindow(). getNormalBounds() captures the pre-maximize size so an un-maximize next session lands where the user actually sized it. Restore is defensive: sanitize rejects garbage, drops off-screen positions (window falls back to Electron centering), and caps a size saved on a since-disconnected larger monitor to the largest current display. The geometry math lives in a side-effect-free window-state.cjs so it unit-tests with node --test, no Electron boot. No new dependency. Salvages #39154 by @jeffrobodie-glitch — same userData approach and validation intent, reimplemented tighter and folded into one module. Co-authored-by: jeffrobodie-glitch <jeffrobodie@gmail.com>	2026-06-24 13:32:05 -05:00
sweetcornna	b41d9b845d	fix(gateway): surface retry hint instead of silently dropping turn after /stop (#31884 ) After /stop, the next user message can hit a stale generation token and return with api_calls=0, no failure, no interruption. _normalize_empty_agent_response fell through to an empty string, so the gateway logged "response=0 chars" and sent nothing — the message was silently lost while internal work sometimes continued. Add the api_calls==0 / not-failed / not-interrupted / not-partial branch to the single normalization chokepoint so the user gets a short retry hint instead of silence. Regression test asserts the hint surfaces. Salvaged from #33851 (re-applied on current main; original was 1401 commits behind and the function had moved).	2026-06-24 23:51:31 +05:30
brooklyn!	35e9c63d89	Merge pull request #52008 from infinitycrew39/fix/desktop-nous-onboarding-stale-provider fix(desktop): stop Nous Portal onboarding from validating stale Anthropic config	2026-06-24 13:12:44 -05:00
emozilla	6638199c53	fix(install): harden venv-resident process sweep on Windows Follow-up to the salvaged venv-recreate fix. Three changes to the Install-Venv pre-delete sweep: - Match the venv path with a case-insensitive StartsWith instead of the PowerShell -like operator. A venv path containing wildcard metacharacters ('[', ']') — legal in a Windows user name — silently fails to match under -like, which would let the locking process slip through and reintroduce the exact access-denied failure this fix closes. - Retry Remove-Item once after a short pause. A force-killed process can take a moment to release its file handles, so the first delete may still hit a locked .pyd; retry before failing the stage. - Note in a comment that the gateway autostart task runs at LIMITED integrity as the current user, so the installer always runs at equal-or-higher integrity and can read the process executable path, and that Get-CimInstance is preferred over Get-Process because it returns a null path for an uninspectable process instead of throwing. Adds a regression test asserting the recreate branch sweeps by venv path prefix, uses StartsWith rather than -like, and runs the sweep before Remove-Item. Covers issues #47036, #47557, #47910.	2026-06-24 13:25:44 -04:00
Dana Moverman	7e55b934ea	fix(install): kill gateway running from venv before recreating it (Windows) The Windows venv-recreate guard only runs `taskkill /IM hermes.exe`, but the gateway that a scheduled task or watchdog autostarts runs as `pythonw.exe -m hermes_cli.main gateway run` straight out of venv\Scripts\. Its image name is python/pythonw, so taskkill never matches it; it keeps the venv's native extensions (e.g. tornado\speedups.pyd) loaded, and the following Remove-Item fails with "Access to the path is denied" -- aborting boot at the venv stage so the desktop app never loads. Additionally stop any process whose executable lives under this venv, matched by path so the image name is irrelevant and a global/system python outside the venv is never touched. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-24 13:22:36 -04:00
infinitycrew39	d8fe1c0b41	test(desktop): cover scoped onboarding runtime readiness checks Assert setup.runtime_check honors provider params and that Nous OAuth onboarding persists model config before validating the connected provider.	2026-06-24 23:19:51 +07:00
infinitycrew39	6da615c77c	fix(desktop): scope onboarding runtime check to connected provider Let setup.runtime_check accept an optional provider, persist the selected provider/model before the gate, and validate the provider the user just connected instead of a stale config entry such as anthropic.	2026-06-24 23:19:45 +07:00
Teknium	9259d1e5da	chore(desktop): sync package-lock version to apps/desktop 0.17.0 The apps/desktop workspace was bumped to 0.17.0 in apps/desktop/package.json but package-lock.json still recorded 0.15.1, so npm install reports the lock as out of date and rewrites it on every fresh install. Regenerate the lock (npm install --package-lock-only) to record the current 0.17.0; one-line change, no dependency resolution churn.	2026-06-24 07:50:30 -07:00
kshitij	c42d44cb2f	revert(plugins): restore user dashboard plugin backend API auto-import (#43719) (#51950) * Revert "refactor(security): centralize non-bundled plugin sources in one constant" This reverts commit `e2bea0abe6`. * Revert "fix(security): restrict dashboard plugin backend import to bundled plugins (#43719)" This reverts commit `8845f3316c`.	2026-06-24 07:46:54 -07:00
kshitij	7fb2027d85	Merge pull request #51881 from NousResearch/fix/29559-compression-abort-on-network-failure fix(compression): abort + preserve context on transient network summary failure (#29559, #25585)	2026-06-24 19:54:21 +05:30
kshitij	f477f892b3	Merge pull request #51043 from NousResearch/salvage/tui-config-destruction fix(tui): preserve config on model switch — atomic writes + custom-provider guard (#48305)	2026-06-24 19:42:56 +05:30
kshitijk4poor	fce2af780f	chore(release): add Elshayib to AUTHOR_MAP (PR #48351)	2026-06-24 19:34:33 +05:30
Elshayib	1a435a6d5d	fix(model-switch): prevent custom-provider misattribution in model picker (#48305) When the current provider is a custom endpoint (custom or custom:), the model switch pipeline must NOT auto-switch to a native provider/OpenRouter based on a static-catalog match. The user explicitly configured their own endpoint and the same model name may be served there; silently rewriting model.provider destroys their config. - detect_static_provider_for_model(): skip the static-catalog scan when the current provider is custom/custom: - switch_model() Step e: extend is_custom to cover custom:* so the detect_provider_for_model() last-resort fallback cannot fire Salvaged from #48351 by Elshayib (authorship preserved). Fixes #48305	2026-06-24 19:34:33 +05:30
kyssta-exe	b85c460540	fix(tui): targeted save_config_value for model persistence (#48305) The TUI model-switch persistence (_persist_model_switch) rewrote the entire model config block via save_config(), destroying sibling keys the user set under model: (model_slots, model_fallback, base_url, ...) on every switch. Use targeted, atomic, comment-preserving save_config_value("model.default" / "model.provider" / "model.base_url") writes instead, so a model switch only touches the keys it changes. Salvaged from #48391 by kyssta-exe (authorship preserved). Fixes #48305	2026-06-24 19:34:33 +05:30
kshitij	2187fd884c	Merge pull request #51027 from NousResearch/salvage/typed-model-routing fix(model_switch): route typed configured models off openai-codex (#45006)	2026-06-24 19:32:35 +05:30
kshitijk4poor	1a174dfb50	fix(models): gate openai-codex/xai-oauth soft-accept to family-shaped slugs (#45006 ) Completes the #45006 fix. PR-base commit (configured-provider routing) handles the case where a typed model IS declared in user/custom provider config. This commit closes the other root: when a typed model is NOT in any config and the current provider is a soft-accepting one (openai-codex / xai-oauth), the hidden-model soft-accept (#16172 / #19729) would accept ANY unknown name as a hidden model — so `qwen3.5-4b` typed on a Codex-default session "succeeded" and mislabeled the provider as "OpenAI Codex" (the exact reported symptom), then 400'd on the next turn. Gate the soft-accept to slugs that plausibly belong to the provider's family (openai-codex -> gpt-/codex-/o1/o3/o4; xai-oauth -> grok-). Family-shaped unknown slugs are still soft-accepted (preserving the #16172 entitlement-gated hidden-model intent); unrelated names are rejected with actionable guidance to pin the right provider via `--provider <slug>` or the picker. Adds TestCodexSoftAcceptPlausibilityGate (5 tests): unrelated names rejected on codex/xai, family-shaped hidden slugs still accepted, real catalog models unaffected. Verified load-bearing.	2026-06-24 19:23:53 +05:30
kshitij	ae20c3fb90	Merge pull request #51025 from NousResearch/salvage/cron-autoreset-override fix(gateway): consume was_auto_reset so /model survives session auto-reset (#48031)	2026-06-24 19:20:11 +05:30
x7peeps	6879d77d74	fix(gateway): consume was_auto_reset so /model survives session auto-reset When `/model X` is the FIRST message after an idle/daily/suspended auto-reset, the slash-command path stores a session model override but leaves `session_entry.was_auto_reset = True` (it never passes through `_handle_message_with_agent`, which is where the flag was consumed). On the NEXT regular message, the auto-reset cleanup block pops the freshly-stored model/reasoning override BEFORE the flag is consumed — so the switch is silently lost and resolution falls back to the config default, while the session DB still shows the switched model (a two-sources-of-truth divergence). Consume the flag at both sites: 1. gateway/run.py — capture `was_auto_reset` into a local and set the attribute False immediately at the top of the cleanup block, so the cleanup can't re-fire on a later message and wipe an override stored between turns. Downstream reads use the captured local. 2. gateway/slash_commands.py — the model path consumes the flag before storing the override, so a /model-first-after-auto-reset isn't wiped by the next message's cleanup. Salvaged from #48062 by x7peeps (authorship preserved). Tests: tests/gateway/test_48031_model_switch_after_auto_reset.py — AST invariants pinning both consume sites (load-bearing; verified they fail when either consume is removed). Mirrors the AST-pin approach in test_35809_auto_reset_clean_context.py. Gateway session/reset suite: 16 passed. Fixes #48031	2026-06-24 19:12:44 +05:30
kshitij	d68a133458	Merge pull request #51890 from NousResearch/salvage/40695-handoff-watcher-async fix(gateway): offload handoff-watcher SQLite calls to avoid blocking the async heartbeat (#40695)	2026-06-24 19:10:52 +05:30
kshitij	7634488074	Merge pull request #51889 from NousResearch/salvage/41289-model-cmd-async fix(gateway): offload Discord /model provider-listing off the event loop (#41289)	2026-06-24 19:06:23 +05:30
kshitij	4f521a5382	Merge pull request #51898 from kshitijk4poor/salvage/openviking-recall-48927 feat(openviking): add full recall prefetch policy (salvage #48927)	2026-06-24 19:01:15 +05:30
kshitijk4poor	ab9134bf16	feat(openviking): add full recall prefetch policy Salvage of PR #48927 by @ehz0ah, which consolidates OpenViking recall work from #41706 (@huangxun375-stack), #33260, #49975, and #32444. Replaces stale background post-turn prefetch warming with synchronous current-query recall. The old queue_prefetch warmed the PREVIOUS user message while turn-start recall consumed the CURRENT one, so injected context was always about the wrong topic. Changes: - prefetch() now does session-aware /api/v1/search/search with the current query, falls back to /api/v1/search/find on failure - Contract-safe payloads: limit, score_threshold, context_type, session_id — no top_k, no search-body mode, no target_uri - L2 content reads for items with level=2 or empty abstracts, capped at full_read_limit (default 2) - Local ranking (score + query-token overlap + leaf boost), dedup, score threshold, and injected-char budget - queue_prefetch() is now a no-op (background warming removed) - Additive batched viking_read: uris param accepts up to 3 URIs - Per-request timeout support on _VikingClient.get/post/delete - Removes stale _prefetch_result/_prefetch_thread/_prefetch_generation state and _invalidate_prefetch_state() - Strengthened system_prompt_block guidance Salvage follow-up fixes: - Expose all 8 recall config knobs in get_config_schema() (PR #48927 had removed them; #41706 correctly exposed them). Env vars remain as internal mechanism but are now visible in setup wizard. - Lower default timeout 8s→4s, request_timeout 6s→3s, full_read_limit 3→2 to reduce per-turn blocking latency. Co-authored-by: Hao Zhe <haozhe4547@gmail.com> Co-authored-by: Eurekaxun <eurekaxun@163.com>	2026-06-24 18:53:49 +05:30
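
Note on 489b85ee1e (ddgs wall-clock timeout): the commit message describes running the blocking search in a single-worker ThreadPoolExecutor and capping it with future.result(timeout=...). A minimal sketch of that pattern follows. The function name bounded_search, the search_fn parameter and the returned dict shape are assumptions made for illustration; only _SEARCH_TIMEOUT_SECS and the value 30 come from the log.

```python
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout

_SEARCH_TIMEOUT_SECS = 30  # name and value taken from the commit message


def bounded_search(search_fn, query, timeout=_SEARCH_TIMEOUT_SECS):
    """Run a blocking search in one worker thread and cap the total wait.

    search_fn is a hypothetical stand-in for the synchronous ddgs call.
    """
    pool = ThreadPoolExecutor(max_workers=1)
    try:
        future = pool.submit(search_fn, query)
        return {"ok": True, "results": future.result(timeout=timeout)}
    except FutureTimeout:
        # Report a clear failure instead of blocking the caller.
        return {"ok": False, "error": "DuckDuckGo search timed out"}
    finally:
        # wait=False: do not block on a hung worker.
        # cancel_futures=True (Python 3.9+): drop work that has not started.
        pool.shutdown(wait=False, cancel_futures=True)
```

The worker thread itself cannot be killed; the sketch only guarantees that the caller stops waiting once the timeout elapses.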
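
Note on 710cd48fb1 (tool schema validation): the message says normalize_tool_schema() unwraps an entry that is already in OpenAI tool form and returns None for anything without a resolvable string name. The sketch below is one way such a helper could look. Its body is an assumption, not the repository's code, and wrap_tools is a hypothetical caller added to show the skip behavior.

```python
def normalize_tool_schema(schema):
    """Return a bare function schema with a string name, or None."""
    if not isinstance(schema, dict):
        return None
    # Unwrap an entry already shaped like {"type": "function", "function": {...}}.
    if schema.get("type") == "function" and isinstance(schema.get("function"), dict):
        schema = schema["function"]
    name = schema.get("name")
    if not isinstance(name, str) or not name:
        return None
    return schema


def wrap_tools(raw_schemas):
    """Wrap each valid schema once; skip entries that cannot be named."""
    tools = []
    for raw in raw_schemas:
        schema = normalize_tool_schema(raw)
        if schema is None:
            continue  # one bad plugin schema must not poison the whole request
        tools.append({"type": "function", "function": schema})
    return tools
```

The real fix also logs a warning for skipped entries, which this sketch leaves out.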
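
Note on 9c994377ed (non-dict entries in session loading): the fix guards at the loop level, inside from_dict, and in the except clause. A minimal sketch of those three guards follows. The shape of a session entry (session_id, origin) and the function names load_sessions and from_dict as written here are assumptions for illustration only.

```python
def from_dict(data):
    """Hypothetical entry parser: tolerate a non-dict 'origin' value."""
    origin = data.get("origin")
    return {
        "session_id": data["session_id"],  # KeyError if the field is missing
        "origin": origin if isinstance(origin, dict) else None,
    }


def load_sessions(raw):
    """Build session entries from a parsed sessions.json mapping."""
    sessions = {}
    for key, entry_data in raw.items():
        if not isinstance(entry_data, dict):
            continue  # e.g. a bare bool where a dict is expected
        try:
            sessions[key] = from_dict(entry_data)
        except (ValueError, KeyError, TypeError):
            continue  # drop only the corrupted entry, keep loading the rest
    return sessions
```

The key property is that one corrupted entry is skipped on its own, and the remaining entries still load.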
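
Note on aacc6bb0a8 (preflight compression gate): the message describes OR semantics, where the full token estimate runs when the message count exceeds the protected ranges or when a cheap char-based estimate already crosses the threshold. A sketch under stated assumptions follows: the signature, the message shape, and the 4-characters-per-token heuristic are illustrative guesses, not values from the repository.

```python
def should_run_preflight_estimate(
    messages,
    protect_first_n,
    protect_last_n,
    threshold_tokens,
    chars_per_token=4,  # assumed rough heuristic, not taken from the source
):
    """Decide whether to pay for the full token estimate."""
    # Historical gate: enough messages to have something to compress.
    if len(messages) > protect_first_n + protect_last_n + 1:
        return True
    # New gate: a handful of very large messages can still overflow.
    total_chars = sum(len(str(m.get("content", ""))) for m in messages)
    return total_chars // chars_per_token >= threshold_tokens
```

As in the commit, this is only a hint; the full estimate stays authoritative for the actual compression decision.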


12783 commits