feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623)

* feat(memory): add pluggable memory provider interface with profile isolation Introduces a pluggable MemoryProvider ABC so external memory backends can integrate with Hermes without modifying core files. Each backend becomes a plugin implementing a standard interface, orchestrated by MemoryManager. Key architecture: - agent/memory_provider.py — ABC with core + optional lifecycle hooks - agent/memory_manager.py — single integration point in the agent loop - agent/builtin_memory_provider.py — wraps existing MEMORY.md/USER.md Profile isolation fixes applied to all 6 shipped plugins: - Cognitive Memory: use get_hermes_home() instead of raw env var - Hindsight Memory: check $HERMES_HOME/hindsight/config.json first, fall back to legacy ~/.hindsight/ for backward compat - Hermes Memory Store: replace hardcoded ~/.hermes paths with get_hermes_home() for config loading and DB path defaults - Mem0 Memory: use get_hermes_home() instead of raw env var - RetainDB Memory: auto-derive profile-scoped project name from hermes_home path (hermes-<profile>), explicit env var overrides - OpenViking Memory: read-only, no local state, isolation via .env MemoryManager.initialize_all() now injects hermes_home into kwargs so every provider can resolve profile-scoped storage without importing get_hermes_home() themselves. Plugin system: adds register_memory_provider() to PluginContext and get_plugin_memory_providers() accessor. Based on PR #3825. 46 tests (37 unit + 5 E2E + 4 plugin registration). * refactor(memory): drop cognitive plugin, rewrite OpenViking as full provider Remove cognitive-memory plugin (#727) — core mechanics are broken: decay runs 24x too fast (hourly not daily), prefetch uses row ID as timestamp, search limited by importance not similarity. Rewrite openviking-memory plugin from a read-only search wrapper into a full bidirectional memory provider using the complete OpenViking session lifecycle API: - sync_turn: records user/assistant messages to OpenViking session (threaded, non-blocking) - on_session_end: commits session to trigger automatic memory extraction into 6 categories (profile, preferences, entities, events, cases, patterns) - prefetch: background semantic search via find() endpoint - on_memory_write: mirrors built-in memory writes to the session - is_available: checks env var only, no network calls (ABC compliance) Tools expanded from 3 to 5: - viking_search: semantic search with mode/scope/limit - viking_read: tiered content (abstract ~100tok / overview ~2k / full) - viking_browse: filesystem-style navigation (list/tree/stat) - viking_remember: explicit memory storage via session - viking_add_resource: ingest URLs/docs into knowledge base Uses direct HTTP via httpx (no openviking SDK dependency needed). Response truncation on viking_read to prevent context flooding. * fix(memory): harden Mem0 plugin — thread safety, non-blocking sync, circuit breaker - Remove redundant mem0_context tool (identical to mem0_search with rerank=true, top_k=5 — wastes a tool slot and confuses the model) - Thread sync_turn so it's non-blocking — Mem0's server-side LLM extraction can take 5-10s, was stalling the agent after every turn - Add threading.Lock around _get_client() for thread-safe lazy init (prefetch and sync threads could race on first client creation) - Add circuit breaker: after 5 consecutive API failures, pause calls for 120s instead of hammering a down server every turn. Auto-resets after cooldown. Logs a warning when tripped. - Track success/failure in prefetch, sync_turn, and all tool calls - Wait for previous sync to finish before starting a new one (prevents unbounded thread accumulation on rapid turns) - Clean up shutdown to join both prefetch and sync threads * fix(memory): enforce single external memory provider limit MemoryManager now rejects a second non-builtin provider with a warning. Built-in memory (MEMORY.md/USER.md) is always accepted. Only ONE external plugin provider is allowed at a time. This prevents tool schema bloat (some providers add 3-5 tools each) and conflicting memory backends. The warning message directs users to configure memory.provider in config.yaml to select which provider to activate. Updated all 47 tests to use builtin + one external pattern instead of multiple externals. Added test_second_external_rejected to verify the enforcement. * feat(memory): add ByteRover memory provider plugin Implements the ByteRover integration (from PR #3499 by hieuntg81) as a MemoryProvider plugin instead of direct run_agent.py modifications. ByteRover provides persistent memory via the brv CLI — a hierarchical knowledge tree with tiered retrieval (fuzzy text then LLM-driven search). Local-first with optional cloud sync. Plugin capabilities: - prefetch: background brv query for relevant context - sync_turn: curate conversation turns (threaded, non-blocking) - on_memory_write: mirror built-in memory writes to brv - on_pre_compress: extract insights before context compression Tools (3): - brv_query: search the knowledge tree - brv_curate: store facts/decisions/patterns - brv_status: check CLI version and context tree state Profile isolation: working directory at $HERMES_HOME/byterover/ (scoped per profile). Binary resolution cached with thread-safe double-checked locking. All write operations threaded to avoid blocking the agent (curate can take 120s with LLM processing). * fix(memory): thread remaining sync_turns, fix holographic, add config key Plugin fixes: - Hindsight: thread sync_turn (was blocking up to 30s via _run_in_thread) - RetainDB: thread sync_turn (was blocking on HTTP POST) - Both: shutdown now joins sync threads alongside prefetch threads Holographic retrieval fixes: - reason(): removed dead intersection_key computation (bundled but never used in scoring). Now reuses pre-computed entity_residuals directly, moved role_content encoding outside the inner loop. - contradict(): added _MAX_CONTRADICT_FACTS=500 scaling guard. Above 500 facts, only checks the most recently updated ones to avoid O(n^2) explosion (~125K comparisons at 500 is acceptable). Config: - Added memory.provider key to DEFAULT_CONFIG ("" = builtin only). No version bump needed (deep_merge handles new keys automatically). * feat(memory): extract Honcho as a MemoryProvider plugin Creates plugins/honcho-memory/ as a thin adapter over the existing honcho_integration/ package. All 4 Honcho tools (profile, search, context, conclude) move from the normal tool registry to the MemoryProvider interface. The plugin delegates all work to HonchoSessionManager — no Honcho logic is reimplemented. It uses the existing config chain: $HERMES_HOME/honcho.json -> ~/.honcho/config.json -> env vars. Lifecycle hooks: - initialize: creates HonchoSessionManager via existing client factory - prefetch: background dialectic query - sync_turn: records messages + flushes to API (threaded) - on_memory_write: mirrors user profile writes as conclusions - on_session_end: flushes all pending messages This is a prerequisite for the MemoryManager wiring in run_agent.py. Once wired, Honcho goes through the same provider interface as all other memory plugins, and the scattered Honcho code in run_agent.py can be consolidated into the single MemoryManager integration point. * feat(memory): wire MemoryManager into run_agent.py Adds 8 integration points for the external memory provider plugin, all purely additive (zero existing code modified): 1. Init (~L1130): Create MemoryManager, find matching plugin provider from memory.provider config, initialize with session context 2. Tool injection (~L1160): Append provider tool schemas to self.tools and self.valid_tool_names after memory_manager init 3. System prompt (~L2705): Add external provider's system_prompt_block alongside existing MEMORY.md/USER.md blocks 4. Tool routing (~L5362): Route provider tool calls through memory_manager.handle_tool_call() before the catchall handler 5. Memory write bridge (~L5353): Notify external provider via on_memory_write() when the built-in memory tool writes 6. Pre-compress (~L5233): Call on_pre_compress() before context compression discards messages 7. Prefetch (~L6421): Inject provider prefetch results into the current-turn user message (same pattern as Honcho turn context) 8. Turn sync + session end (~L8161, ~L8172): sync_all() after each completed turn, queue_prefetch_all() for next turn, on_session_end() + shutdown_all() at conversation end All hooks are wrapped in try/except — a failing provider never breaks the agent. The existing memory system, Honcho integration, and all other code paths are completely untouched. Full suite: 7222 passed, 4 pre-existing failures. * refactor(memory): remove legacy Honcho integration from core Extracts all Honcho-specific code from run_agent.py, model_tools.py, toolsets.py, and gateway/run.py. Honcho is now exclusively available as a memory provider plugin (plugins/honcho-memory/). Removed from run_agent.py (-457 lines): - Honcho init block (session manager creation, activation, config) - 8 Honcho methods: _honcho_should_activate, _strip_honcho_tools, _activate_honcho, _register_honcho_exit_hook, _queue_honcho_prefetch, _honcho_prefetch, _honcho_save_user_observation, _honcho_sync - _inject_honcho_turn_context module-level function - Honcho system prompt block (tool descriptions, CLI commands) - Honcho context injection in api_messages building - Honcho params from __init__ (honcho_session_key, honcho_manager, honcho_config) - HONCHO_TOOL_NAMES constant - All honcho-specific tool dispatch forwarding Removed from other files: - model_tools.py: honcho_tools import, honcho params from handle_function_call - toolsets.py: honcho toolset definition, honcho tools from core tools list - gateway/run.py: honcho params from AIAgent constructor calls Removed tests (-339 lines): - 9 Honcho-specific test methods from test_run_agent.py - TestHonchoAtexitFlush class from test_exit_cleanup_interrupt.py Restored two regex constants (_SURROGATE_RE, _BUDGET_WARNING_RE) that were accidentally removed during the honcho function extraction. The honcho_integration/ package is kept intact — the plugin delegates to it. tools/honcho_tools.py registry entries are now dead code (import commented out in model_tools.py) but the file is preserved for reference. Full suite: 7207 passed, 4 pre-existing failures. Zero regressions. * refactor(memory): restructure plugins, add CLI, clean gateway, migration notice Plugin restructure: - Move all memory plugins from plugins/<name>-memory/ to plugins/memory/<name>/ (byterover, hindsight, holographic, honcho, mem0, openviking, retaindb) - New plugins/memory/__init__.py discovery module that scans the directory directly, loading providers by name without the general plugin system - run_agent.py uses load_memory_provider() instead of get_plugin_memory_providers() CLI wiring: - hermes memory setup — interactive curses picker + config wizard - hermes memory status — show active provider, config, availability - hermes memory off — disable external provider (built-in only) - hermes honcho — now shows migration notice pointing to hermes memory setup Gateway cleanup: - Remove _get_or_create_gateway_honcho (already removed in prev commit) - Remove _shutdown_gateway_honcho and _shutdown_all_gateway_honcho methods - Remove all calls to shutdown methods (4 call sites) - Remove _honcho_managers/_honcho_configs dict references Dead code removal: - Delete tools/honcho_tools.py (279 lines, import was already commented out) - Delete tests/gateway/test_honcho_lifecycle.py (131 lines, tested removed methods) - Remove if False placeholder from run_agent.py Migration: - Honcho migration notice on startup: detects existing honcho.json or ~/.honcho/config.json, prints guidance to run hermes memory setup. Only fires when memory.provider is not set and not in quiet mode. Full suite: 7203 passed, 4 pre-existing failures. Zero regressions. * feat(memory): standardize plugin config + add per-plugin documentation Config architecture: - Add save_config(values, hermes_home) to MemoryProvider ABC - Honcho: writes to $HERMES_HOME/honcho.json (SDK native) - Mem0: writes to $HERMES_HOME/mem0.json - Hindsight: writes to $HERMES_HOME/hindsight/config.json - Holographic: writes to config.yaml under plugins.hermes-memory-store - OpenViking/RetainDB/ByteRover: env-var only (default no-op) Setup wizard (hermes memory setup): - Now calls provider.save_config() for non-secret config - Secrets still go to .env via env vars - Only memory.provider activation key goes to config.yaml Documentation: - README.md for each of the 7 providers in plugins/memory/<name>/ - Requirements, setup (wizard + manual), config reference, tools table - Consistent format across all providers The contract for new memory plugins: - get_config_schema() declares all fields (REQUIRED) - save_config() writes native config (REQUIRED if not env-var-only) - Secrets use env_var field in schema, written to .env by wizard - README.md in the plugin directory * docs: add memory providers user guide + developer guide New pages: - user-guide/features/memory-providers.md — comprehensive guide covering all 7 shipped providers (Honcho, OpenViking, Mem0, Hindsight, Holographic, RetainDB, ByteRover). Each with setup, config, tools, cost, and unique features. Includes comparison table and profile isolation notes. - developer-guide/memory-provider-plugin.md — how to build a new memory provider plugin. Covers ABC, required methods, config schema, save_config, threading contract, profile isolation, testing. Updated pages: - user-guide/features/memory.md — replaced Honcho section with link to new Memory Providers page - user-guide/features/honcho.md — replaced with migration redirect to the new Memory Providers page - sidebars.ts — added both new pages to navigation * fix(memory): auto-migrate Honcho users to memory provider plugin When honcho.json or ~/.honcho/config.json exists but memory.provider is not set, automatically set memory.provider: honcho in config.yaml and activate the plugin. The plugin reads the same config files, so all data and credentials are preserved. Zero user action needed. Persists the migration to config.yaml so it only fires once. Prints a one-line confirmation in non-quiet mode. * fix(memory): only auto-migrate Honcho when enabled + credentialed Check HonchoClientConfig.enabled AND (api_key OR base_url) before auto-migrating — not just file existence. Prevents false activation for users who disabled Honcho, stopped using it (config lingers), or have ~/.honcho/ from a different tool. * feat(memory): auto-install pip dependencies during hermes memory setup Reads pip_dependencies from plugin.yaml, checks which are missing, installs them via pip before config walkthrough. Also shows install guidance for external_dependencies (e.g. brv CLI for ByteRover). Updated all 7 plugin.yaml files with pip_dependencies: - honcho: honcho-ai - mem0: mem0ai - openviking: httpx - hindsight: hindsight-client - holographic: (none) - retaindb: requests - byterover: (external_dependencies for brv CLI) * fix: remove remaining Honcho crash risks from cli.py and gateway cli.py: removed Honcho session re-mapping block (would crash importing deleted tools/honcho_tools.py), Honcho flush on compress, Honcho session display on startup, Honcho shutdown on exit, honcho_session_key AIAgent param. gateway/run.py: removed honcho_session_key params from helper methods, sync_honcho param, _honcho.shutdown() block. tests: fixed test_cron_session_with_honcho_key_skipped (was passing removed honcho_key param to _flush_memories_for_session). * fix: include plugins/ in pyproject.toml package list Without this, plugins/memory/ wouldn't be included in non-editable installs. Hermes always runs from the repo checkout so this is belt- and-suspenders, but prevents breakage if the install method changes. * fix(memory): correct pip-to-import name mapping for dep checks The heuristic dep.replace('-', '_') fails for packages where the pip name differs from the import name: honcho-ai→honcho, mem0ai→mem0, hindsight-client→hindsight_client. Added explicit mapping table so hermes memory setup doesn't try to reinstall already-installed packages. * chore: remove dead code from old plugin memory registration path - hermes_cli/plugins.py: removed register_memory_provider(), _memory_providers list, get_plugin_memory_providers() — memory providers now use plugins/memory/ discovery, not the general plugin system - hermes_cli/main.py: stripped 74 lines of dead honcho argparse subparsers (setup, status, sessions, map, peer, mode, tokens, identity, migrate) — kept only the migration redirect - agent/memory_provider.py: updated docstring to reflect new registration path - tests: replaced TestPluginMemoryProviderRegistration with TestPluginMemoryDiscovery that tests the actual plugins/memory/ discovery system. Added 3 new tests (discover, load, nonexistent). * chore: delete dead honcho_integration/cli.py and its tests cli.py (794 lines) was the old 'hermes honcho' command handler — nobody calls it since cmd_honcho was replaced with a migration redirect. Deleted tests that imported from removed code: - tests/honcho_integration/test_cli.py (tested _resolve_api_key) - tests/honcho_integration/test_config_isolation.py (tested CLI config paths) - tests/tools/test_honcho_tools.py (tested the deleted tools/honcho_tools.py) Remaining honcho_integration/ files (actively used by the plugin): - client.py (445 lines) — config loading, SDK client creation - session.py (991 lines) — session management, queries, flush * refactor: move honcho_integration/ into the honcho plugin Moves client.py (445 lines) and session.py (991 lines) from the top-level honcho_integration/ package into plugins/memory/honcho/. No Honcho code remains in the main codebase. - plugins/memory/honcho/client.py — config loading, SDK client creation - plugins/memory/honcho/session.py — session management, queries, flush - Updated all imports: run_agent.py (auto-migration), hermes_cli/doctor.py, plugin __init__.py, session.py cross-import, all tests - Removed honcho_integration/ package and pyproject.toml entry - Renamed tests/honcho_integration/ → tests/honcho_plugin/ * docs: update architecture + gateway-internals for memory provider system - architecture.md: replaced honcho_integration/ with plugins/memory/ - gateway-internals.md: replaced Honcho-specific session routing and flush lifecycle docs with generic memory provider interface docs * fix: update stale mock path for resolve_active_host after honcho plugin migration * fix(memory): address review feedback — P0 lifecycle, ABC contract, honcho CLI restore Review feedback from Honcho devs (erosika): P0 — Provider lifecycle: - Remove on_session_end() + shutdown_all() from run_conversation() tail (was killing providers after every turn in multi-turn sessions) - Add shutdown_memory_provider() method on AIAgent for callers - Wire shutdown into CLI atexit, reset_conversation, gateway stop/expiry Bug fixes: - Remove sync_honcho=False kwarg from /btw callsites (TypeError crash) - Fix doctor.py references to dead 'hermes honcho setup' command - Cache prefetch_all() before tool loop (was re-calling every iteration) ABC contract hardening (all backwards-compatible): - Add session_id kwarg to prefetch/sync_turn/queue_prefetch - Make on_pre_compress() return str (provider insights in compression) - Add **kwargs to on_turn_start() for runtime context - Add on_delegation() hook for parent-side subagent observation - Document agent_context/agent_identity/agent_workspace kwargs on initialize() (prevents cron corruption, enables profile scoping) - Fix docstring: single external provider, not multiple Honcho CLI restoration: - Add plugins/memory/honcho/cli.py (from main's honcho_integration/cli.py with imports adapted to plugin path) - Restore full hermes honcho command with all subcommands (status, peer, mode, tokens, identity, enable/disable, sync, peers, --target-profile) - Restore auto-clone on profile creation + sync on hermes update - hermes honcho setup now redirects to hermes memory setup * fix(memory): wire on_delegation, skip_memory for cron/flush, fix ByteRover return type - Wire on_delegation() in delegate_tool.py — parent's memory provider is notified with task+result after each subagent completes - Add skip_memory=True to cron scheduler (prevents cron system prompts from corrupting user representations — closes #4052) - Add skip_memory=True to gateway flush agent (throwaway agent shouldn't activate memory provider) - Fix ByteRover on_pre_compress() return type: None -> str * fix(honcho): port profile isolation fixes from PR #4632 Ports 5 bug fixes found during profile testing (erosika's PR #4632): 1. 3-tier config resolution — resolve_config_path() now checks $HERMES_HOME/honcho.json → ~/.hermes/honcho.json → ~/.honcho/config.json (non-default profiles couldn't find shared host blocks) 2. Thread host=_host_key() through from_global_config() in cmd_setup, cmd_status, cmd_identity (--target-profile was being ignored) 3. Use bare profile name as aiPeer (not host key with dots) — Honcho's peer ID pattern is ^[a-zA-Z0-9_-]+$, dots are invalid 4. Wrap add_peers() in try/except — was fatal on new AI peers, killed all message uploads for the session 5. Gate Honcho clone behind --clone/--clone-all on profile create (bare create should be blank-slate) Also: sanitize assistant_peer_id via _sanitize_id() * fix(tests): add module cleanup fixture to test_cli_provider_resolution test_cli_provider_resolution._import_cli() wipes tools.*, cli, and run_agent from sys.modules to force fresh imports, but had no cleanup. This poisoned all subsequent tests on the same xdist worker — mocks targeting tools.file_tools, tools.send_message_tool, etc. patched the NEW module object while already-imported functions still referenced the OLD one. Caused ~25 cascade failures: send_message KeyError, process_registry FileNotFoundError, file_read_guards timeouts, read_loop_detection file-not-found, mcp_oauth None port, and provider_parity/codex_execution stale tool lists. Fix: autouse fixture saves all affected modules before each test and restores them after, matching the pattern in test_managed_browserbase_and_modal.py.
2026-04-25 00:51:20 +00:00 · 2026-04-02 15:33:51 -07:00 · 2026-04-02 15:33:51 -07:00 · 924bc67eee
commit 924bc67eee
parent e0b2bdb089
69 changed files with 7501 additions and 2317 deletions
--- a/website/docs/user-guide/features/honcho.md
+++ b/website/docs/user-guide/features/honcho.md
@ -1,404 +1,43 @@
 ---
-title: Honcho Memory
-description: AI-native persistent memory for cross-session user modeling and personalization.
-sidebar_label: Honcho Memory
-sidebar_position: 8
+sidebar_position: 99
+title: "Honcho Memory"
+description: "Honcho is now available as a memory provider plugin"
 ---

 # Honcho Memory

-[Honcho](https://honcho.dev) is an AI-native memory system that gives Hermes persistent, cross-session understanding of users. While Hermes has built-in memory (`MEMORY.md` and `USER.md`), Honcho adds a deeper layer of **user modeling** — learning preferences, goals, communication style, and context across conversations via a dual-peer architecture where both the user and the AI build representations over time.
-
-## Works Alongside Built-in Memory
-
-Hermes has two memory systems that can work together or be configured separately. In `hybrid` mode (the default), both run side by side — Honcho adds cross-session user modeling while local files handle agent-level notes.
-
-| Feature | Built-in Memory | Honcho Memory |
-|---------|----------------|---------------|
-| Storage | Local files (`~/.hermes/memories/`) | Cloud-hosted Honcho API |
-| Scope | Agent-level notes and user profile | Deep user modeling via dialectic reasoning |
-| Persistence | Across sessions on same machine | Across sessions, machines, and platforms |
-| Query | Injected into system prompt automatically | Prefetched + on-demand via tools |
-| Content | Manually curated by the agent | Automatically learned from conversations |
-| Write surface | `memory` tool (add/replace/remove) | `honcho_conclude` tool (persist facts) |
-
-Set `memoryMode` to `honcho` to use Honcho exclusively. See [Memory Modes](#memory-modes) for per-peer configuration.
-
-
-## Self-hosted / Docker
-
-Hermes supports a local Honcho instance (e.g. via Docker) in addition to the hosted API. Point it at your instance using `HONCHO_BASE_URL` — no API key required.
-
-**Via `hermes config`:**
-
-```bash
-hermes config set HONCHO_BASE_URL http://localhost:8000
-```
-
-**Via `~/.honcho/config.json`:**
-
-```json
-{
-  "hosts": {
-    "hermes": {
-      "base_url": "http://localhost:8000",
-      "enabled": true
-    }
-  }
-}
-```
-
-Hermes auto-enables Honcho when either `apiKey` or `base_url` is present, so no further configuration is needed for a local instance.
-
-To run Honcho locally, refer to the [Honcho self-hosting docs](https://docs.honcho.dev).
+:::info Honcho is now a Memory Provider Plugin
+Honcho has been integrated into the [Memory Providers](./memory-providers.md) system. All Honcho features are available through the unified memory provider interface.
+:::

 ## Setup

-### Interactive Setup
-
 ```bash
-hermes honcho setup
+hermes memory setup    # select "honcho"
 ```

-The setup wizard walks through API key, peer names, workspace, memory mode, write frequency, recall mode, and session strategy. It offers to install `honcho-ai` if missing.
-
-### Manual Setup
-
-#### 1. Install the Client Library
-
-```bash
-pip install 'honcho-ai>=2.0.1'
-```
-
-#### 2. Get an API Key
-
-Go to [app.honcho.dev](https://app.honcho.dev) > Settings > API Keys.
-
-#### 3. Configure
-
-Honcho reads from `~/.honcho/config.json` (shared across all Honcho-enabled applications):
-
-```json
-{
-  "apiKey": "your-honcho-api-key",
-  "hosts": {
-    "hermes": {
-      "workspace": "hermes",
-      "peerName": "your-name",
-      "aiPeer": "hermes",
-      "memoryMode": "hybrid",
-      "writeFrequency": "async",
-      "recallMode": "hybrid",
-      "sessionStrategy": "per-session",
-      "enabled": true
-    }
-  }
-}
-```
-
-`apiKey` lives at the root because it is a shared credential across all Honcho-enabled tools. All other settings are scoped under `hosts.hermes`. The `hermes honcho setup` wizard writes this structure automatically.
-
-Or set the API key as an environment variable:
-
-```bash
-hermes config set HONCHO_API_KEY your-key
-```
-
-:::info
-When an API key is present (either in `~/.honcho/config.json` or as `HONCHO_API_KEY`), Honcho auto-enables unless explicitly set to `"enabled": false`.
-:::
-
-## Configuration
-
-### Global Config (`~/.honcho/config.json`)
-
-Settings are scoped to `hosts.hermes` and fall back to root-level globals when the host field is absent. Root-level keys are managed by the user or the honcho CLI -- Hermes only writes to its own host block (except `apiKey`, which is a shared credential at root).
-
-**Root-level (shared)**
-
-| Field | Default | Description |
-|-------|---------|-------------|
-| `apiKey` | — | Honcho API key (required, shared across all hosts) |
-| `sessions` | `{}` | Manual session name overrides per directory (shared) |
-
-**Host-level (`hosts.hermes`)**
-
-| Field | Default | Description |
-|-------|---------|-------------|
-| `workspace` | `"hermes"` | Workspace identifier |
-| `peerName` | *(derived)* | Your identity name for user modeling |
-| `aiPeer` | `"hermes"` | AI assistant identity name |
-| `environment` | `"production"` | Honcho environment |
-| `enabled` | *(auto)* | Auto-enables when API key is present |
-| `saveMessages` | `true` | Whether to sync messages to Honcho |
-| `memoryMode` | `"hybrid"` | Memory mode: `hybrid` or `honcho` |
-| `writeFrequency` | `"async"` | When to write: `async`, `turn`, `session`, or integer N |
-| `recallMode` | `"hybrid"` | Retrieval strategy: `hybrid`, `context`, or `tools` |
-| `sessionStrategy` | `"per-session"` | How sessions are scoped |
-| `sessionPeerPrefix` | `false` | Prefix session names with peer name |
-| `contextTokens` | *(Honcho default)* | Max tokens for auto-injected context |
-| `dialecticReasoningLevel` | `"low"` | Floor for dialectic reasoning: `minimal` / `low` / `medium` / `high` / `max` |
-| `dialecticMaxChars` | `600` | Char cap on dialectic results injected into system prompt |
-| `linkedHosts` | `[]` | Other host keys whose workspaces to cross-reference |
-
-All host-level fields fall back to the equivalent root-level key if not set under `hosts.hermes`. Existing configs with settings at root level continue to work.
-
-### Memory Modes
-
-| Mode | Effect |
-|------|--------|
-| `hybrid` | Write to both Honcho and local files (default) |
-| `honcho` | Honcho only — skip local file writes |
-
-Memory mode can be set globally or per-peer (user, agent1, agent2, etc):
-
-```json
-{
-  "memoryMode": {
-    "default": "hybrid",
-    "hermes": "honcho"
-  }
-}
-```
-
-To disable Honcho entirely, set `enabled: false` or remove the API key.
-
-### Recall Modes
-
-Controls how Honcho context reaches the agent:
-
-| Mode | Behavior |
-|------|----------|
-| `hybrid` | Auto-injected context + Honcho tools available (default) |
-| `context` | Auto-injected context only — Honcho tools hidden |
-| `tools` | Honcho tools only — no auto-injected context |
-
-### Write Frequency
-
-| Setting | Behavior |
-|---------|----------|
-| `async` | Background thread writes (zero blocking, default) |
-| `turn` | Synchronous write after each turn |
-| `session` | Batched write at session end |
-| *integer N* | Write every N turns |
-
-### Session Strategies
-
-| Strategy | Session key | Use case |
-|----------|-------------|----------|
-| `per-session` | Unique per run | Default. Fresh session every time. |
-| `per-directory` | CWD basename | Each project gets its own session. |
-| `per-repo` | Git repo root name | Groups subdirectories under one session. |
-| `global` | Fixed `"global"` | Single cross-project session. |
-
-Resolution order: manual map > session title > strategy-derived key > platform key.
-
-### Multi-host Configuration
-
-Multiple Honcho-enabled tools share `~/.honcho/config.json`. Each tool writes only to its own host block, reads its host block first, and falls back to root-level globals:
-
-```json
-{
-  "apiKey": "your-key",
-  "peerName": "eri",
-  "hosts": {
-    "hermes": {
-      "workspace": "my-workspace",
-      "aiPeer": "hermes-assistant",
-      "memoryMode": "honcho",
-      "linkedHosts": ["claude-code"],
-      "contextTokens": 2000,
-      "dialecticReasoningLevel": "medium"
-    },
-    "claude-code": {
-      "workspace": "my-workspace",
-      "aiPeer": "clawd"
-    }
-  }
-}
-```
-
-Resolution: `hosts.<tool>` field > root-level field > default. In this example, both tools share the root `apiKey` and `peerName`, but each has its own `aiPeer` and workspace settings.
-
-### Hermes Config (`~/.hermes/config.yaml`)
-
-Intentionally minimal — most configuration comes from `~/.honcho/config.json`:
+Or set manually:

 ```yaml
-honcho: {}
+# ~/.hermes/config.yaml
+memory:
+  provider: honcho
 ```

-## How It Works
-
-### Async Context Pipeline
-
-Honcho context is fetched asynchronously to avoid blocking the response path:
-
-```mermaid
-flowchart TD
-    user["User message"] --> cache["Consume cached Honcho context<br/>from the previous turn"]
-    cache --> prompt["Inject user, AI, and dialectic context<br/>into the system prompt"]
-    prompt --> llm["LLM call"]
-    llm --> response["Assistant response"]
-    response --> fetch["Start background fetch for Turn N+1"]
-    fetch --> ctx["Fetch context"]
-    fetch --> dia["Fetch dialectic"]
-    ctx --> next["Cache for the next turn"]
-    dia --> next
-```
-
-Turn 1 is a cold start (no cache). All subsequent turns consume cached results with zero HTTP latency on the response path. The system prompt on turn 1 uses only static context to preserve prefix cache hits at the LLM provider.
-
-### Dual-Peer Architecture
-
-Both the user and AI have peer representations in Honcho:
-
- **User peer** — observed from user messages. Honcho learns preferences, goals, communication style.
- **AI peer** — observed from assistant messages (`observe_me=True`). Honcho builds a representation of the agent's knowledge and behavior.
-
-Both representations are injected into the system prompt when available.
-
-### Dynamic Reasoning Level
-
-Dialectic queries scale reasoning effort with message complexity:
-
-| Message length | Reasoning level |
-|----------------|-----------------|
-| < 120 chars | Config default (typically `low`) |
-| 120-400 chars | One level above default (cap: `high`) |
-| > 400 chars | Two levels above default (cap: `high`) |
-
-`max` is never selected automatically.
-
-### Gateway Integration
-
-The gateway creates short-lived `AIAgent` instances per request. Honcho managers are owned at the gateway session layer (`_honcho_managers` dict) so they persist across requests within the same session and flush at real session boundaries (reset, resume, expiry, server stop).
-
-#### Session Isolation
-
-Each gateway session (e.g., a Telegram chat, a Discord channel) gets its own Honcho session context. The session key — derived from the platform and chat ID — is threaded through the entire tool dispatch chain so that Honcho tool calls always execute against the correct session, even when multiple users are messaging concurrently.
-
-This means:
- **`honcho_profile`**, **`honcho_search`**, **`honcho_context`**, and **`honcho_conclude`** all resolve the correct session at call time, not at startup
- Background memory flushes (triggered by `/reset`, `/resume`, or session expiry) preserve the original session key so they write to the correct Honcho session
- Synthetic flush turns (where the agent saves memories before context is lost) skip Honcho sync to avoid polluting conversation history with internal bookkeeping
-
-#### Session Lifecycle
-
-| Event | What happens to Honcho |
-|-------|------------------------|
-| New message arrives | Agent inherits the gateway's Honcho manager + session key |
-| `/reset` | Memory flush fires with the old session key, then Honcho manager shuts down |
-| `/resume` | Current session is flushed, then the resumed session's Honcho context loads |
-| Session expiry | Automatic flush + shutdown after the configured idle timeout |
-| Gateway stop | All active Honcho managers are flushed and shut down gracefully |
-
-## Tools
-
-When Honcho is active, four tools become available. Availability is gated dynamically — they are invisible when Honcho is disabled.
-
-### `honcho_profile`
-
-Fast peer card retrieval (no LLM). Returns a curated list of key facts about the user.
-
-### `honcho_search`
-
-Semantic search over memory (no LLM). Returns raw excerpts ranked by relevance. Cheaper and faster than `honcho_context` — good for factual lookups.
-
-Parameters:
- `query` (string) — search query
- `max_tokens` (integer, optional) — result token budget
-
-### `honcho_context`
-
-Dialectic Q&A powered by Honcho's LLM. Synthesizes an answer from accumulated conversation history.
-
-Parameters:
- `query` (string) — natural language question
- `peer` (string, optional) — `"user"` (default) or `"ai"`. Querying `"ai"` asks about the assistant's own history and identity.
-
-Example queries the agent might make:
-
-```
-"What are this user's main goals?"
-"What communication style does this user prefer?"
-"What topics has this user discussed recently?"
-"What is this user's technical expertise level?"
-```
-
-### `honcho_conclude`
-
-Writes a fact to Honcho memory. Use when the user explicitly states a preference, correction, or project context worth remembering. Feeds into the user's peer card and representation.
-
-Parameters:
- `conclusion` (string) — the fact to persist
-
-## CLI Commands
-
-```
-hermes honcho setup                        # Interactive setup wizard
-hermes honcho status                       # Show config and connection status
-hermes honcho sessions                     # List directory → session name mappings
-hermes honcho map <name>                   # Map current directory to a session name
-hermes honcho peer                         # Show peer names and dialectic settings
-hermes honcho peer --user NAME             # Set user peer name
-hermes honcho peer --ai NAME               # Set AI peer name
-hermes honcho peer --reasoning LEVEL       # Set dialectic reasoning level
-hermes honcho mode                         # Show current memory mode
-hermes honcho mode [hybrid|honcho|local]   # Set memory mode
-hermes honcho tokens                       # Show token budget settings
-hermes honcho tokens --context N           # Set context token cap
-hermes honcho tokens --dialectic N         # Set dialectic char cap
-hermes honcho identity                     # Show AI peer identity
-hermes honcho identity <file>              # Seed AI peer identity from file (SOUL.md, etc.)
-hermes honcho migrate                      # Migration guide: OpenClaw → Hermes + Honcho
-```
-
-### Doctor Integration
-
-`hermes doctor` includes a Honcho section that validates config, API key, and connection status.
-
-## Migration
-
-### From Local Memory
-
-When Honcho activates on an instance with existing local history, migration runs automatically:
-
-1. **Conversation history** — prior messages are uploaded as an XML transcript file
-2. **Memory files** — existing `MEMORY.md`, `USER.md`, and `SOUL.md` are uploaded for context
-
-### From OpenClaw
-
 ```bash
-hermes honcho migrate
+echo "HONCHO_API_KEY=your-key" >> ~/.hermes/.env
 ```

-Walks through converting an OpenClaw native Honcho setup to the shared `~/.honcho/config.json` format.
+## Migrating from `hermes honcho`

-## AI Peer Identity
+If you previously used `hermes honcho setup`:

-Honcho can build a representation of the AI assistant over time (via `observe_me=True`). You can also seed the AI peer explicitly:
+1. Your existing configuration (`honcho.json` or `~/.honcho/config.json`) is preserved
+2. Your server-side data (memories, conclusions, user profiles) is intact
+3. Just set `memory.provider: honcho` to reactivate

-```bash
-hermes honcho identity ~/.hermes/SOUL.md
-```
+No re-login or re-setup needed. Run `hermes memory setup` and select "honcho" — the wizard detects your existing config.

-This uploads the file content through Honcho's observation pipeline. The AI peer representation is then injected into the system prompt alongside the user's, giving the agent awareness of its own accumulated identity.
+## Full Documentation

-```bash
-hermes honcho identity --show
-```
-
-Shows the current AI peer representation from Honcho.
-
-## Use Cases
-
- **Personalized responses** — Honcho learns how each user prefers to communicate
- **Goal tracking** — remembers what users are working toward across sessions
- **Expertise adaptation** — adjusts technical depth based on user's background
- **Cross-platform memory** — same user understanding across CLI, Telegram, Discord, etc.
- **Multi-user support** — each user (via messaging platforms) gets their own user model
-
-:::tip
-Honcho is fully opt-in — zero behavior change when disabled or unconfigured. All Honcho calls are non-fatal; if the service is unreachable, the agent continues normally.
-:::
+See [Memory Providers — Honcho](./memory-providers.md#honcho) for tools, config reference, and details.
--- a/website/docs/user-guide/features/memory-providers.md
+++ b/website/docs/user-guide/features/memory-providers.md
@ -0,0 +1,277 @@
+---
+sidebar_position: 4
+title: "Memory Providers"
+description: "External memory provider plugins — Honcho, OpenViking, Mem0, Hindsight, Holographic, RetainDB, ByteRover"
+---
+
+# Memory Providers
+
+Hermes Agent ships with 7 external memory provider plugins that give the agent persistent, cross-session knowledge beyond the built-in MEMORY.md and USER.md. Only **one** external provider can be active at a time — the built-in memory is always active alongside it.
+
+## Quick Start
+
+```bash
+hermes memory setup      # interactive picker + configuration
+hermes memory status     # check what's active
+hermes memory off        # disable external provider
+```
+
+Or set manually in `~/.hermes/config.yaml`:
+
+```yaml
+memory:
+  provider: openviking   # or honcho, mem0, hindsight, holographic, retaindb, byterover
+```
+
+## How It Works
+
+When a memory provider is active, Hermes automatically:
+
+1. **Injects provider context** into the system prompt (what the provider knows)
+2. **Prefetches relevant memories** before each turn (background, non-blocking)
+3. **Syncs conversation turns** to the provider after each response
+4. **Extracts memories on session end** (for providers that support it)
+5. **Mirrors built-in memory writes** to the external provider
+6. **Adds provider-specific tools** so the agent can search, store, and manage memories
+
+The built-in memory (MEMORY.md / USER.md) continues to work exactly as before. The external provider is additive.
+
+## Available Providers
+
+### Honcho
+
+AI-native cross-session user modeling with dialectic Q&A, semantic search, and persistent conclusions.
+
+| | |
+|---|---|
+| **Best for** | Teams using Honcho's user modeling platform |
+| **Requires** | `pip install honcho-ai` + API key |
+| **Data storage** | Honcho Cloud |
+| **Cost** | Honcho pricing |
+
+**Tools:** `honcho_profile` (peer card), `honcho_search` (semantic search), `honcho_context` (LLM-synthesized), `honcho_conclude` (store facts)
+
+**Setup:**
+```bash
+hermes memory setup    # select "honcho"
+# Or manually:
+hermes config set memory.provider honcho
+echo "HONCHO_API_KEY=your-key" >> ~/.hermes/.env
+```
+
+**Config:** `$HERMES_HOME/honcho.json` — existing Honcho users' configuration and data are fully preserved.
+
+:::tip Migrating from `hermes honcho`
+If you previously used `hermes honcho setup`, your config and all server-side data are intact. Just set `memory.provider: honcho` to reactivate via the new system.
+:::
+
+---
+
+### OpenViking
+
+Context database by Volcengine (ByteDance) with filesystem-style knowledge hierarchy, tiered retrieval, and automatic memory extraction into 6 categories.
+
+| | |
+|---|---|
+| **Best for** | Self-hosted knowledge management with structured browsing |
+| **Requires** | `pip install openviking` + running server |
+| **Data storage** | Self-hosted (local or cloud) |
+| **Cost** | Free (open-source, AGPL-3.0) |
+
+**Tools:** `viking_search` (semantic search), `viking_read` (tiered: abstract/overview/full), `viking_browse` (filesystem navigation), `viking_remember` (store facts), `viking_add_resource` (ingest URLs/docs)
+
+**Setup:**
+```bash
+# Start the OpenViking server first
+pip install openviking
+openviking-server
+
+# Then configure Hermes
+hermes memory setup    # select "openviking"
+# Or manually:
+hermes config set memory.provider openviking
+echo "OPENVIKING_ENDPOINT=http://localhost:1933" >> ~/.hermes/.env
+```
+
+**Key features:**
+- Tiered context loading: L0 (~100 tokens) → L1 (~2k) → L2 (full)
+- Automatic memory extraction on session commit (profile, preferences, entities, events, cases, patterns)
+- `viking://` URI scheme for hierarchical knowledge browsing
+
+---
+
+### Mem0
+
+Server-side LLM fact extraction with semantic search, reranking, and automatic deduplication.
+
+| | |
+|---|---|
+| **Best for** | Hands-off memory management — Mem0 handles extraction automatically |
+| **Requires** | `pip install mem0ai` + API key |
+| **Data storage** | Mem0 Cloud |
+| **Cost** | Mem0 pricing |
+
+**Tools:** `mem0_profile` (all stored memories), `mem0_search` (semantic search + reranking), `mem0_conclude` (store verbatim facts)
+
+**Setup:**
+```bash
+hermes memory setup    # select "mem0"
+# Or manually:
+hermes config set memory.provider mem0
+echo "MEM0_API_KEY=your-key" >> ~/.hermes/.env
+```
+
+**Config:** `$HERMES_HOME/mem0.json`
+
+| Key | Default | Description |
+|-----|---------|-------------|
+| `user_id` | `hermes-user` | User identifier |
+| `agent_id` | `hermes` | Agent identifier |
+
+---
+
+### Hindsight
+
+Long-term memory with knowledge graph, entity resolution, and multi-strategy retrieval. The `hindsight_reflect` tool provides cross-memory synthesis that no other provider offers.
+
+| | |
+|---|---|
+| **Best for** | Knowledge graph-based recall with entity relationships |
+| **Requires** | Cloud: `pip install hindsight-client` + API key. Local: `pip install hindsight` + LLM key |
+| **Data storage** | Hindsight Cloud or local embedded PostgreSQL |
+| **Cost** | Hindsight pricing (cloud) or free (local) |
+
+**Tools:** `hindsight_retain` (store with entity extraction), `hindsight_recall` (multi-strategy search), `hindsight_reflect` (cross-memory synthesis)
+
+**Setup:**
+```bash
+hermes memory setup    # select "hindsight"
+# Or manually:
+hermes config set memory.provider hindsight
+echo "HINDSIGHT_API_KEY=your-key" >> ~/.hermes/.env
+```
+
+**Config:** `$HERMES_HOME/hindsight/config.json`
+
+| Key | Default | Description |
+|-----|---------|-------------|
+| `mode` | `cloud` | `cloud` or `local` |
+| `bank_id` | `hermes` | Memory bank identifier |
+| `budget` | `mid` | Recall thoroughness: `low` / `mid` / `high` |
+
+---
+
+### Holographic
+
+Local SQLite fact store with FTS5 full-text search, trust scoring, and HRR (Holographic Reduced Representations) for compositional algebraic queries.
+
+| | |
+|---|---|
+| **Best for** | Local-only memory with advanced retrieval, no external dependencies |
+| **Requires** | Nothing (SQLite is always available). NumPy optional for HRR algebra. |
+| **Data storage** | Local SQLite |
+| **Cost** | Free |
+
+**Tools:** `fact_store` (9 actions: add, search, probe, related, reason, contradict, update, remove, list), `fact_feedback` (helpful/unhelpful rating that trains trust scores)
+
+**Setup:**
+```bash
+hermes memory setup    # select "holographic"
+# Or manually:
+hermes config set memory.provider holographic
+```
+
+**Config:** `config.yaml` under `plugins.hermes-memory-store`
+
+| Key | Default | Description |
+|-----|---------|-------------|
+| `db_path` | `$HERMES_HOME/memory_store.db` | SQLite database path |
+| `auto_extract` | `false` | Auto-extract facts at session end |
+| `default_trust` | `0.5` | Default trust score (0.0–1.0) |
+
+**Unique capabilities:**
+- `probe` — entity-specific algebraic recall (all facts about a person/thing)
+- `reason` — compositional AND queries across multiple entities
+- `contradict` — automated detection of conflicting facts
+- Trust scoring with asymmetric feedback (+0.05 helpful / -0.10 unhelpful)
+
+---
+
+### RetainDB
+
+Cloud memory API with hybrid search (Vector + BM25 + Reranking), 7 memory types, and delta compression.
+
+| | |
+|---|---|
+| **Best for** | Teams already using RetainDB's infrastructure |
+| **Requires** | RetainDB account + API key |
+| **Data storage** | RetainDB Cloud |
+| **Cost** | $20/month |
+
+**Tools:** `retaindb_profile` (user profile), `retaindb_search` (semantic search), `retaindb_context` (task-relevant context), `retaindb_remember` (store with type + importance), `retaindb_forget` (delete memories)
+
+**Setup:**
+```bash
+hermes memory setup    # select "retaindb"
+# Or manually:
+hermes config set memory.provider retaindb
+echo "RETAINDB_API_KEY=your-key" >> ~/.hermes/.env
+```
+
+---
+
+### ByteRover
+
+Persistent memory via the `brv` CLI — hierarchical knowledge tree with tiered retrieval (fuzzy text → LLM-driven search). Local-first with optional cloud sync.
+
+| | |
+|---|---|
+| **Best for** | Developers who want portable, local-first memory with a CLI |
+| **Requires** | ByteRover CLI (`npm install -g byterover-cli` or [install script](https://byterover.dev)) |
+| **Data storage** | Local (default) or ByteRover Cloud (optional sync) |
+| **Cost** | Free (local) or ByteRover pricing (cloud) |
+
+**Tools:** `brv_query` (search knowledge tree), `brv_curate` (store facts/decisions/patterns), `brv_status` (CLI version + tree stats)
+
+**Setup:**
+```bash
+# Install the CLI first
+curl -fsSL https://byterover.dev/install.sh | sh
+
+# Then configure Hermes
+hermes memory setup    # select "byterover"
+# Or manually:
+hermes config set memory.provider byterover
+```
+
+**Key features:**
+- Automatic pre-compression extraction (saves insights before context compression discards them)
+- Knowledge tree stored at `$HERMES_HOME/byterover/` (profile-scoped)
+- SOC2 Type II certified cloud sync (optional)
+
+---
+
+## Provider Comparison
+
+| Provider | Storage | Cost | Tools | Dependencies | Unique Feature |
+|----------|---------|------|-------|-------------|----------------|
+| **Honcho** | Cloud | Paid | 4 | `honcho-ai` | Dialectic user modeling |
+| **OpenViking** | Self-hosted | Free | 5 | `openviking` + server | Filesystem hierarchy + tiered loading |
+| **Mem0** | Cloud | Paid | 3 | `mem0ai` | Server-side LLM extraction |
+| **Hindsight** | Cloud/Local | Free/Paid | 3 | `hindsight-client` | Knowledge graph + reflect synthesis |
+| **Holographic** | Local | Free | 2 | None | HRR algebra + trust scoring |
+| **RetainDB** | Cloud | $20/mo | 5 | `requests` | Delta compression |
+| **ByteRover** | Local/Cloud | Free/Paid | 3 | `brv` CLI | Pre-compression extraction |
+
+## Profile Isolation
+
+Each provider's data is isolated per [profile](/docs/user-guide/profiles):
+
+- **Local storage providers** (Holographic, ByteRover) use `$HERMES_HOME/` paths which differ per profile
+- **Config file providers** (Honcho, Mem0, Hindsight) store config in `$HERMES_HOME/` so each profile has its own credentials
+- **Cloud providers** (RetainDB) auto-derive profile-scoped project names
+- **Env var providers** (OpenViking) are configured via each profile's `.env` file
+
+## Building a Memory Provider
+
+See the [Developer Guide: Memory Provider Plugins](/docs/developer-guide/memory-provider-plugin) for how to create your own.
--- a/website/docs/user-guide/features/memory.md
+++ b/website/docs/user-guide/features/memory.md
@ -207,12 +207,15 @@ memory:
  user_char_limit: 1375     # ~500 tokens
 ```

-## Honcho Integration (Cross-Session User Modeling)
+## External Memory Providers

-For deeper, AI-generated user understanding that works across sessions and platforms, you can enable [Honcho Memory](./honcho.md). Honcho runs alongside built-in memory in `hybrid` mode (the default) — `MEMORY.md` and `USER.md` stay as-is, and Honcho adds a persistent user modeling layer on top.
+For deeper, persistent memory that goes beyond MEMORY.md and USER.md, Hermes ships with 7 external memory provider plugins — including Honcho, OpenViking, Mem0, Hindsight, Holographic, RetainDB, and ByteRover.
+
+External providers run **alongside** built-in memory (never replacing it) and add capabilities like knowledge graphs, semantic search, automatic fact extraction, and cross-session user modeling.

 ```bash
-hermes honcho setup
+hermes memory setup      # pick a provider and configure it
+hermes memory status     # check what's active
 ```

-See the [Honcho Memory](./honcho.md) docs for full configuration, tools, and CLI reference.
+See the [Memory Providers](./memory-providers.md) guide for full details on each provider, setup instructions, and comparison.