mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-03 02:11:48 +00:00

feat(plugins): bundle hermes-achievements + scan full session history (#17754 )

* feat(plugins): bundle hermes-achievements, scan full session history

Ships @PCinkusz's hermes-achievements dashboard plugin (https://github.com/PCinkusz/hermes-achievements) as a bundled plugin at plugins/hermes-achievements/ and fixes a bug in the scan path that made the plugin only see the first 200 sessions — making lifetime badges (50k tool calls, 75k errors, etc.) unreachable on long-running installs.

Changes:

- plugins/hermes-achievements/: vendor v0.3.1 verbatim (manifest, dist/, plugin_api.py, tests, docs, README).
- plugins/hermes-achievements/dashboard/plugin_api.py:
* scan_sessions(): limit=None now scans ALL sessions via SQLite LIMIT -1. Previously capped at 200, so users with 8000+ sessions saw ~2% of their history.
* evaluate_all(): first-ever scans run in a background thread so the dashboard request path never blocks. Stale snapshots serve immediately while a background refresh runs. force=True still blocks synchronously for manual /rescan.
* _build_pending_snapshot(), _start_background_scan(), _run_scan_and_update_cache(): supporting plumbing + idempotent thread spawn.
- tests/plugins/test_achievements_plugin.py: new tests covering the 200-cap regression, the background-scan first-run flow, stale-serve-plus-background-refresh, forced sync rescan, and scan-thread idempotency.
- website/docs/user-guide/features/built-in-plugins.md: lists hermes-achievements in the bundled-plugins table and documents API endpoints, state files, and performance characteristics.

E2E validated against a real 8564-session ~6.4GB state.db:
* Cold scan: 13m 19s (one-time, backgrounded — UI never blocks)
* Warm rescan: 1.47s (8563/8564 sessions reused from checkpoint cache)
* 57/60 achievements unlocked, 3 discovered — aggregates like total_tool_calls=259958, total_errors=164213, skill_events=368243 correctly surface lifetime badges that the 200-cap made unreachable.

Original credit: @PCinkusz (MIT-licensed). Upstream repo remains the staging ground for new badges; this bundle keeps the dashboard feature parity with Hermes core changes.

* feat(achievements): publish partial snapshots during cold scan

Previously a cold scan on a large session DB (13min on 8564 sessions)
showed zero badges for the entire duration, then every badge at once
when the scan completed. A dashboard refresh mid-scan was indistinguishable
from a fresh install with no history.

Now the scanner publishes a partial snapshot to _SNAPSHOT_CACHE every
250 sessions, so each refresh during a cold scan surfaces more badges
incrementally.

Mechanism:
- scan_sessions() takes an optional progress_callback fired every
progress_every sessions with (sessions_so_far, scanned, total).
- _compute_from_scan() is extracted from compute_all() and gains an
is_partial flag that skips writing to state.json — we don't want
to record unlocked_at based on a half-complete aggregate that a
later session might rebalance.
- _run_scan_and_update_cache() installs a publisher callback that
builds a partial snapshot, marks it mode='in_progress', and writes
it to the cache with age=0 so the UI keeps polling /scan-status
and picks up the final snapshot when the scan completes.
- Manual /rescan (force=True) disables partial publishing — the
caller is blocking on the final result anyway.

E2E against real 8564-session state.db (polled cache every 10s):
t=10s: cache empty
t=20s: 250/8564 scanned, 35 unlocked, 25 discovered
t=40s: 500/8564 scanned, 42 unlocked, 18 discovered
t=60s: 1000/8564 scanned, 49 unlocked, 11 discovered
...

Tests: 9/9 pass (2 new — partial snapshot publication + no-persist-on-partial).
Upstream unittest suite: 10/10 pass.

* feat(achievements): in-progress scan banner with live % progress

Previously the dashboard showed zero badges silently during long cold
scans (13min on 8564 sessions). The backend was publishing partial
snapshots every 250 sessions, but the bundled UI didn't surface any
indicator that a scan was running — it just rendered the main page
with whatever counts were currently published and no way for the user
to know more progress was coming.

UI changes (dist/index.js, dist/style.css):

- Added a scan-in-progress banner rendered between the hero and stats
when scan_meta.mode is 'pending' or 'in_progress'. Shows:
BUILDING ACHIEVEMENT PROFILE…
Scanned 1,750 of 8,564 sessions · 20%. Badges unlock as more history streams in.
with a pulsing teal indicator and a filling teal/cyan progress bar.
Disappears the moment the backend flips to 'full' or 'incremental'.

- Added an auto-poller via useEffect — while scanInFlight is true the
page re-fetches /achievements every 4s WITHOUT toggling the loading
skeleton, so unlock counts tick up visibly without the user refreshing.
The effect cleans itself up when the scan finishes.

- Added refresh() (re-fetch, no loading flip) alongside the existing
load() (full reload, used by the Rescan button).

Attribution preserved:

- Added a header comment to index.js crediting @PCinkusz
(https://github.com/PCinkusz/hermes-achievements, MIT) as the
original author, noting the banner is a layered addition on top
of the original dist bundle.
- Matching header comment in style.css, flagging the new
.ha-scan-banner* rules as the local addition.

Live-verified end to end:

- Spun up `hermes dashboard --port 9229 --no-open` against a fresh
HERMES_HOME symlinked to the real 8564-session state.db.
- Opened /achievements in a browser, confirmed the banner renders with
live progress: 'Scanned 1,000 of 8,564 sessions · 11%' → updates to
'1,250 ... · 14%' → '1,750 ... · 20%' without user interaction,
matching the backend's partial publications.
- Stats row simultaneously climbed from 35 → 49 → 53 unlocked as
more history streamed in.
- Vision analysis of the rendered page confirms the banner styling
matches the rest of the dashboard (dark card bg, teal accent, same
small-caps typography, pulsing indicator reusing ha-pulse keyframes).

2026-04-29 23:23:57 -07:00

5.6 KiB

Raw Blame History

Hermes Achievements Implementation Spec (Detailed)

This document is implementation-facing detail to execute the performance refactor later.

Decision scope: keep only Achievements tab flow; remove /overview + top-banner slot integration.

A) Current Behavior Summary

evaluate_all() performs:
- full scan_sessions()
- SessionDB.list_sessions_rich(...)
- db.get_messages(session_id) for each session
- text/tool regex analysis + aggregation + evaluation
/overview and /achievements both currently call evaluate_all() directly.
slot calls (sessions:top, analytics:top) currently invoke /overview.

Consequence: repeated full recomputes and contention.

B) De-scope/Removal Changes

Remove backend route:

GET /overview

Remove frontend slot usage:

SummarySlot component
registerSlot("sessions:top")
registerSlot("analytics:top")

Remove manifest slot declarations:

"slots": ["sessions:top", "analytics:top"]

Keep:

tab route/page for Achievements
/achievements endpoint and full tab rendering

C) Target Internal Interfaces

1) `SnapshotStore`

Responsibilities:

hold latest computed snapshot in memory
persist/load snapshot from disk
expose age and staleness checks

Storage path:

~/.hermes/plugins/hermes-achievements/scan_snapshot.json

Methods (conceptual):

get() -> snapshot | null
set(snapshot)
is_stale(ttl_seconds)

2) `ScanCoordinator`

Responsibilities:

single-flight guard for compute jobs
track scan status

Methods:

run_if_needed(force: bool = false)
get_status()

State fields:

state: idle|running|failed
started_at, finished_at
last_error
run_count

3) `build_snapshot()`

Responsibilities:

execute current compute logic once
on first run, perform full scan and materialize per-session contributions
on subsequent runs, process only changed/new sessions via checkpoint fingerprints
produce shape consumed by /achievements

Output:

achievements
count fields
optional scan_meta

D) Endpoint Behavior Matrix (No `/overview`)

Endpoint	Cache fresh	Cache stale	No cache	Force rescan
`/achievements`	return cached	return stale + trigger bg refresh	blocking bootstrap scan	n/a
`/rescan`	trigger refresh	trigger refresh	trigger refresh	yes
`/scan-status`	status only	status only	status only	status only

Notes:

At most one scan run active.
Other callers either await same run or receive stale snapshot according to policy.

E) Data Shape (Proposed)

{
  "generated_at": 0,
  "is_stale": false,
  "scan_meta": {
    "duration_ms": 0,
    "sessions_scanned": 0,
    "messages_scanned": 0,
    "mode": "full",
    "error": null
  },
  "achievements": [],
  "unlocked_count": 0,
  "discovered_count": 0,
  "secret_count": 0,
  "total_count": 0,
  "error": null
}

Compatibility guidance:

Keep existing /achievements keys.
Add metadata keys without breaking old callers.

Checkpoint file (new):

~/.hermes/plugins/hermes-achievements/scan_checkpoint.json

Suggested checkpoint shape:

{
  "schema_version": 1,
  "generated_at": 0,
  "sessions": {
    "<session_id>": {
      "fingerprint": {
        "updated_at": 0,
        "message_count": 0,
        "hash": "optional"
      },
      "contribution": {
        "metrics": {}
      }
    }
  }
}

Notes:

fingerprint mismatch => recompute that session contribution only.
unchanged fingerprint => reuse stored contribution.

F) Concurrency Contract

Any request path that needs fresh data must pass through single-flight coordinator.
If a scan is running:
- do not start second scan
- either await in-flight run (bounded) or serve stale snapshot immediately
lock scope must include scan start/finish state transitions.

G) Error Handling Contract

If refresh fails and prior snapshot exists:
- return prior snapshot with is_stale=true and error metadata
If refresh fails and no prior snapshot:
- return explicit error response (current behavior equivalent)
scan-status should always return last known state/error.

H) Frontend Integration Contract

Achievements page:
- one fetch on mount to /achievements
- optional background refresh indicator if stale
no top-banner slot integration
avoid duplicate in-flight calls during fast navigation by cancellation/debounce.

I) Validation Checklist

/overview route removed
manifest has no sessions:top/analytics:top slots
frontend has no api("/overview") calls
repeated Achievements navigation does not create multiple heavy scans
average warm load times meet SLOs
unlock totals match pre-refactor baseline for same history
no schema regression in /achievements response

J) Suggested File Placement for Future Work

backend changes: dashboard/plugin_api.py
optional extraction:
- dashboard/perf_snapshot.py
- dashboard/perf_scan_coordinator.py
frontend request hygiene: dashboard/dist/index.js (or source if available)
plugin metadata: dashboard/manifest.json
persisted runtime files:
- ~/.hermes/plugins/hermes-achievements/state.json (existing unlock state)
- ~/.hermes/plugins/hermes-achievements/scan_snapshot.json (new)
- ~/.hermes/plugins/hermes-achievements/scan_checkpoint.json (new)

K) Post-Implementation Reporting Template

Record:

dataset size (sessions/messages/tool calls)
pre/post /achievements timings (cold/warm)
whether single-flight dedupe triggered under repeated tab open
any behavioral diffs in unlock counts

5.6 KiB Raw Blame History

Hermes Achievements Implementation Spec (Detailed)

A) Current Behavior Summary

B) De-scope/Removal Changes

C) Target Internal Interfaces

1) SnapshotStore

2) ScanCoordinator

3) build_snapshot()

D) Endpoint Behavior Matrix (No /overview)

E) Data Shape (Proposed)

F) Concurrency Contract

G) Error Handling Contract

H) Frontend Integration Contract

I) Validation Checklist

J) Suggested File Placement for Future Work

K) Post-Implementation Reporting Template

5.6 KiB

Raw Blame History

1) `SnapshotStore`

2) `ScanCoordinator`

3) `build_snapshot()`

D) Endpoint Behavior Matrix (No `/overview`)