mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-06-09 08:21:50 +00:00
Drops the React `<Profiler>` approach (no-op because Vite is currently
serving the production React build) in favor of an externally-observable
measurement stack: rAF frame intervals, `PerformanceObserver({entryTypes:
['longtask']})`, and a `MutationObserver` on the live streaming message.
Adds a synthetic stream driver — `window.__PERF_DRIVE__.stream({...})` —
that pushes tokens through the live `$messages` atom at a controlled rate,
so the assistant-ui runtime, incremental repository, and Streamdown
markdown pipeline see the same workload they'd see during a real LLM
stream, without the LLM cost.
The driver lives in `src/app/chat/perf-probe.tsx`; `main.tsx` side-imports
it under `import.meta.env.MODE !== 'production'` so it tree-shakes out of
prod builds. (Using `MODE` rather than `DEV` because our Vite setup
currently reports `DEV=false` even under `vite dev` — see the dev-build
note in `profile-typing-lag.md`.)
Scripts:
- measure-synthetic-stream.mjs drive synthetic + record frame/longtask/mutation
- profile-synth-stream.mjs CPU profile + top self-time during synthetic
- measure-real-stream.mjs same harness, real LLM stream
- profile-real-stream.mjs CPU profile bracketing the real stream window
- eval.mjs / reload.mjs small CDP helpers
A real-LLM measurement on Cloud Shadows (gpt-4o-mini, 39 s window) showed
12 longtasks in the same 75-127 ms range the synthetic predicted, so the
synthetic is a faithful proxy.
36 lines
1.2 KiB
JavaScript
36 lines
1.2 KiB
JavaScript
// Hard reload the Electron renderer over CDP. Vite-no-HMR mode means edits
|
|
// don't auto-apply — call this after editing source.
|
|
const targets = await (await fetch('http://127.0.0.1:9222/json')).json()
|
|
const t = targets.find((t) => t.url.includes('5174'))
|
|
if (!t) {
|
|
console.error('renderer not found')
|
|
process.exit(1)
|
|
}
|
|
const ws = new WebSocket(t.webSocketDebuggerUrl)
|
|
let id = 0
|
|
const pending = new Map()
|
|
ws.addEventListener('message', (ev) => {
|
|
const m = JSON.parse(ev.data)
|
|
if (pending.has(m.id)) {
|
|
pending.get(m.id)(m)
|
|
pending.delete(m.id)
|
|
}
|
|
})
|
|
await new Promise((r) => ws.addEventListener('open', r))
|
|
const send = (method, params = {}) =>
|
|
new Promise((res) => {
|
|
const i = ++id
|
|
pending.set(i, res)
|
|
ws.send(JSON.stringify({ id: i, method, params }))
|
|
})
|
|
|
|
await send('Page.reload', { ignoreCache: true })
|
|
console.log('reload sent')
|
|
// Wait for new doc.
|
|
await new Promise((r) => setTimeout(r, 2500))
|
|
const r = await send('Runtime.evaluate', {
|
|
expression: 'JSON.stringify({ hasProbe: !!window.__PERF_PROBE__, composer: !!document.querySelector("[contenteditable=true]"), url: location.hash })',
|
|
returnByValue: true,
|
|
})
|
|
console.log(r.result.result.value)
|
|
ws.close()
|