mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-06-09 08:21:50 +00:00
Some checks are pending
Deploy Site / deploy-vercel (push) Waiting to run
Deploy Site / deploy-docs (push) Waiting to run
Docker Build and Publish / build-amd64 (push) Waiting to run
Docker Build and Publish / build-arm64 (push) Waiting to run
Docker Build and Publish / merge (push) Blocked by required conditions
Lint (ruff + ty) / ruff + ty diff (push) Waiting to run
Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run
Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run
Nix Lockfile Fix / auto-fix-main (push) Waiting to run
Nix Lockfile Fix / fix (push) Waiting to run
Nix / nix (macos-latest) (push) Waiting to run
Nix / nix (ubuntu-latest) (push) Waiting to run
OSV-Scanner / Scan lockfiles (push) Waiting to run
Tests / test (1) (push) Waiting to run
Tests / test (2) (push) Waiting to run
Tests / test (3) (push) Waiting to run
Tests / test (4) (push) Waiting to run
Tests / test (5) (push) Waiting to run
Tests / test (6) (push) Waiting to run
Tests / save-durations (push) Blocked by required conditions
Tests / e2e (push) Waiting to run
uv.lock check / uv lock --check (push) Waiting to run
* feat(tui): HERMES_DEV_CREDITS live-spend dev readout (L0 tracer for usage-aware credits)
L0 of the usage-aware-credits feature: a dev-only, env-gated tracer that
exercises the real header -> CreditsState -> TUI pipe end-to-end behind
HERMES_DEV_CREDITS, de-risking the L1/L5 build before the notice policy exists.
- agent/credits_tracker.py: CreditsState + parse_credits_headers (headers are
strings -> paid_access via == "true", never bool(); retain-last-known; only
subscription_micros may be negative; *_usd kept verbatim).
- run_agent.py: _capture_credits / get_credits_state / get_credits_spent_micros,
session-start baseline latch, + dev-gated "credits" capture log.
- agent/chat_completion_helpers.py: capture on the streaming response.
- agent/agent_init.py: init _credits_state + _credits_session_start_micros.
- tui_gateway/server.py: _get_usage emits dev_credits_spent_micros only when flagged.
- ui-tui appChrome.tsx / types.ts: cents delta status segment + "(dev credits)" banner.
Off by default; silent for normal users. Validated live against staging
(capture log delta matches the TUI segment). Throwaway consumer (readout/log/
banner); credits_tracker + the capture plumbing are the real feature foundation.
* test(credits): lock parser under 9-state matrix + harden validation (L2)
Add tests/agent/test_credits_tracker.py with 92 tests covering the 9-state
matrix (healthy, sub_90pct, grant_exhausted, purchased_only, tool_pool_free,
depleted, debt, missing, no_org) plus validation edge cases: version strict==1
with warn-once latch for v>1, bool-string trap (paid_access/tool_pool_gated_off
== "true"/"false", never bool()), half-pair subscription limit treated as
both-absent while parse succeeds, USD regex ^-?\d+\.\d{2}$, non-int micros
→ None, negative non-subscription micros → None, as_of_ms junk → None, zero
limit ZeroDivision guard.
Harden agent/credits_tracker.py to match the spec:
- Add tool_pool_micros/tool_pool_gated_off/from_header fields to CreditsState
- Add depleted property (== not paid_access, never remaining==0)
- Change used_fraction guard to key off subscription_limit_micros (the actual
denominator) not denominator_kind (metadata)
- Replace fail-soft _safe_int with a sentinel-returning variant; full validation
now returns None on any malformed field rather than silently defaulting
- Add module-level warn-once latch for version > 1
- Add USD regex validation; add denominator_kind allow-list check
- Parse x-nous-tool-pool-* prefix headers (not x-nous-credits-tool-pool-*)
* feat(credits): notice spine — AgentNotice + notice_callback/notice_clear_callback + TUI binding (L1)
L1 of usage-aware credits: the driver-agnostic notice delivery spine that L4's
policy will fire through and L5's TUI render will consume.
- agent/credits_tracker.py: AgentNotice dataclass (text/level/kind/ttl_ms/key/id;
kind defaults "sticky", kept TTL-expressive for a future config seam).
- run_agent.py: AIAgent gains notice_callback + notice_clear_callback slots and
_emit_notice / _emit_notice_clear emitters (swallow all callback errors — a
notice must never break the agent loop; no-op when unbound).
- agent/agent_init.py: thread both callbacks through init_agent.
- tui_gateway/server.py: bind both in _agent_cbs → notification.show / notification.clear
WS events (snake_case payload, matching the existing gateway-event convention).
- ui-tui/src/gatewayTypes.ts: notification.show / notification.clear arms on GatewayEvent.
- tests/run_agent/test_notice_spine.py: 15 tests (emitter fire + fail-open + no-op,
signature threading, TUI binding payload shape).
Messaging push is out of v1 (binds neither callback). CLI binding + the TUI render/
decode land with L4 (firing) and L5 (render) so turn-end flush is wired correctly.
* feat(credits): threshold reconciliation policy + tests (L4.1)
* feat(credits): wire threshold policy into capture + latch (L4.2)
After a fresh header parse, _capture_credits runs evaluate_credits_notices against
the agent's _credits_latch and emits the result — clears first, then shows (so a
recovered depletion clears before the "restored" success lands, and depleted wins
the latest-wins slot). Gated on a bound notice_callback: messaging (no callbacks)
still caches state for /usage but runs no policy. Parse stays fail-open (miss →
keep last-known); the eval/emit path warns on failure rather than swallowing, so a
depletion-notice bug can't vanish silently.
- run_agent.py: _capture_credits split into parse (swallow→miss) + policy (warn);
latch lazy-guarded (object.__new__ safety).
- agent/agent_init.py: init agent._credits_latch = {"active": set(), "seen_below_90": False}.
* feat(tui): render credits notices in the status bar (L5, Strategy B)
The TUI now renders the notification.show / notification.clear gateway events the
agent emits — a level-colored notice overrides the status/verb slot when not busy.
- Notice state machine on turnController (pendingNotice + dedicated noticeTimer +
show/clear/applyNotice/flushPendingNotice/clearNoticeState). createGatewayEventHandler
decodes the events and delegates.
- Render priority busy > notice > status (appChrome StatusRule); notice text rendered
verbatim (its glyph comes from the policy), shrinkable so it never clips model│ctx;
dev-credits banner + Δ segment preserved. UiState.notice is snake_case (matches wire).
- Busy-wins: a notice arriving mid-turn is held and flushed at the THREE turn-end sites
(recordMessageComplete / interruptTurn / recordError) — never idle(), which reset()
also calls (would leak across sessions); reset() clears instead.
- Dedicated noticeTimer (never statusTimer); TTL starts on visibility with an id-guard;
latest-wins cancels the prior timer; clear is key-matched (no-op on mismatch); a sticky
survives a turn (flush no-ops with no pending); session reset clears (no cross-session leak).
- 20 tests (handler/turnController logic incl. R3-C2 timer isolation + render priority).
* feat(credits): cold-start seed for new Nous sessions (L3)
A genuinely-new Nous session has no inference header yet, so seed credits state from
the authoritative GET /api/oauth/account snapshot at session start (in the new-session
branch of _restore_or_build_system_prompt — inline, since the on_session_start plugin
hook gets no agent reference). The seed runs the shared notice policy, so a session that
opens already depleted warns IMMEDIATELY rather than only after the first turn.
- Maps the nested account fields (paid_service_access → paid_access; total_usable /
subscription / purchased on paid_service_access_info; rollover on subscription), each
None-guarded; float dollars → micros via round(d*1e6), *_usd left "" (render formats
from micros — never synthesize a verbatim usd from a float).
- Magnitudes-only: no monthlyCredits on the endpoint → subscription_limit_* unset →
used_fraction None → no warn90 from the seed (% only once a header lands, per D-E).
- Provider-guarded to Nous; fail-open (any error leaves _credits_state None, never
blocks startup); paid_access unknown ⇒ True (never falsely depleted).
- run_agent.py: extracted the warm-path policy/emit block into a shared
_emit_credits_notices() so capture and the seed fire notices identically.
* feat(credits): /usage Nous credits magnitudes view + recovery trigger (L6)
Add Nous credit dollar magnitudes to /usage (subscription / top-up / total
+ rollover + renewal + portal CTA), magnitudes-only per v1 (no % until the
account endpoint exposes a denominator). Reuses the existing account-usage
render machinery via a new pure build_nous_credits_snapshot() that maps a
NousPortalAccountInfo to an AccountUsageSnapshot; no nous branch is added to
fetch_account_usage (keeps the per-provider boundary intact).
CLI /usage also doubles as a depletion-recovery trigger: a force_fresh
account fetch, kept in a SEPARATE local so it never clobbers the
header-sourced agent._credits_state (which alone carries used_fraction). If
paid access recovered while credits.depleted is latched and a notice
consumer is bound, it reuses agent._emit_credits_notices() to clear it.
Gateway /usage displays magnitudes only — messaging binds no notice
consumer, so it performs no recovery emit.
Fail-open throughout: any portal hiccup leaves /usage unaffected.
* refactor(credits): dedupe HERMES_DEV_CREDITS flag parse via shared helpers
The dev-flag truthy check was inlined in three places. Replace with the shared
utils.is_truthy_value (run_agent.py, tui_gateway/server.py — also drops a
redundant inline `import os`) and a hoisted DEV_CREDITS_MODE export in
ui-tui/src/config/env.ts (consumed by appChrome, which also stops recomputing the
env check on every render). Behaviour-preserving; identical truthy set.
* fix(credits): cut dead /usage recovery trigger + bound portal fetches (L6 review)
Adversarial review found the /usage depletion-recovery trigger dead AND broken:
the CLI binds no notice_clear_callback, the TUI runs /usage in a separate
slash-worker subprocess (its own agent/latch), and the no-clobber rule made it
evaluate stale paid_access anyway. Recovery already happens on the next inference
(warm path), so the trigger was redundant — remove it and stop the depleted
notice over-promising.
- cli.py: remove the dead recovery block; bound the /usage portal fetch with a
10s wall-clock timeout (ThreadPoolExecutor) like the per-provider fetch —
urllib's per-socket timeout is not a wall-clock guarantee.
- agent/credits_tracker.py: reword the depleted CTA to "run /usage for balance"
(no false recovery promise; /usage shows fresh magnitudes, sticky clears next turn).
- agent/conversation_loop.py: same wall-clock timeout on the cold-start seed fetch
so a stalled portal can't hang session startup; tidy its time import.
* chore(credits): dev notice-state fixtures (HERMES_DEV_CREDITS_FIXTURE)
Throwaway dev scaffolding to exercise the notice pipeline without real spend or
Redis seeding. Set HERMES_DEV_CREDITS_FIXTURE to a state name (healthy / sub_90pct
/ grant_exhausted / depleted / clear) or a file path whose contents name a state
(re-read each turn → flip states live for recovery testing). _capture_credits
injects the chosen CreditsState instead of parsing real headers and runs the
shared notice policy. Deletable with the rest of the HERMES_DEV_CREDITS scaffolding.
* feat(credits): /usage monthly-grant % gauge
The portal /api/oauth/account subscription block now carries monthly_credits
(the per-period grant allowance, the % denominator). The consumer parsed
monthly_charge but dropped monthly_credits, so /usage stayed magnitudes-only.
Capture monthly_credits into NousPortalSubscriptionInfo + _subscription_from_payload.
build_nous_credits_snapshot emits a Subscription usage window (real % used, routed
through the existing render machinery) when monthly_credits is a finite positive
denominator and credits_remaining is finite and <= cap; otherwise it degrades to
magnitudes-only (older portals, rollover-over-cap, or non-finite payloads).
Guards (adversarial-review-driven): reject non-finite operands (json.loads parses
bare NaN/Infinity by default → would render $nan + a false 100% used), reject
bools, guard div-by-zero (cap>0), and suppress the gauge when remaining > cap
(rollover spanning the period makes the cap a nonsensical denominator → the
$X-of-$Y detail would read as a contradiction). Debt (remaining<0) clamps to 100%.
Money rule preserved: the ratio + magnitudes are computed from numeric float
account fields via display formatting, never by parsing a server *_usd string
(there are none on these dataclasses).
13 gauge tests added (tests/agent/test_nous_credits_gauge.py).
* fix(credits): show /usage Nous block whenever a Nous account is present
/usage runs in a slash-worker subprocess whose resolved inference provider is
often not "nous" even when the user has a Nous account, so gating the Nous
credits block on (provider == "nous") hid it entirely — the account data was
fully available but never rendered.
Gate instead on "a Nous account is logged in": a cheap local auth-state lookup
(get_provider_auth_state('nous') has an access_token) decides whether to attempt
the portal fetch, regardless of which provider inference runs on. In the gateway
the block is also lifted out of the 'if provider:' scope so a Nous-credentialled
user with another (or no) resident inference provider still sees their balance.
Fail-open and the per-fetch wall-clock timeout are preserved.
* fix(credits): show /usage Nous block when there's no live agent (TUI slash-worker)
In the TUI, /usage runs in a slash-worker subprocess that resumes the session
WITHOUT building an agent (self.agent is None), so _show_usage early-returned
"(._.) No active agent" before ever reaching the Nous credits block — which is
agent-independent (a portal fetch gated on Nous auth-state). Extract the block
into _print_nous_credits_block() and run it at the no-agent / no-calls
early-returns too (returns True if it printed, so the fallback message only
shows when there's genuinely nothing).
Verified live against staging: the block + monthly-grant gauge now render in the
slash-worker /usage path (previously hidden). The plain CLI REPL + messaging
paths are unchanged (they have a live agent).
* feat(credits): escalating 50/75/90 usage bands (single status line)
Replace the lone 90%-used warning with three escalating bands (50 info, 75 warn,
90 warn) shown as ONE status-bar line: it displays the highest band the
subscription grant has crossed, replaces the line as usage climbs, steps back
down on recovery, and clears below 50%. No stacking, no per-turn churn.
Bands live in a tunable CREDITS_USAGE_BANDS list; the policy derives everything
from it. Single notice key (credits.usage) with a usage_band latch field so the
notice only re-emits when the band actually changes. The crossing gate
(seen_below_90) is preserved so a fresh live session that opens mid-range stays
quiet until it has been observed below the lowest band (cold-start primes it when
it wants an open-high warning). Denominator math unchanged: % = subscription
grant burn (cap - grant_remaining)/cap, clamped [0,1]; top-up never moves the %.
Migrated test_credits_policy.py to the new key + added TestUsageBands (climb,
step-down, recovery-clear, idempotent, inclusive boundaries).
* feat(credits): hydrate notices at session OPEN via shared seed (TUI + first-turn)
Notices previously only fired inside a conversation turn (first message), so a
session that opened already depleted / past a usage band showed nothing at
'ready'. Extract the cold-start seed into a shared seed_credits_at_session_start()
and call it (a) in the TUI/desktop agent build right after the notice callback is
wired (fires at 'ready', before any message) and (b) as the first-turn fallback in
conversation_loop. Idempotent (skips once _credits_state exists) and fail-open.
The seed now maps monthly_credits -> subscription_limit_micros +
denominator_kind='subscription_cap', so used_fraction is computable at seed time
and usage-band warnings (not just depletion) hydrate on open. Primes the crossing
latch so a session opening already in a band warns immediately. Degrades to
depletion-only when monthly_credits is absent (older portals).
Adds test_credits_cold_start.py covering open-at-band, depletion, debt, no-cap
degradation, and the shared seed (fires/idempotent/skips-non-nous).
* feat(credits): /usage monthly-grant % gauge + fixture support + TUI surfacing
agent/account_usage.py: build_nous_credits_snapshot emits a subscription %% gauge
when the portal supplies a positive, finite monthly_credits denominator with
remaining <= cap (guards reject NaN/Infinity and rollover-over-cap, which would
render $nan or a contradictory $X-of-$Y); degrades to magnitudes-only otherwise.
Adds shared nous_credits_lines() (auth-gated, wall-clock-bounded portal fetch) so
the CLI and TUI /usage render the same block, and _snapshot_from_credits_state()
so HERMES_DEV_CREDITS_FIXTURE drives /usage offline too.
TUI: session.usage RPC carries credits_lines (agent-independent) and the /usage
panel renders them regardless of API-call count or resume state — previously the
TUI's separate /usage implementation only showed token counts.
Money rule preserved: %% and magnitudes come from numeric float account fields via
display formatting, never by parsing a server *_usd string.
* feat(credits): CLI REPL inline notices (parity with TUI)
The plain CLI agent bound no notice callbacks, so credit notices were TUI-only.
Bind notice_callback/notice_clear_callback on the CLI AIAgent; _on_notice renders
a single level-colored line above the prompt (error red / warn yellow / success
green / info dim) via _cprint, and seed credits at session open so a depletion or
usage-band warning shows before the first message — the same hydration the TUI
got. _on_notice_clear is a no-op (the REPL prints lines, no persistent slot).
* test(credits): add sub_50pct + sub_75pct dev fixtures for the new usage bands
The fixture set jumped 10%% -> 90%%; add sub_50pct (uf 0.5 -> band 50 info) and
sub_75pct (uf 0.75 -> band 75 warn) so the new escalating bands are exercisable
via HERMES_DEV_CREDITS_FIXTURE across all three surfaces (notice, session-open
seed, /usage gauge).
* fix(credits): usage-band notice clears on next prompt (not sticky-forever)
A 50/75/90 usage heads-up was sticky and camped the status bar indefinitely. Clear
the visible credits.usage notice when a new turn starts (startMessage), so it shows
until your next prompt then yields. The server latch is unchanged, so it won't
re-nag at the same band — it only re-shows when the band actually changes (climb)
or clears when usage drops below the lowest band. Depletion stays sticky.
* refactor(credits): consolidate the /usage credits block behind nous_credits_lines()
The CLI (_print_nous_credits_block) and the messaging gateway (_handle_usage_command)
each re-implemented the auth-gate + portal fetch + render, and both bypassed the
dev-fixture short-circuit that only the TUI honored — so /usage ignored
HERMES_DEV_CREDITS_FIXTURE on the CLI and in chat. Route both through the shared
agent.account_usage.nous_credits_lines() helper: one fetch/render path, one auth
gate, and the fixture works on every surface (~60 fewer duplicated lines).
The gateway usage test recorded only the last asyncio.to_thread call; /usage now
dispatches both the account fetch and the credits fetch, so it records every call
and matches the account fetch by its provider arg.
* fix(credits): keep the /usage gauge type-safe and log its fail-open path
_is_finite_num is now a TypeGuard[float], so the type checker narrows the gauge
operands (monthly_credits / credits_remaining) and the magnitudes passed to
_fmt_usd through it — no more None-operand warnings on the arithmetic. Add a debug
breadcrumb on the nous_credits_lines portal-fetch fail-open so a dead /usage block
is diagnosable in agent.log without a dev flag.
* fix(credits): harden the header tracker — prod-leak gate, hot-path probe, fire-and-forget seed
- Prod-leak guard: dev fixtures (HERMES_DEV_CREDITS_FIXTURE) now also require
HERMES_DEV_CREDITS, so a stray fixture var can't surface fabricated balances on a
real account. Matches the documented run workflow (both vars set together).
- Hot-path probe: parse_credits_headers checks for the version sentinel header
before allocating a lowercased copy of the response headers — skips that work on
every non-Nous API call. Behaviour-identical and still case-insensitive.
- Fire-and-forget seed: the real portal fetch in seed_credits_at_session_start now
runs in a daemon thread, so a slow/unreachable portal never delays session "ready"
(previously blocked up to 10s). The dev-fixture path stays synchronous; the thread
re-checks idempotency before hydrating (a live header may land first).
- Diagnostics: debug breadcrumbs on the parse and seed fail-open paths so a crashed
parser / dead seed is distinguishable from a legitimate no-headers miss.
Cold-start tests set HERMES_DEV_CREDITS alongside the fixture to match the gate.
* test(tui): fix env-timing in the StatusRule dev-credits assertion
DEV_CREDITS_MODE is read once at module load (config/env), so mutating
process.env.HERMES_DEV_CREDITS inside the test couldn't flip it — the dev-banner
assertion only passed if the env was exported before vitest started, and failed in a
normal run. Move that assertion to a sibling file that mocks config/env with
DEV_CREDITS_MODE: true (scoped, no module-reset / React-identity hazard).
* test(credits): cover the dev-fixture /usage render and usage-band clear-on-prompt
- _snapshot_from_credits_state (the offline /usage renderer) had no direct test:
lock the gauge math, the verbatim *_usd magnitudes, the depletion line and the
fixture marker, plus the no-cap (no gauge) and None-state cases.
- turnController.startMessage had no test for clearing the credits.usage notice on
the next prompt while leaving credits.depleted sticky.
* feat(credits): deliver credit notices over messaging gateways
Bind notice_callback/notice_clear_callback on the per-turn gateway agent
so usage-band / depletion / restored notices reach Telegram/Discord/Slack/
etc. Previously the messaging gateway bound neither callback, so the agent's
_emit_credits_notices early-returned and a chat user crossing a band got
nothing unless they ran /usage manually.
- render_notice_line(): AgentNotice -> single plaintext line (level glyph +
text), plaintext-only so it renders uniformly without per-platform escaping.
Fail-soft on malformed/empty notices.
- Standalone push for every notice (messaging has no persistent status bar):
route through the shared _deliver_platform_notice rail (honors private/
public delivery + thread metadata), scheduled onto the gateway loop via
safe_schedule_threadsafe from the agent's sync worker thread — same pattern
as _status_callback_sync.
- The fired-once latch lives on the cached (reused-in-place) agent and
persists across turns, so a band crosses once -> one push, no per-turn
re-nag. Re-fires only after idle-eviction rebuilds the agent (a reminder).
- Recovery ('Credit access restored') rides the show path (emitted as a
success notice, not a clear). notice_clear_callback is a no-op: a sent
platform message can't be cleanly retracted.
Tests: render glyph/levels/fail-soft + public/private delivery seam through
_deliver_platform_notice + no-adapter no-op.
* fix(credits): don't double the glyph on messaging notices
render_notice_line prepended a per-level glyph, but the notice policy already
bakes the glyph into the text (and the TUI + CLI render it verbatim) — so every
credit notice over messaging came out doubled ("⚠ ⚠ Credits 90% used",
"⛔ ✕ Credit access paused"). Emit the text verbatim instead; drop the now-dead
level→glyph map.
The render tests fed glyph-less text (and the success case only checked
startswith), so the doubling slipped through. Rework them around the verbatim
contract and add an end-to-end regression that runs real evaluate_credits_notices
output through render_notice_line and asserts the line is returned unchanged.
550 lines
21 KiB
Python
550 lines
21 KiB
Python
from __future__ import annotations
|
|
|
|
import logging
|
|
import math
|
|
from dataclasses import dataclass
|
|
from datetime import datetime, timezone
|
|
from typing import TYPE_CHECKING, Any, Optional
|
|
|
|
import httpx
|
|
|
|
from agent.anthropic_adapter import _is_oauth_token, resolve_anthropic_token
|
|
from hermes_cli.auth import _read_codex_tokens, resolve_codex_runtime_credentials
|
|
from hermes_cli.runtime_provider import resolve_runtime_provider
|
|
|
|
if TYPE_CHECKING:
|
|
from typing import TypeGuard
|
|
|
|
logger = logging.getLogger(__name__)
|
|
|
|
|
|
def _utc_now() -> datetime:
|
|
return datetime.now(timezone.utc)
|
|
|
|
|
|
@dataclass(frozen=True)
|
|
class AccountUsageWindow:
|
|
label: str
|
|
used_percent: Optional[float] = None
|
|
reset_at: Optional[datetime] = None
|
|
detail: Optional[str] = None
|
|
|
|
|
|
@dataclass(frozen=True)
|
|
class AccountUsageSnapshot:
|
|
provider: str
|
|
source: str
|
|
fetched_at: datetime
|
|
title: str = "Account limits"
|
|
plan: Optional[str] = None
|
|
windows: tuple[AccountUsageWindow, ...] = ()
|
|
details: tuple[str, ...] = ()
|
|
unavailable_reason: Optional[str] = None
|
|
|
|
@property
|
|
def available(self) -> bool:
|
|
return bool(self.windows or self.details) and not self.unavailable_reason
|
|
|
|
|
|
def _title_case_slug(value: Optional[str]) -> Optional[str]:
|
|
cleaned = str(value or "").strip()
|
|
if not cleaned:
|
|
return None
|
|
return cleaned.replace("_", " ").replace("-", " ").title()
|
|
|
|
|
|
def _parse_dt(value: Any) -> Optional[datetime]:
|
|
if value in {None, ""}:
|
|
return None
|
|
if isinstance(value, (int, float)):
|
|
return datetime.fromtimestamp(float(value), tz=timezone.utc)
|
|
if isinstance(value, str):
|
|
text = value.strip()
|
|
if not text:
|
|
return None
|
|
if text.endswith("Z"):
|
|
text = text[:-1] + "+00:00"
|
|
try:
|
|
dt = datetime.fromisoformat(text)
|
|
return dt if dt.tzinfo else dt.replace(tzinfo=timezone.utc)
|
|
except ValueError:
|
|
return None
|
|
return None
|
|
|
|
|
|
def _format_reset(dt: Optional[datetime]) -> str:
|
|
if not dt:
|
|
return "unknown"
|
|
local_dt = dt.astimezone()
|
|
delta = dt - _utc_now()
|
|
total_seconds = int(delta.total_seconds())
|
|
if total_seconds <= 0:
|
|
return f"now ({local_dt.strftime('%Y-%m-%d %H:%M %Z')})"
|
|
hours, rem = divmod(total_seconds, 3600)
|
|
minutes = rem // 60
|
|
if hours >= 24:
|
|
days, hours = divmod(hours, 24)
|
|
rel = f"in {days}d {hours}h"
|
|
elif hours > 0:
|
|
rel = f"in {hours}h {minutes}m"
|
|
else:
|
|
rel = f"in {minutes}m"
|
|
return f"{rel} ({local_dt.strftime('%Y-%m-%d %H:%M %Z')})"
|
|
|
|
|
|
def render_account_usage_lines(snapshot: Optional[AccountUsageSnapshot], *, markdown: bool = False) -> list[str]:
|
|
if not snapshot:
|
|
return []
|
|
header = f"📈 {'**' if markdown else ''}{snapshot.title}{'**' if markdown else ''}"
|
|
lines = [header]
|
|
if snapshot.plan:
|
|
lines.append(f"Provider: {snapshot.provider} ({snapshot.plan})")
|
|
else:
|
|
lines.append(f"Provider: {snapshot.provider}")
|
|
for window in snapshot.windows:
|
|
if window.used_percent is None:
|
|
base = f"{window.label}: unavailable"
|
|
else:
|
|
remaining = max(0, round(100 - float(window.used_percent)))
|
|
used = max(0, round(float(window.used_percent)))
|
|
base = f"{window.label}: {remaining}% remaining ({used}% used)"
|
|
if window.reset_at:
|
|
base += f" • resets {_format_reset(window.reset_at)}"
|
|
elif window.detail:
|
|
base += f" • {window.detail}"
|
|
lines.append(base)
|
|
for detail in snapshot.details:
|
|
lines.append(detail)
|
|
if snapshot.unavailable_reason:
|
|
lines.append(f"Unavailable: {snapshot.unavailable_reason}")
|
|
return lines
|
|
|
|
|
|
def _fmt_usd(d: float) -> str:
|
|
return f"${d:,.2f}"
|
|
|
|
|
|
def _is_finite_num(v: Any) -> TypeGuard[float]:
|
|
"""True iff v is a real numeric value (int or float, not bool, not NaN/Inf).
|
|
|
|
Typed as a ``TypeGuard[float]`` so the type checker narrows ``v`` to a real
|
|
number in the positive branch — callers can then do arithmetic / pass it to
|
|
``_fmt_usd`` without a None-operand warning.
|
|
"""
|
|
return isinstance(v, (int, float)) and not isinstance(v, bool) and math.isfinite(v)
|
|
|
|
|
|
def build_nous_credits_snapshot(account_info) -> Optional[AccountUsageSnapshot]:
|
|
"""Map a NousPortalAccountInfo into an AccountUsageSnapshot for /usage.
|
|
|
|
Shows dollar magnitudes (subscription / top-up / total) + renewal date + a
|
|
portal CTA. When the portal supplies a subscription denominator
|
|
(``monthly_credits``), also emits a subscription-usage window so the renderer
|
|
shows a real ``% used`` gauge; when it's absent (older portals) the view
|
|
gracefully degrades to magnitudes-only. Returns None when there's no usable
|
|
account info to show (fail-open: caller just shows nothing).
|
|
"""
|
|
try:
|
|
from hermes_cli.nous_account import nous_portal_billing_url
|
|
|
|
if account_info is None or not getattr(account_info, "logged_in", False):
|
|
return None
|
|
|
|
access = getattr(account_info, "paid_service_access_info", None)
|
|
sub = getattr(account_info, "subscription", None)
|
|
|
|
windows: list[AccountUsageWindow] = []
|
|
details: list[str] = []
|
|
|
|
# Subscription usage gauge — only when the portal supplies a positive
|
|
# monthly_credits denominator AND a finite remaining balance that does
|
|
# not exceed the cap. Money math is on float dollars (allowed: numeric
|
|
# account fields, NOT a server-provided *_usd string). used = cap -
|
|
# remaining; clamp [0,100] so a debt balance (remaining < 0) reads 100%.
|
|
# Excluded on purpose:
|
|
# - non-finite values (NaN/Infinity slip past isinstance and json.loads
|
|
# parses bare NaN/Infinity by default) → would render "$nan"/"$inf"
|
|
# and a falsely-confident gauge;
|
|
# - remaining > cap (rollover balance spanning the period) → monthly_credits
|
|
# is no longer a meaningful denominator, and "$X of $Y left" with X>Y
|
|
# reads as a contradiction. Both fall back to the magnitudes lines.
|
|
if sub is not None:
|
|
monthly_credits = getattr(sub, "monthly_credits", None)
|
|
sub_remaining = getattr(sub, "credits_remaining", None)
|
|
if (
|
|
_is_finite_num(monthly_credits)
|
|
and monthly_credits > 0
|
|
and _is_finite_num(sub_remaining)
|
|
and sub_remaining <= monthly_credits
|
|
):
|
|
used = monthly_credits - sub_remaining
|
|
used_pct = max(0.0, min(100.0, used / monthly_credits * 100.0))
|
|
windows.append(
|
|
AccountUsageWindow(
|
|
label="Subscription",
|
|
used_percent=used_pct,
|
|
detail=f"{_fmt_usd(sub_remaining)} of {_fmt_usd(monthly_credits)} left",
|
|
)
|
|
)
|
|
|
|
if access is not None:
|
|
sub_credits = getattr(access, "subscription_credits_remaining", None)
|
|
if _is_finite_num(sub_credits):
|
|
details.append(f"Subscription credits: {_fmt_usd(sub_credits)}")
|
|
purchased = getattr(access, "purchased_credits_remaining", None)
|
|
if _is_finite_num(purchased):
|
|
details.append(f"Top-up credits: {_fmt_usd(purchased)}")
|
|
total_usable = getattr(access, "total_usable_credits", None)
|
|
if _is_finite_num(total_usable):
|
|
details.append(f"Total usable: {_fmt_usd(total_usable)}")
|
|
|
|
if sub is not None:
|
|
rollover = getattr(sub, "rollover_credits", None)
|
|
if _is_finite_num(rollover) and rollover > 0:
|
|
details.append(f"Rollover: {_fmt_usd(rollover)}")
|
|
period_end = getattr(sub, "current_period_end", None)
|
|
if period_end:
|
|
details.append(f"Renews: {period_end}")
|
|
|
|
paid = getattr(account_info, "paid_service_access", None)
|
|
if paid is False:
|
|
details.append("Status: access depleted — top up to restore")
|
|
|
|
if not windows and not details:
|
|
return None
|
|
|
|
details.append(f"Manage / top up: {nous_portal_billing_url(account_info)}")
|
|
|
|
plan = getattr(sub, "plan", None) if sub is not None else None
|
|
return AccountUsageSnapshot(
|
|
provider="nous",
|
|
source="portal-account",
|
|
fetched_at=_utc_now(),
|
|
title="Nous credits",
|
|
plan=plan,
|
|
windows=tuple(windows),
|
|
details=tuple(details),
|
|
)
|
|
except (AttributeError, TypeError):
|
|
return None
|
|
|
|
|
|
def nous_credits_lines(*, markdown: bool = False, timeout: float = 10.0) -> list[str]:
|
|
"""Return rendered Nous-credits /usage lines, or [] when there's nothing to show.
|
|
|
|
Account-independent of any live agent: gated on "a Nous account is logged in"
|
|
(a cheap local auth-state check), then a wall-clock-bounded portal fetch. Shared
|
|
by the CLI ``_show_usage`` and the TUI ``session.usage`` RPC so both surfaces show
|
|
the same block regardless of session API-call count or resume state. Fail-open:
|
|
any auth/portal hiccup or timeout returns [] (the caller shows nothing).
|
|
|
|
Dev override: when HERMES_DEV_CREDITS_FIXTURE selects a fixture state, /usage
|
|
renders from that fixture instead of the real portal (so the block + gauge are
|
|
testable without a live account). Throwaway scaffolding.
|
|
"""
|
|
# Dev fixture short-circuit — render /usage from the injected state, no portal.
|
|
try:
|
|
from agent.credits_tracker import dev_fixture_credits_state
|
|
|
|
fixture = dev_fixture_credits_state()
|
|
except Exception:
|
|
fixture = None
|
|
if fixture is not None:
|
|
snapshot = _snapshot_from_credits_state(fixture)
|
|
return render_account_usage_lines(snapshot, markdown=markdown)
|
|
|
|
try:
|
|
from hermes_cli.auth import get_provider_auth_state
|
|
|
|
tok = (get_provider_auth_state("nous") or {}).get("access_token")
|
|
if not (isinstance(tok, str) and tok.strip()):
|
|
return []
|
|
except Exception:
|
|
return []
|
|
try:
|
|
import concurrent.futures
|
|
|
|
from hermes_cli.nous_account import get_nous_portal_account_info
|
|
|
|
with concurrent.futures.ThreadPoolExecutor(max_workers=1) as pool:
|
|
account = pool.submit(
|
|
get_nous_portal_account_info, force_fresh=True
|
|
).result(timeout=timeout)
|
|
snapshot = build_nous_credits_snapshot(account)
|
|
return render_account_usage_lines(snapshot, markdown=markdown)
|
|
except Exception:
|
|
# Fail-open (caller shows nothing), but leave a breadcrumb so a dead
|
|
# /usage credits block is diagnosable in agent.log without a dev flag.
|
|
logger.debug("credits ▸ /usage portal fetch/render failed (fail-open)", exc_info=True)
|
|
return []
|
|
|
|
|
|
def _snapshot_from_credits_state(state) -> Optional[AccountUsageSnapshot]:
|
|
"""Map a header-shaped CreditsState (e.g. a dev fixture) to the /usage snapshot.
|
|
|
|
Renders the same magnitudes + monthly-grant % window the portal path produces,
|
|
so HERMES_DEV_CREDITS_FIXTURE can exercise /usage without a live account. The
|
|
*_usd strings are mock display values here (not server balance to compute on);
|
|
the % comes from CreditsState.used_fraction (micros math). Fail-open → None.
|
|
"""
|
|
try:
|
|
if state is None:
|
|
return None
|
|
|
|
windows: list[AccountUsageWindow] = []
|
|
details: list[str] = []
|
|
|
|
uf = getattr(state, "used_fraction", None)
|
|
if isinstance(uf, (int, float)) and math.isfinite(uf):
|
|
cap_usd = getattr(state, "subscription_limit_usd", None)
|
|
sub_usd = getattr(state, "subscription_usd", None)
|
|
detail = None
|
|
if sub_usd and cap_usd:
|
|
detail = f"${sub_usd} of ${cap_usd} left"
|
|
windows.append(
|
|
AccountUsageWindow(
|
|
label="Subscription",
|
|
used_percent=max(0.0, min(100.0, uf * 100.0)),
|
|
detail=detail,
|
|
)
|
|
)
|
|
|
|
sub_usd = getattr(state, "subscription_usd", None)
|
|
if sub_usd:
|
|
details.append(f"Subscription credits: ${sub_usd}")
|
|
purchased_usd = getattr(state, "purchased_usd", None)
|
|
if purchased_usd:
|
|
details.append(f"Top-up credits: ${purchased_usd}")
|
|
remaining_usd = getattr(state, "remaining_usd", None)
|
|
if remaining_usd:
|
|
details.append(f"Total usable: ${remaining_usd}")
|
|
if getattr(state, "paid_access", True) is False:
|
|
details.append("Status: access depleted — top up to restore")
|
|
|
|
if not windows and not details:
|
|
return None
|
|
|
|
details.append("(dev fixture — HERMES_DEV_CREDITS_FIXTURE)")
|
|
return AccountUsageSnapshot(
|
|
provider="nous",
|
|
source="dev-fixture",
|
|
fetched_at=_utc_now(),
|
|
title="Nous credits",
|
|
windows=tuple(windows),
|
|
details=tuple(details),
|
|
)
|
|
except (AttributeError, TypeError):
|
|
return None
|
|
|
|
|
|
def _resolve_codex_usage_url(base_url: str) -> str:
|
|
normalized = (base_url or "").strip().rstrip("/")
|
|
if not normalized:
|
|
normalized = "https://chatgpt.com/backend-api/codex"
|
|
if normalized.endswith("/codex"):
|
|
normalized = normalized[: -len("/codex")]
|
|
if "/backend-api" in normalized:
|
|
return normalized + "/wham/usage"
|
|
return normalized + "/api/codex/usage"
|
|
|
|
|
|
def _fetch_codex_account_usage() -> Optional[AccountUsageSnapshot]:
|
|
creds = resolve_codex_runtime_credentials(refresh_if_expiring=True)
|
|
token_data = _read_codex_tokens()
|
|
tokens = token_data.get("tokens") or {}
|
|
account_id = str(tokens.get("account_id", "") or "").strip() or None
|
|
headers = {
|
|
"Authorization": f"Bearer {creds['api_key']}",
|
|
"Accept": "application/json",
|
|
"User-Agent": "codex-cli",
|
|
}
|
|
if account_id:
|
|
headers["ChatGPT-Account-Id"] = account_id
|
|
with httpx.Client(timeout=15.0) as client:
|
|
response = client.get(_resolve_codex_usage_url(creds.get("base_url", "")), headers=headers)
|
|
response.raise_for_status()
|
|
payload = response.json() or {}
|
|
rate_limit = payload.get("rate_limit") or {}
|
|
windows: list[AccountUsageWindow] = []
|
|
for key, label in (("primary_window", "Session"), ("secondary_window", "Weekly")):
|
|
window = rate_limit.get(key) or {}
|
|
used = window.get("used_percent")
|
|
if used is None:
|
|
continue
|
|
windows.append(
|
|
AccountUsageWindow(
|
|
label=label,
|
|
used_percent=float(used),
|
|
reset_at=_parse_dt(window.get("reset_at")),
|
|
)
|
|
)
|
|
details: list[str] = []
|
|
credits = payload.get("credits") or {}
|
|
if credits.get("has_credits"):
|
|
balance = credits.get("balance")
|
|
if isinstance(balance, (int, float)):
|
|
details.append(f"Credits balance: ${float(balance):.2f}")
|
|
elif credits.get("unlimited"):
|
|
details.append("Credits balance: unlimited")
|
|
return AccountUsageSnapshot(
|
|
provider="openai-codex",
|
|
source="usage_api",
|
|
fetched_at=_utc_now(),
|
|
plan=_title_case_slug(payload.get("plan_type")),
|
|
windows=tuple(windows),
|
|
details=tuple(details),
|
|
)
|
|
|
|
|
|
def _fetch_anthropic_account_usage() -> Optional[AccountUsageSnapshot]:
|
|
token = (resolve_anthropic_token() or "").strip()
|
|
if not token:
|
|
return None
|
|
if not _is_oauth_token(token):
|
|
return AccountUsageSnapshot(
|
|
provider="anthropic",
|
|
source="oauth_usage_api",
|
|
fetched_at=_utc_now(),
|
|
unavailable_reason="Anthropic account limits are only available for OAuth-backed Claude accounts.",
|
|
)
|
|
headers = {
|
|
"Authorization": f"Bearer {token}",
|
|
"Accept": "application/json",
|
|
"Content-Type": "application/json",
|
|
"anthropic-beta": "oauth-2025-04-20",
|
|
"User-Agent": "claude-code/2.1.0",
|
|
}
|
|
with httpx.Client(timeout=15.0) as client:
|
|
response = client.get("https://api.anthropic.com/api/oauth/usage", headers=headers)
|
|
response.raise_for_status()
|
|
payload = response.json() or {}
|
|
windows: list[AccountUsageWindow] = []
|
|
mapping = (
|
|
("five_hour", "Current session"),
|
|
("seven_day", "Current week"),
|
|
("seven_day_opus", "Opus week"),
|
|
("seven_day_sonnet", "Sonnet week"),
|
|
)
|
|
for key, label in mapping:
|
|
window = payload.get(key) or {}
|
|
util = window.get("utilization")
|
|
if util is None:
|
|
continue
|
|
used = float(util) * 100 if float(util) <= 1 else float(util)
|
|
windows.append(
|
|
AccountUsageWindow(
|
|
label=label,
|
|
used_percent=used,
|
|
reset_at=_parse_dt(window.get("resets_at")),
|
|
)
|
|
)
|
|
details: list[str] = []
|
|
extra = payload.get("extra_usage") or {}
|
|
if extra.get("is_enabled"):
|
|
used_credits = extra.get("used_credits")
|
|
monthly_limit = extra.get("monthly_limit")
|
|
currency = extra.get("currency") or "USD"
|
|
if isinstance(used_credits, (int, float)) and isinstance(monthly_limit, (int, float)):
|
|
details.append(
|
|
f"Extra usage: {used_credits:.2f} / {monthly_limit:.2f} {currency}"
|
|
)
|
|
return AccountUsageSnapshot(
|
|
provider="anthropic",
|
|
source="oauth_usage_api",
|
|
fetched_at=_utc_now(),
|
|
windows=tuple(windows),
|
|
details=tuple(details),
|
|
)
|
|
|
|
|
|
def _fetch_openrouter_account_usage(base_url: Optional[str], api_key: Optional[str]) -> Optional[AccountUsageSnapshot]:
|
|
runtime = resolve_runtime_provider(
|
|
requested="openrouter",
|
|
explicit_base_url=base_url,
|
|
explicit_api_key=api_key,
|
|
)
|
|
token = str(runtime.get("api_key", "") or "").strip()
|
|
if not token:
|
|
return None
|
|
normalized = str(runtime.get("base_url", "") or "").rstrip("/")
|
|
credits_url = f"{normalized}/credits"
|
|
key_url = f"{normalized}/key"
|
|
headers = {
|
|
"Authorization": f"Bearer {token}",
|
|
"Accept": "application/json",
|
|
}
|
|
with httpx.Client(timeout=10.0) as client:
|
|
credits_resp = client.get(credits_url, headers=headers)
|
|
credits_resp.raise_for_status()
|
|
credits = (credits_resp.json() or {}).get("data") or {}
|
|
try:
|
|
key_resp = client.get(key_url, headers=headers)
|
|
key_resp.raise_for_status()
|
|
key_data = (key_resp.json() or {}).get("data") or {}
|
|
except Exception:
|
|
key_data = {}
|
|
total_credits = float(credits.get("total_credits") or 0.0)
|
|
total_usage = float(credits.get("total_usage") or 0.0)
|
|
details = [f"Credits balance: ${max(0.0, total_credits - total_usage):.2f}"]
|
|
windows: list[AccountUsageWindow] = []
|
|
limit = key_data.get("limit")
|
|
limit_remaining = key_data.get("limit_remaining")
|
|
limit_reset = str(key_data.get("limit_reset") or "").strip()
|
|
usage = key_data.get("usage")
|
|
if (
|
|
isinstance(limit, (int, float))
|
|
and float(limit) > 0
|
|
and isinstance(limit_remaining, (int, float))
|
|
and 0 <= float(limit_remaining) <= float(limit)
|
|
):
|
|
limit_value = float(limit)
|
|
remaining_value = float(limit_remaining)
|
|
used_percent = ((limit_value - remaining_value) / limit_value) * 100
|
|
detail_parts = [f"${remaining_value:.2f} of ${limit_value:.2f} remaining"]
|
|
if limit_reset:
|
|
detail_parts.append(f"resets {limit_reset}")
|
|
windows.append(
|
|
AccountUsageWindow(
|
|
label="API key quota",
|
|
used_percent=used_percent,
|
|
detail=" • ".join(detail_parts),
|
|
)
|
|
)
|
|
if isinstance(usage, (int, float)):
|
|
usage_parts = [f"API key usage: ${float(usage):.2f} total"]
|
|
for value, label in (
|
|
(key_data.get("usage_daily"), "today"),
|
|
(key_data.get("usage_weekly"), "this week"),
|
|
(key_data.get("usage_monthly"), "this month"),
|
|
):
|
|
if isinstance(value, (int, float)) and float(value) > 0:
|
|
usage_parts.append(f"${float(value):.2f} {label}")
|
|
details.append(" • ".join(usage_parts))
|
|
return AccountUsageSnapshot(
|
|
provider="openrouter",
|
|
source="credits_api",
|
|
fetched_at=_utc_now(),
|
|
windows=tuple(windows),
|
|
details=tuple(details),
|
|
)
|
|
|
|
|
|
def fetch_account_usage(
|
|
provider: Optional[str],
|
|
*,
|
|
base_url: Optional[str] = None,
|
|
api_key: Optional[str] = None,
|
|
) -> Optional[AccountUsageSnapshot]:
|
|
normalized = str(provider or "").strip().lower()
|
|
if normalized in {"", "auto", "custom"}:
|
|
return None
|
|
try:
|
|
if normalized == "openai-codex":
|
|
return _fetch_codex_account_usage()
|
|
if normalized == "anthropic":
|
|
return _fetch_anthropic_account_usage()
|
|
if normalized == "openrouter":
|
|
return _fetch_openrouter_account_usage(base_url, api_key)
|
|
except Exception:
|
|
return None
|
|
return None
|