hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-04 12:33:08 +00:00

History

Teknium 8f19078c6a feat(goals): /subgoal — user-added criteria appended to active /goal (#25449 ) * feat(goals): /subgoal — user-added criteria appended to active /goal Layers a /subgoal command on top of the existing freeform Ralph judge loop. The user can append extra criteria mid-loop; the judge factors them into its done/continue verdict and the continuation prompt surfaces them to the agent. No new tool, no agent self-judging — the existing judge model just sees a richer prompt. Forms: /subgoal show current subgoals /subgoal <text> append a criterion /subgoal remove <n> drop subgoal n (1-based) /subgoal clear wipe all subgoals How it integrates: - GoalState gains `subgoals: List[str]` (default []), backwards-compat for existing state_meta rows. - judge_goal accepts an optional subgoals kwarg; non-empty switches to JUDGE_USER_PROMPT_WITH_SUBGOALS_TEMPLATE which lists them as numbered criteria and asks 'is the goal AND every additional criterion satisfied?' - next_continuation_prompt picks CONTINUATION_PROMPT_WITH_SUBGOALS_TEMPLATE when non-empty so the agent sees what to target. - /subgoal is allowed mid-run on the gateway since it only touches the state the judge reads at turn boundary — no race with the running turn. - Status line shows '... , N subgoals' when present. Surface: - hermes_cli/goals.py — field, prompt blocks, manager methods, judge weave - hermes_cli/commands.py — /subgoal CommandDef - cli.py — _handle_subgoal_command - gateway/run.py — _handle_subgoal_command + mid-run dispatch - tests/hermes_cli/test_goals.py — 15 new tests (backcompat, mutation, persistence, prompt template selection, judge-prompt content via mock, status-line rendering) 77 goal-related tests passing across goals + cli + gateway + tui. * fix(goals): slash commands don't preempt the goal-continuation hook Two findings from live-testing /subgoal: 1. Slash commands queued while the agent is running landed in _pending_input (same queue as real user messages). The goal hook's 'is a real user message pending?' check returned True and silently skipped — but the slash command consumes its queue slot via process_command() which never re-fires the goal hook, so the loop stalls indefinitely. Now the hook peeks the queue and only defers when a non-slash payload is present. 2. The with-subgoals judge prompt was too soft — opus 4.7 said 'done, implying all requirements met' without verifying. Tightened to demand specific per-criterion evidence (file contents, output line, command result) and explicitly reject phrases like 'implying it was done.' Live verified: /subgoal injected mid-loop now correctly forces the judge to refuse done until the new criterion is met. Agent gets the continuation prompt with subgoals listed, updates the script, judge confirms done with specific evidence cited.		2026-05-13 22:55:09 -07:00
..
assets	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
builtin_hooks	remove: BOOT.md built-in hook (#17093 )	2026-04-28 09:50:27 -07:00
platforms	feat(discord): add thread_require_mention for multi-bot threads	2026-05-13 22:21:43 -07:00
__init__.py	docs(gateway): mention Weixin in gateway help and docstrings	2026-05-12 17:08:51 -07:00
channel_directory.py	feat: complete plugin platform parity — all 12 integration points	2026-04-29 21:56:51 -07:00
config.py	feat(discord): add thread_require_mention for multi-bot threads	2026-05-13 22:21:43 -07:00
delivery.py	fix(gateway): preserve case-sensitive chat IDs in DeliveryTarget.parse	2026-05-01 14:01:26 -07:00
display_config.py	chore: ruff auto-fix PLR6201 — tuple → set in membership tests (#23937 )	2026-05-11 11:13:25 -07:00
hooks.py	fix(plugins): register dynamically-loaded modules in sys.modules before exec	2026-04-29 23:34:35 -07:00
mirror.py	fix(gateway): avoid cross-user mirror writes in per-user group sessions	2026-04-26 18:31:24 -07:00
pairing.py	fix(pairing): enforce lockout on approve_code, not just generate_code (#10195 ) (#21325 )	2026-05-07 07:18:21 -07:00
platform_registry.py	refactor(plugins): add apply_yaml_config_fn registry hook	2026-05-13 22:20:30 -07:00
restart.py	fix(gateway): address restart review feedback	2026-04-10 21:18:34 -07:00
run.py	feat(goals): /subgoal — user-added criteria appended to active /goal (#25449 )	2026-05-13 22:55:09 -07:00
runtime_footer.py	feat(gateway): opt-in runtime-metadata footer on final replies (#17026 )	2026-04-28 06:50:04 -07:00
session.py	chore: ruff auto-fix PLR6201 — tuple → set in membership tests (#23937 )	2026-05-11 11:13:25 -07:00
session_context.py	feat: expose HERMES_SESSION_ID to agent tools via ContextVar + env (#23847 )	2026-05-12 00:16:45 +05:30
shutdown_forensics.py	chore: ruff auto-fixes — collapsible-else-if, if-stmt-min-max, dict.fromkeys (#23926 )	2026-05-11 11:03:29 -07:00
slash_access.py	feat(gateway): per-platform admin/user split for slash commands (salvage of #4443 ) (#23373 )	2026-05-10 12:33:54 -07:00
status.py	fix(gateway): consult lock record argv when cmdline unreadable in scoped-lock stale check	2026-05-12 16:33:09 -07:00
sticker_cache.py	chore: remove ~100 unused imports across 55 files (#3016 )	2026-03-25 15:02:03 -07:00
stream_consumer.py	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
whatsapp_identity.py	fix(whatsapp_identity): pin identifier regex to ASCII, clarify it's defense-in-depth	2026-04-26 20:48:31 -07:00