hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-24 10:52:21 +00:00

Teknium 2ba1cfeb2e feat(goals): completion contracts for /goal — evidence-based judging (#50501)	2026-06-22 12:20:09 -07:00

Adds an optional structured completion contract to the standing-goal loop, adapted from OpenAI Codex's /goal guidance (a durable objective works best when it names what done means, how to prove it, what not to break, what's in scope, and when to stop).

A contract has five optional fields: outcome, verification, constraints, boundaries, stop_when. When set, the continuation prompt tells the agent to target the verification surface and respect constraints, and the judge marks the goal done only when the verification criterion is met with concrete evidence (command result, file excerpt, test output) instead of a loose "looks done" claim. This tightens the most common /goal failure mode: premature completion / endless over-continuation on an underspecified goal.

Two ways to set a contract, both backward compatible (bare /goal <text> behaves exactly as before):
- /goal draft <objective>: expands plain text into a full contract via the goal_judge aux model (cache-safe side call), falls back to a free-form goal if the model is unavailable.
- /goal <text> with inline 'field: value' lines (verify:, constraints:, boundaries:, stop when:, ...). Plain goals with an incidental colon are not mangled; only known field prefixes are pulled out.
- /goal show prints the active contract.

Contracts persist in SessionDB.state_meta alongside the goal (survive /resume), compose with /subgoal criteria, and old goal rows load unchanged. CLI + every gateway platform via the shared GoalManager engine; zero new model tools.

Tests: +18 in tests/hermes_cli/test_goals.py (parse/serialize/judge-prompt/draft/fallback), 73/73 green; 42/42 across the broader goal test surface; live E2E roundtrip (set -> persist -> reload -> contract-aware prompts) green.
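The inline 'field: value' behavior described in the commit message can be sketched roughly as follows. This is an illustrative sketch only, not the code in slash_commands.py or the GoalManager engine: the names `Contract`, `FIELD_PREFIXES`, and `parse_goal` are hypothetical, and the `outcome:` prefix is an assumption (the commit message names only verify:, constraints:, boundaries:, and stop when:).

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical prefix table mapping inline prefixes to contract fields.
# "outcome" is an assumed prefix; the others are named in the commit message.
FIELD_PREFIXES = {
    "outcome": "outcome",
    "verify": "verification",
    "constraints": "constraints",
    "boundaries": "boundaries",
    "stop when": "stop_when",
}


@dataclass
class Contract:
    """Five optional fields, as listed in the commit message."""
    outcome: Optional[str] = None
    verification: Optional[str] = None
    constraints: Optional[str] = None
    boundaries: Optional[str] = None
    stop_when: Optional[str] = None


def parse_goal(text: str) -> Tuple[str, Contract]:
    """Split goal text into (free-form goal, contract).

    Only lines that start with a known prefix are pulled out, so a plain
    goal containing an incidental colon is left untouched.
    """
    contract = Contract()
    goal_lines = []
    for line in text.splitlines():
        prefix, sep, value = line.partition(":")
        field_name = FIELD_PREFIXES.get(prefix.strip().lower())
        if sep and field_name and value.strip():
            setattr(contract, field_name, value.strip())
        else:
            goal_lines.append(line)
    return "\n".join(goal_lines).strip(), contract


if __name__ == "__main__":
    goal, contract = parse_goal(
        "Fix bug: flaky login test\n"
        "verify: pytest tests/test_login.py passes\n"
        "stop when: three attempts fail"
    )
    print(goal)
    print(contract)
```

Matching on the whole prefix before the first colon, rather than searching for any colon, is what keeps a goal such as "Fix bug: flaky login test" intact in this sketch; the real parser may use a different rule.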
assets	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
builtin_hooks	remove: BOOT.md built-in hook (#17093)	2026-04-28 09:50:27 -07:00
platforms	fix(delivery): drop env-var knob, flag all chunking adapters	2026-06-22 05:41:22 -07:00
relay	feat(relay): forward a stable instance id at self-provision (Phase 6 Unit α) (#50772)	2026-06-22 21:46:59 +10:00
__init__.py	docs(gateway): mention Weixin in gateway help and docstrings	2026-05-12 17:08:51 -07:00
authz_mixin.py	Address email pairing review feedback	2026-06-21 22:43:57 -07:00
channel_directory.py	fix: harden WhatsApp target alias salvage	2026-06-15 05:51:47 -07:00
config.py	Address email pairing review feedback	2026-06-21 22:43:57 -07:00
delivery.py	fix(delivery): drop env-var knob, flag all chunking adapters	2026-06-22 05:41:22 -07:00
display_config.py	feat(gateway): rename to tool_progress_grouping, add config/docs/tests	2026-06-16 05:49:24 -07:00
hooks.py	feat(hooks): expose thread_id and chat_type in agent:start/end context (#41672)	2026-06-07 19:16:36 -07:00
kanban_watchers.py	fix(kanban): honor kanban.auto_decompose toggle live, without a gateway restart (#50358)	2026-06-21 12:43:44 -07:00
memory_monitor.py	Port from cline/cline#10343: periodic gateway memory logging (#27102)	2026-05-16 12:55:23 -07:00
message_timestamps.py	feat(gateway): inject stable human-readable message timestamps	2026-06-16 15:49:59 -07:00
mirror.py	refactor(gateway): drop _append_to_jsonl from mirror	2026-05-20 13:00:57 -07:00
pairing.py	fix(gateway): preserve WhatsApp pairing approvals across JID/LID alias flips	2026-05-23 01:46:34 -07:00
platform_registry.py	refactor(plugins): add apply_yaml_config_fn registry hook	2026-05-13 22:20:30 -07:00
response_filters.py	fix(gateway): suppress exact silence tokens without mutating history	2026-06-14 03:25:08 -07:00
restart.py	fix(gateway): address restart review feedback	2026-04-10 21:18:34 -07:00
rich_sent_store.py	fix(telegram): resolve replies to rich (sendRichMessage) messages	2026-06-16 13:04:20 -07:00
run.py	feat(goals): /goal wait <pid> — park the loop on a background process (#50503)	2026-06-22 06:27:29 -07:00
runtime_footer.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
session.py	refactor(session): simplify traversal guard to a helper + logger, harden non-leading separators	2026-06-21 15:23:36 -07:00
session_context.py	fix(api-server): stop silently promising async delivery on stateless HTTP path (#50319)	2026-06-21 12:15:14 -07:00
shutdown_forensics.py	chore: ruff auto-fixes — collapsible-else-if, if-stmt-min-max, dict.fromkeys (#23926)	2026-05-11 11:03:29 -07:00
slash_access.py	feat(gateway): per-platform admin/user split for slash commands (salvage of #4443) (#23373)	2026-05-10 12:33:54 -07:00
slash_commands.py	feat(goals): completion contracts for /goal — evidence-based judging (#50501)	2026-06-22 12:20:09 -07:00
status.py	fix(status): cross-platform start-time fingerprint via psutil fallback	2026-06-21 17:23:33 -07:00
sticker_cache.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
stream_consumer.py	fix(gateway): respect adapter decline of fresh-final to prevent double delivery	2026-06-21 13:55:50 -07:00
stream_dispatch.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250)	2026-06-02 00:33:50 -07:00
stream_events.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250)	2026-06-02 00:33:50 -07:00
whatsapp_identity.py	fix(whatsapp): normalize bare phone targets to JIDs before bridge send	2026-06-21 13:32:22 -07:00