hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-02 02:01:47 +00:00


Shannon Sands 499490d06a Track tool-call validity vs attempts; shape reward accordingly	2026-02-14 09:17:05 +10:00
- AgentResult now includes tool-call metrics: attempted, schema_valid, executed_ok, exec_error
- HermesAgentLoop normalizes args robustly without crashing, but distinguishes schema-valid args (dict) from coerced formats (stringified JSON, plain strings)
- SweSmithOracleEnv reward shaping now prefers schema-valid tool calls while still giving small credit for attempted tool use
configs	Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm)	2026-02-07 09:17:16 +00:00
tool_call_parsers	Fix RL training pipeline: context truncation, double-encoding, shaped rewards	2026-02-13 22:21:32 +00:00
__init__.py	Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm)	2026-02-07 09:17:16 +00:00
agent_loop.py	Track tool-call validity vs attempts; shape reward accordingly	2026-02-14 09:17:05 +10:00
gsm8k_agent_env.py	refactor on SlotPoolEnvironment	2026-02-10 08:30:37 +00:00
hermes_base_env.py	Fix RL training pipeline: context truncation, double-encoding, shaped rewards	2026-02-13 22:21:32 +00:00
hermes_swe_env.py	Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm)	2026-02-07 09:17:16 +00:00
patches.py	testing training	2026-02-11 22:13:05 +00:00
swe_smith_oracle_env.py	Track tool-call validity vs attempts; shape reward accordingly	2026-02-14 09:17:05 +10:00
terminal_test_env.py	Enhance async tool execution and error handling in Hermes agent for Atropos integration	2026-02-08 05:00:47 +00:00
tool_context.py	Enhance async tool execution and error handling in Hermes agent for Atropos integration	2026-02-08 05:00:47 +00:00
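
The latest commit message describes separating schema-valid tool-call arguments (a dict) from coerced formats (stringified JSON, plain strings), and shaping reward so that valid calls are preferred while attempts still earn a small credit. The following is a minimal sketch of that idea, not the code in agent_loop.py or swe_smith_oracle_env.py. The four counter names come from the commit message; everything else (ToolCallMetrics, normalize_args, shaped_tool_reward, the "input" key, and the bonus values 0.05 and 0.2) is hypothetical and chosen only for illustration.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict, Tuple


@dataclass
class ToolCallMetrics:
    # Counter names taken from the commit message; the class itself is hypothetical.
    attempted: int = 0
    schema_valid: int = 0
    executed_ok: int = 0
    exec_error: int = 0


def normalize_args(raw: Any) -> Tuple[Dict[str, Any], bool]:
    """Return (args, schema_valid) without raising on malformed input.

    Only a dict counts as schema-valid. Stringified JSON and plain strings
    are coerced into a dict but flagged as not schema-valid.
    """
    if isinstance(raw, dict):
        return raw, True
    if isinstance(raw, str):
        try:
            parsed = json.loads(raw)
        except ValueError:  # json.JSONDecodeError subclasses ValueError
            parsed = None
        if isinstance(parsed, dict):
            return parsed, False  # stringified JSON: usable, but coerced
        return {"input": raw}, False  # plain string wrapped under an assumed key
    return {}, False  # anything else: empty args, not schema-valid


def shaped_tool_reward(
    metrics: ToolCallMetrics,
    attempt_bonus: float = 0.05,  # assumed value: small credit for trying
    valid_bonus: float = 0.2,     # assumed value: larger credit for valid calls
) -> float:
    """Small credit for any attempt, plus a bonus scaled by the valid fraction."""
    if metrics.attempted == 0:
        return 0.0
    valid_fraction = metrics.schema_valid / metrics.attempted
    return attempt_bonus + valid_bonus * valid_fraction
```

Under these assumptions, a rollout whose calls are all schema-valid scores higher than one whose calls were all coerced, and both score higher than a rollout with no tool calls at all.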