hermes-agent/environments
Shannon Sands 499490d06a Track tool-call validity vs attempts; shape reward accordingly
- AgentResult now includes tool-call metrics: attempted, schema_valid,
  executed_ok, exec_error
- HermesAgentLoop normalizes args robustly without crashing, but
  distinguishes schema-valid args (dict) from coerced formats
  (stringified JSON, plain strings)
- SweSmithOracleEnv reward shaping now prefers schema-valid tool calls
  while still giving small credit for attempted tool use
2026-02-14 09:17:05 +10:00
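The commit message above describes two ideas: normalizing tool-call arguments without crashing while recording whether they arrived schema-valid, and shaping the reward to prefer schema-valid calls. A minimal sketch of how that could look follows. It is not the code in agent_loop.py or swe_smith_oracle_env.py: the names `ToolCallMetrics`, `normalize_args`, and `shaped_reward`, the `"input"` wrapper key, and the bonus weights are all assumptions for illustration. Only the four metric names come from the commit message.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict, Tuple


@dataclass
class ToolCallMetrics:
    # Field names follow the commit message; the class itself is hypothetical.
    attempted: int = 0
    schema_valid: int = 0
    executed_ok: int = 0
    exec_error: int = 0


def normalize_args(raw: Any) -> Tuple[Dict[str, Any], bool]:
    """Return (args, schema_valid) without raising on malformed input."""
    if isinstance(raw, dict):
        # Already the expected shape: counts as schema-valid.
        return raw, True
    if isinstance(raw, str):
        try:
            parsed = json.loads(raw)
        except ValueError:
            parsed = None
        if isinstance(parsed, dict):
            # Stringified JSON: usable, but coerced rather than schema-valid.
            return parsed, False
        # Plain string: wrap it under an assumed key so execution can proceed.
        return {"input": raw}, False
    # Anything else (None, list, number) falls back to empty args.
    return {}, False


def shaped_reward(
    metrics: ToolCallMetrics,
    task_reward: float,
    valid_bonus: float = 0.1,     # illustrative weight, not from the source
    attempt_bonus: float = 0.02,  # illustrative weight, not from the source
) -> float:
    """Add a small credit for attempting tool use and a larger one for valid calls."""
    if metrics.attempted == 0:
        return task_reward
    valid_fraction = metrics.schema_valid / metrics.attempted
    return task_reward + attempt_bonus + valid_bonus * valid_fraction
```

Under these assumed weights, a rollout whose calls are all schema-valid scores higher than one that only produced coerced arguments, which in turn scores slightly higher than one that never attempted a tool call.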
configs Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm) 2026-02-07 09:17:16 +00:00
tool_call_parsers Fix RL training pipeline: context truncation, double-encoding, shaped rewards 2026-02-13 22:21:32 +00:00
__init__.py Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm) 2026-02-07 09:17:16 +00:00
agent_loop.py Track tool-call validity vs attempts; shape reward accordingly 2026-02-14 09:17:05 +10:00
gsm8k_agent_env.py refactor on SlotPoolEnvironment 2026-02-10 08:30:37 +00:00
hermes_base_env.py Fix RL training pipeline: context truncation, double-encoding, shaped rewards 2026-02-13 22:21:32 +00:00
hermes_swe_env.py Add support for Atropos Agentic RL environments (requires branch tool_call_support in Atropos atm) 2026-02-07 09:17:16 +00:00
patches.py testing training 2026-02-11 22:13:05 +00:00
swe_smith_oracle_env.py Track tool-call validity vs attempts; shape reward accordingly 2026-02-14 09:17:05 +10:00
terminal_test_env.py Enhance async tool execution and error handling in Hermes agent for Atropos integration 2026-02-08 05:00:47 +00:00
tool_context.py Enhance async tool execution and error handling in Hermes agent for Atropos integration 2026-02-08 05:00:47 +00:00