hermes-agent/environments/benchmarks/terminalbench_2
teknium 85e629e915 Add cleanup functionality for orphaned sandboxes in TerminalBench2EvalEnv
- Implemented a cleanup process to terminate any remaining sandboxes after evaluation, addressing issues with orphaned thread pool workers.
- Enhanced logging to inform users about the cleanup process, ensuring better resource management and user awareness.
2026-02-10 23:48:49 +00:00
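
The commit message above describes terminating sandboxes left behind when thread pool workers are orphaned. The following is a minimal sketch of that idea, not the code in `terminalbench2_env.py`. It assumes a hypothetical `SandboxRegistry` in which each worker registers a terminate callback when it creates a sandbox and unregisters it on normal completion. All names, including the logger name, are illustrative assumptions.

```python
import logging
import threading

logger = logging.getLogger("tb2_cleanup_sketch")  # hypothetical logger name


class SandboxRegistry:
    """Hypothetical registry of sandboxes that have not been shut down yet."""

    def __init__(self):
        self._lock = threading.Lock()
        self._active = {}  # sandbox_id -> zero-argument terminate callable

    def register(self, sandbox_id, terminate):
        # Called by a worker right after it creates a sandbox.
        with self._lock:
            self._active[sandbox_id] = terminate

    def unregister(self, sandbox_id):
        # Called by a worker after it has shut its sandbox down normally.
        with self._lock:
            self._active.pop(sandbox_id, None)

    def cleanup_all(self):
        """Terminate every sandbox still registered and return the cleaned IDs."""
        # Snapshot and clear under the lock so late workers cannot race the loop.
        with self._lock:
            leftovers = dict(self._active)
            self._active.clear()
        if leftovers:
            logger.info("Cleaning up %d orphaned sandbox(es)...", len(leftovers))
        cleaned = []
        for sandbox_id, terminate in leftovers.items():
            try:
                terminate()
                cleaned.append(sandbox_id)
            except Exception:
                # One failed teardown should not block the remaining ones.
                logger.exception("Failed to terminate sandbox %s", sandbox_id)
        return cleaned
```

In a design like this, the evaluation entry point would call `cleanup_all()` in a `finally` block after the run, so that sandboxes owned by workers that never returned are still terminated.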
__init__.py            Add new environments and enhance tool context functionality                2026-02-10 19:39:05 +00:00
default.yaml           Enhance TerminalBench 2 configuration and evaluation handling              2026-02-10 22:53:24 +00:00
run_eval.sh            Add new environments and enhance tool context functionality                2026-02-10 19:39:05 +00:00
terminalbench2_env.py  Add cleanup functionality for orphaned sandboxes in TerminalBench2EvalEnv  2026-02-10 23:48:49 +00:00