hermes-agent/environments/benchmarks
dmahan93 ed27b826c5 feat: add eval_concurrency limit + Docker local config for TBLite
- Add eval_concurrency config field with asyncio.Semaphore
- Add local.yaml config using Docker backend (sandboxed, no cloud costs)
- Register docker_image alongside modal_image for backend flexibility
- Default: 8 parallel tasks for local runs
2026-03-11 06:52:26 -07:00
..
tblite feat: add eval_concurrency limit + Docker local config for TBLite 2026-03-11 06:52:26 -07:00
terminalbench_2 feat: add eval_concurrency limit + Docker local config for TBLite 2026-03-11 06:52:26 -07:00
yc_bench fix: update OpenRouter model names for yc-bench config 2026-03-06 19:58:56 -08:00
__init__.py Add new environments and enhance tool context functionality 2026-02-10 19:39:05 +00:00