hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-09 08:21:50 +00:00

teknium1 0ea6c34325	feat: add OpenThoughts-TBLite evaluation environment and configuration files	2026-03-04 11:42:41 +00:00

Introduced a new evaluation environment for OpenThoughts-TBLite, including the main evaluation script, configuration YAML, and README documentation. This environment provides a faster alternative to Terminal-Bench 2.0, featuring 100 difficulty-calibrated tasks for terminal agents. The setup allows for easy evaluation and configuration, enhancing the benchmarking capabilities for terminal agents.

tblite	feat: add OpenThoughts-TBLite evaluation environment and configuration files	2026-03-04 11:42:41 +00:00
terminalbench_2	Enhance TerminalBench2 environment with task filtering (due to incompatibility with Modal) and logging improvements	2026-02-12 05:36:45 +00:00
__init__.py	Add new environments and enhance tool context functionality	2026-02-10 19:39:05 +00:00