hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-25 17:18:11 +00:00

History

Teknium f362083c64 fix(providers): complete NVIDIA NIM parity with other providers Follow-up on the native NVIDIA NIM provider salvage. The original PR wired PROVIDER_REGISTRY + HERMES_OVERLAYS correctly but missed several touchpoints required for full parity with other OpenAI-compatible providers (xai, huggingface, deepseek, zai). Gaps closed: - hermes_cli/main.py: - Add 'nvidia' to the _model_flow_api_key_provider dispatch tuple so selecting 'NVIDIA NIM' in `hermes model` actually runs the api-key provider flow (previously fell through silently). - Add 'nvidia' to `hermes chat --provider` argparse choices so the documented test command (`hermes chat --provider nvidia --model ...`) parses successfully. - hermes_cli/config.py: Register NVIDIA_API_KEY and NVIDIA_BASE_URL in OPTIONAL_ENV_VARS so setup wizard can prompt for them and they're auto-added to the subprocess env blocklist. - hermes_cli/doctor.py: Add NVIDIA NIM row to `_apikey_providers` so `hermes doctor` probes https://integrate.api.nvidia.com/v1/models. - hermes_cli/dump.py: Add NVIDIA_API_KEY → 'nvidia' mapping for `hermes dump` credential masking. - tests/tools/test_local_env_blocklist.py: Extend registry_vars fixture with NVIDIA_API_KEY to verify it's blocked from leaking into subprocesses. - agent/model_metadata.py: Add 'nemotron' → 131072 context-length entry so all Nemotron variants get 128K context via substring match (rather than falling back to MINIMUM_CONTEXT_LENGTH). - hermes_cli/models.py: Fix hallucinated model ID 'nvidia/nemotron-3-nano-8b-a4b' → 'nvidia/nemotron-3-nano-30b-a3b' (verified against live integrate.api.nvidia.com/v1/models catalog). Expand curated list from 5 to 9 agentic models mapping to OpenRouter defaults per provider-guide convention: add qwen3.5-397b-a17b, deepseek-v3.2, llama-3.3-nemotron-super-49b-v1.5, gpt-oss-120b. - cli-config.yaml.example: Document 'nvidia' provider option. - scripts/release.py: Map asurla@nvidia.com → anniesurla in AUTHOR_MAP for CI attribution. E2E verified: `hermes chat --provider nvidia ...` now reaches NVIDIA's endpoint (returns 401 with bogus key instead of argparse error); `hermes doctor` detects NVIDIA NIM when NVIDIA_API_KEY is set.		2026-04-17 13:47:46 -07:00
..
whatsapp-bridge	security: supply chain hardening — CI pinning, dep pinning, and code fixes (#9801 )	2026-04-14 14:23:37 -07:00
build_skills_index.py	feat(skills): centralized skills index — eliminate GitHub API calls for search/install	2026-04-12 16:39:04 -07:00
contributor_audit.py	feat(ci): add contributor attribution check on PRs (#9376 )	2026-04-13 21:13:08 -07:00
discord-voice-doctor.py	feat(tools): add Voxtral TTS provider (Mistral AI)	2026-04-11 01:56:55 -07:00
hermes-gateway	fix: prevent systemd restart storm on gateway connection failure	2026-03-21 09:26:39 -07:00
install.cmd	feat: Windows native support via Git Bash	2026-03-02 22:03:29 -08:00
install.ps1	fix: re-order providers,Quick Install, subscription polling	2026-04-06 11:16:07 -04:00
install.sh	fix(installer): make prompt_yes_no bash 3.2 compatible	2026-04-16 20:14:02 -07:00
kill_modal.sh	refactor: replace swe-rex with native Modal SDK for Modal backend (#3538 )	2026-03-28 11:21:44 -07:00
release.py	fix(providers): complete NVIDIA NIM parity with other providers	2026-04-17 13:47:46 -07:00
run_tests.sh	test: make test env hermetic; enforce CI parity via scripts/run_tests.sh (#11577 )	2026-04-17 06:09:09 -07:00
sample_and_compress.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00