hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

History

Teknium 8d302e37a8 feat(tts): add Piper as a native local TTS provider (closes #8508 ) (#17885 ) Piper (OHF-Voice/piper1-gpl) is a fast, local neural TTS engine from the Home Assistant project that supports 44 languages with zero API keys. Adds it as a native built-in provider alongside edge/neutts/kittentts, installable via 'hermes tools' with one keystroke. What ships: - New 'piper' built-in provider in tools/tts_tool.py - Lazy import via _import_piper() - Module-level voice cache keyed on (model_path, use_cuda) so switching voices doesn't invalidate older cached voices - _resolve_piper_voice_path() accepts either an absolute .onnx path or a voice name (auto-downloaded on first use via 'python -m piper.download_voices --download-dir <cache>') - Voice cache at ~/.hermes/cache/piper-voices/ (profile-aware via get_hermes_dir) - Optional SynthesisConfig knobs: length_scale, noise_scale, noise_w_scale, volume, normalize_audio, use_cuda — passed through only when configured, so older piper-tts versions aren't broken - WAV output then ffmpeg conversion path (same as neutts/kittentts) so Telegram voice bubbles work when ffmpeg is present - Piper added to BUILTIN_TTS_PROVIDERS so a user's tts.providers.piper.command cannot shadow the native provider (regression test included) - 'hermes tools' wizard entry - Piper appears under Voice and TTS as local free, with 'pip install piper-tts' auto-install via post_setup handler - Prints voice-catalog URL and default-voice info after install - config.yaml defaults - tts.piper.voice defaults to en_US-lessac-medium - Commented advanced knobs for discoverability - Docs - New 'Piper (local, 44 languages)' section in features/tts.md explaining install path, voice switching, pre-downloaded voices, and advanced knobs - Piper listed in the ten-provider table and ffmpeg table - Custom-command-providers section updated to drop the Piper example (now native) and add a piper-custom example for users with their own trained .onnx models - overview.md bumps provider count to ten - Tests (tests/tools/test_tts_piper.py, 16 tests) - Registration (BUILTIN_TTS_PROVIDERS, PROVIDER_MAX_TEXT_LENGTH) - _resolve_piper_voice_path across every branch: direct .onnx path, cached voice name, fresh download with correct CLI args, download failure, successful-exit-but-missing-files, empty voice to default - _generate_piper_tts: loads voice once, reuses cache, voice-name download wiring, advanced knobs flow through SynthesisConfig - text_to_speech_tool end-to-end dispatch and missing-package error - check_tts_requirements: piper availability toggles the return value - Regression guard: piper cannot be shadowed by a command provider with the same name - Pre-existing test_tts_mistral test broadened to mock the new piper/kittentts/command-provider checks (otherwise it false-passes when piper is installed in the test venv) E2E verification (live): Actual pip install piper-tts, config piper + en_US-lessac-low, text_to_speech_tool call, voice auto-downloaded from HuggingFace, WAV synthesized, ffmpeg-converted to Ogg/Opus. Second call hits the cache (~60ms). Cache dir populated with .onnx and .onnx.json. This caught a real bug during development: the first pass used '-d' as the download-dir flag; the actual piper.download_voices CLI wants '--download-dir'. Fixed before PR opened.		2026-04-30 02:53:20 -07:00
..
browser_providers	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
environments	fix(ci): stabilize main test suite regressions (#17660 )	2026-04-29 23:18:55 -07:00
neutts_samples	refactor(tts): replace NeuTTS optional skill with built-in provider + setup flow	2026-03-17 02:33:12 -07:00
__init__.py	Merge branch 'main' into rewbs/tool-use-charge-to-subscription	2026-03-31 08:48:54 +09:00
ansi_strip.py	fix: strip ANSI at the source — clean terminal output before it reaches the model	2026-03-23 07:43:12 -07:00
approval.py	chore(salvage): strip duplicated/merge-corrupted blocks from PR #17664	2026-04-29 21:56:51 -07:00
binary_extensions.py	fix(tools): address PR review — remove _extract_raw_output, BudgetConfig everywhere, read_file hardening	2026-04-08 02:24:32 -07:00
browser_camofox.py	refactor(config): add cfg_get() helper; migrate 20 nested-get call sites (#17304 )	2026-04-28 23:17:39 -07:00
browser_camofox_state.py	feat(browser): add persistent Camofox sessions and VNC URL discovery (salvage #4400 ) (#4419 )	2026-04-01 04:18:50 -07:00
browser_cdp_tool.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
browser_dialog_tool.py	feat(browser): CDP supervisor — dialog detection + response + cross-origin iframe eval (#14540 )	2026-04-23 22:23:37 -07:00
browser_supervisor.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
browser_tool.py	refactor(config): add cfg_get() helper; migrate 20 nested-get call sites (#17304 )	2026-04-28 23:17:39 -07:00
budget_config.py	fix: preserve existing thresholds, remove pre-read byte guard	2026-04-08 02:24:32 -07:00
checkpoint_manager.py	feat(checkpoints): auto-prune orphan and stale shadow repos at startup (#16303 )	2026-04-26 19:05:52 -07:00
clarify_tool.py	refactor: add tool_error/tool_result helpers + read_raw_config, migrate 129 callsites	2026-04-07 13:36:38 -07:00
code_execution_tool.py	feat: add Vercel Sandbox backend	2026-04-29 07:22:33 -07:00
credential_files.py	refactor(config): add cfg_get() helper; migrate 20 nested-get call sites (#17304 )	2026-04-28 23:17:39 -07:00
cronjob_tools.py	fix(cron): accept list-form deliver values so deliver=['telegram'] works (#17456 )	2026-04-29 06:35:34 -07:00
debug_helpers.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00
delegate_tool.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
discord_tool.py	fix(discord_tool): coerce limit parameter to int before min() call	2026-04-26 20:48:38 -07:00
env_passthrough.py	refactor(config): add cfg_get() helper; migrate 20 nested-get call sites (#17304 )	2026-04-28 23:17:39 -07:00
feishu_doc_tool.py	fix(feishu-comment): use get_hermes_home(); drop dead asyncio wrapper; AUTHOR_MAP	2026-04-17 19:04:11 -07:00
feishu_drive_tool.py	fix(feishu-comment): use get_hermes_home(); drop dead asyncio wrapper; AUTHOR_MAP	2026-04-17 19:04:11 -07:00
file_operations.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
file_state.py	feat(delegate): cross-agent file state coordination for concurrent subagents (#13718 )	2026-04-21 16:41:26 -07:00
file_tools.py	fix(ci): stabilize main test suite regressions (#17660 )	2026-04-29 23:18:55 -07:00
fuzzy_match.py	fix(patch): gate 'did you mean?' to no-match + extend to v4a/skill_manage	2026-04-21 02:03:46 -07:00
homeassistant_tool.py	fix: clean up description escaping, add string-data tests	2026-04-13 04:45:07 -07:00
image_generation_tool.py	fix(image-gen): force-refresh plugin providers in long-lived sessions	2026-04-23 03:01:18 -07:00
interrupt.py	fix(interrupt): propagate to concurrent-tool workers + opt-in debug trace (#11907 )	2026-04-17 20:39:25 -07:00
managed_tool_gateway.py	fix(tools): add debug logging for token refresh and tighten domain check	2026-04-02 12:40:03 +11:00
mcp_oauth.py	fix(mcp-oauth): preserve server_url path for protected-resource validation (#16031 )	2026-04-26 05:43:54 -07:00
mcp_oauth_manager.py	fix(mcp-oauth): preserve server_url path for protected-resource validation (#16031 )	2026-04-26 05:43:54 -07:00
mcp_tool.py	fix: clean up defensive shims and finish CI stabilization from #17660 (#17801 )	2026-04-29 23:53:17 -07:00
memory_tool.py	refactor: consolidate symlink-safe atomic replace into shared helper	2026-04-28 04:58:22 -07:00
mixture_of_agents_tool.py	Fix (mixture_of_agents): replace deprecated Gemini model and forward max_tokens to OpenRouter (#6621 )	2026-04-23 15:14:11 -07:00
neutts_synth.py	fix(tts): document NeuTTS provider and align install guidance (#1903 )	2026-03-18 02:55:30 -07:00
openrouter_client.py	refactor: route ad-hoc LLM consumers through centralized provider router	2026-03-11 20:02:36 -07:00
osv_check.py	feat: OSV malware check for MCP extension packages (#5305 )	2026-04-05 12:46:07 -07:00
patch_parser.py	fix(patch): gate 'did you mean?' to no-match + extend to v4a/skill_manage	2026-04-21 02:03:46 -07:00
path_security.py	refactor: extract shared helpers to deduplicate repeated code patterns (#7917 )	2026-04-11 13:59:52 -07:00
process_registry.py	fix(process): reconcile session.exited against real child exit in poll/wait (#17430 )	2026-04-29 04:59:21 -07:00
registry.py	perf(tools): memoize get_tool_definitions + TTL-cache check_fn results (#17098 )	2026-04-28 18:20:17 -07:00
rl_training_tool.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00
schema_sanitizer.py	refactor(schema): consolidate nullable-union stripping in schema_sanitizer	2026-04-28 04:58:03 -07:00
send_message_tool.py	feat(gateway): centralize audio routing + FLAC support + Telegram doc fallback (#17833 )	2026-04-30 01:32:31 -07:00
session_search_tool.py	fix(session-search): exclude current lineage root deterministically in recent mode	2026-04-26 19:03:17 -07:00
skill_manager_tool.py	feat(skills): refuse skill_manage writes on pinned skills (#17562 )	2026-04-29 10:28:25 -07:00
skill_usage.py	fix(curator): defense-in-depth gates against bundled/hub skills	2026-04-28 22:33:33 -07:00
skills_guard.py	feat(skills-guard): gate agent-created scanner on config.skills.guard_agent_created (default off)	2026-04-23 06:20:47 -07:00
skills_hub.py	feat(skills): install skills from a direct HTTP(S) URL (#16323 )	2026-04-26 20:57:10 -07:00
skills_sync.py	refactor: consolidate symlink-safe atomic replace into shared helper	2026-04-28 04:58:22 -07:00
skills_tool.py	refactor(reload-skills): queue note for next turn, drop cache invalidation + agent tool	2026-04-29 21:07:47 -07:00
slash_confirm.py	feat(gateway,cli): confirm /reload-mcp to warn about prompt cache invalidation	2026-04-29 21:56:47 -07:00
terminal_tool.py	feat: add Vercel Sandbox backend	2026-04-29 07:22:33 -07:00
tirith_security.py	fix: guard against None tirith path in security scanner	2026-04-23 03:08:53 -07:00
todo_tool.py	fix(tools): enforce ID uniqueness in TODO store during replace operations	2026-04-11 16:22:50 -07:00
tool_backend_helpers.py	fix(cli): coerce use_gateway config flags in tool routing	2026-04-26 19:02:55 -07:00
tool_output_limits.py	feat(skills): add design-md skill for Google's DESIGN.md spec (#14876 )	2026-04-23 21:51:19 -07:00
tool_result_storage.py	fix(tools): neutralize shell injection in _write_to_sandbox via path quoting (#7940 )	2026-04-11 14:26:11 -07:00
transcription_tools.py	fix(ci): stabilize main test suite regressions (#17660 )	2026-04-29 23:18:55 -07:00
tts_tool.py	feat(tts): add Piper as a native local TTS provider (closes #8508 ) (#17885 )	2026-04-30 02:53:20 -07:00
url_safety.py	fix(security): treat quoted false as false in browser SSRF guards	2026-04-26 18:27:13 -07:00
vision_tools.py	fix(vision): use HERMES_HOME-based cache dir instead of cwd (#17719 )	2026-04-29 20:14:02 -07:00
voice_mode.py	fix: point optional-dep install hints at the venv's python (#11938 )	2026-04-17 21:16:33 -07:00
web_tools.py	perf(tools): memoize get_tool_definitions + TTL-cache check_fn results (#17098 )	2026-04-28 18:20:17 -07:00
website_policy.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00
xai_http.py	feat(xai): upgrade to Responses API, add TTS provider	2026-04-16 02:24:08 -07:00
yuanbao_tools.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00