hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-30 01:41:43 +00:00

History

Teknium 01906e99dd feat(image_gen): multi-model FAL support with picker in hermes tools (#11265 ) * feat(image_gen): multi-model FAL support with picker in hermes tools Adds 8 FAL text-to-image models selectable via `hermes tools` → Image Generation → (FAL.ai \| Nous Subscription) → model picker. Models supported: - fal-ai/flux-2/klein/9b (new default, <1s, $0.006/MP) - fal-ai/flux-2-pro (previous default, kept backward-compat upscaling) - fal-ai/z-image/turbo (Tongyi-MAI, bilingual EN/CN) - fal-ai/nano-banana (Gemini 2.5 Flash Image) - fal-ai/gpt-image-1.5 (with quality tier: low/medium/high) - fal-ai/ideogram/v3 (best typography) - fal-ai/recraft-v3 (vector, brand styles) - fal-ai/qwen-image (LLM-based) Architecture: - FAL_MODELS catalog declares per-model size family, defaults, supports whitelist, and upscale flag. Three size families handled uniformly: image_size_preset (flux family), aspect_ratio (nano-banana), and gpt_literal (gpt-image-1.5). - _build_fal_payload() translates unified inputs (prompt + aspect_ratio) into model-specific payloads, merges defaults, applies caller overrides, wires GPT quality_setting, then filters to the supports whitelist — so models never receive rejected keys. - IMAGEGEN_BACKENDS registry in tools_config prepares for future imagegen providers (Replicate, Stability, etc.); each provider entry tags itself with imagegen_backend: 'fal' to select the right catalog. - Upscaler (Clarity) defaults off for new models (preserves <1s value prop), on for flux-2-pro (backward-compat). Per-model via FAL_MODELS. Config: image_gen.model = fal-ai/flux-2/klein/9b (new) image_gen.quality_setting = medium (new, GPT only) image_gen.use_gateway = bool (existing) Agent-facing schema unchanged (prompt + aspect_ratio only) — model choice is a user-level config decision, not an agent-level arg. Picker uses curses_radiolist (arrow keys, auto numbered-fallback on non-TTY). Column-aligned: Model / Speed / Strengths / Price. Docs: image-generation.md rewritten with the model table and picker walkthrough. tools-reference, tool-gateway, overview updated to drop the stale "FLUX 2 Pro" wording. Tests: 42 new in tests/tools/test_image_generation.py covering catalog integrity, all 3 size families, supports filter, default merging, GPT quality wiring, model resolution fallback. 8 new in tests/hermes_cli/test_tools_config.py for picker wiring (registry, config writes, GPT quality follow-up prompt, corrupt-config repair). * feat(image_gen): translate managed-gateway 4xx to actionable error When the Nous Subscription managed FAL proxy rejects a model with 4xx (likely portal-side allowlist miss or billing gate), surface a clear message explaining: 1. The rejected model ID + HTTP status 2. Two remediation paths: set FAL_KEY for direct access, or pick a different model via `hermes tools` 5xx, connection errors, and direct-FAL errors pass through unchanged (those have different root causes and reasonable native messages). Motivation: new FAL models added to this release (flux-2-klein-9b, z-image-turbo, nano-banana, gpt-image-1.5, ideogram-v3, recraft-v3, qwen-image) are untested against the Nous Portal proxy. If the portal allowlists model IDs, users on Nous Subscription will hit cryptic 4xx errors without guidance on how to work around it. Tests: 8 new cases covering status extraction across httpx/fal error shapes and 4xx-vs-5xx-vs-ConnectionError translation policy. Docs: brief note in image-generation.md for Nous subscribers. Operator action (Nous Portal side): verify that fal-queue-gateway passes through these 7 new FAL model IDs. If the proxy has an allowlist, add them; otherwise Nous Subscription users will see the new translated error and fall back to direct FAL. * feat(image_gen): pin GPT-Image quality to medium (no user choice) Previously the tools picker asked a follow-up question for GPT-Image quality tier (low / medium / high) and persisted the answer to `image_gen.quality_setting`. This created two problems: 1. Nous Portal billing complexity — the 22x cost spread between tiers ($0.009 low / $0.20 high) forces the gateway to meter per-tier per user, which the portal team can't easily support at launch. 2. User footgun — anyone picking `high` by mistake burns through credit ~6x faster than `medium`. This commit pins quality at medium by baking it into FAL_MODELS defaults for gpt-image-1.5 and removes all user-facing override paths: - Removed `_resolve_gpt_quality()` runtime lookup - Removed `honors_quality_setting` flag on the model entry - Removed `_configure_gpt_quality_setting()` picker helper - Removed `_GPT_QUALITY_CHOICES` constant - Removed the follow-up prompt call in `_configure_imagegen_model()` - Even if a user manually edits `image_gen.quality_setting` in config.yaml, no code path reads it — always sends medium. Tests: - Replaced TestGptQualitySetting (6 tests) with TestGptQualityPinnedToMedium (5 tests) — proves medium is baked in, config is ignored, flag is removed, helper is removed, non-gpt models never get quality. - Replaced test_picker_with_gpt_image_also_prompts_quality with test_picker_with_gpt_image_does_not_prompt_quality — proves only 1 picker call fires when gpt-image is selected (no quality follow-up). Docs updated: image-generation.md replaces the quality-tier table with a short note explaining the pinning decision. * docs(image_gen): drop stale 'wires GPT quality tier' line from internals section Caught in a cleanup sweep after pinning quality to medium. The "How It Works Internally" walkthrough still described the removed quality-wiring step.		2026-04-16 20:19:53 -07:00
..
__init__.py	chore: release v0.10.0 (2026.4.16) (#11209 )	2026-04-16 12:53:06 -07:00
auth.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
auth_commands.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
backup.py	feat: fix SQLite safety in hermes backup + add --quick snapshots + /snapshot command (#8971 )	2026-04-13 04:46:13 -07:00
banner.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
callbacks.py	fix: ESC cancels secret/sudo prompts, clearer skip messaging (#9902 )	2026-04-14 16:11:37 -07:00
claw.py	fix: unify OpenClaw detection, add isatty guard, fix print_warning import	2026-04-12 16:40:37 -07:00
cli_output.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
clipboard.py	feat(gateway): WSL-aware gateway with smart systemd detection (#7510 )	2026-04-10 21:15:47 -07:00
codex_models.py	fix: add gpt-5.4-mini to Codex fallback catalog (#3855 )	2026-03-29 20:10:00 -07:00
colors.py	feat: respect NO_COLOR env var and TERM=dumb (#4079 )	2026-03-30 17:07:21 -07:00
commands.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
completion.py	fix: preserve profile name completion in dynamic shell completion	2026-04-14 10:45:42 -07:00
config.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
copilot_auth.py	fix(copilot): resolve GHE token poisoning when GITHUB_TOKEN is set	2026-04-13 05:12:36 -07:00
cron.py	feat(cron): track delivery failures in job status (#6042 )	2026-04-07 22:49:01 -07:00
curses_ui.py	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
debug.py	fix: bump debug share paste TTL from 1 hour to 6 hours (#11240 )	2026-04-16 14:34:46 -07:00
default_soul.py	fix: reset default SOUL.md to baseline identity text (#3159 )	2026-03-26 01:34:27 -07:00
doctor.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
dump.py	fix: QQBot missing integration points, timestamp parsing, test fix	2026-04-14 00:11:49 -07:00
env_loader.py	fix: detect and strip non-ASCII characters from API keys (#6843 )	2026-04-14 20:20:31 -07:00
gateway.py	fix: use POSIX ps -A instead of BSD -ax for Docker compat (#9723 ) (#10569 )	2026-04-15 17:07:22 -07:00
logs.py	feat: component-separated logging with session context and filtering (#7991 )	2026-04-11 17:23:36 -07:00
main.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
mcp_config.py	feat: add --env and --preset support to hermes mcp add	2026-04-11 15:34:57 -07:00
memory_setup.py	fix(memory): discover user-installed memory providers from $HERMES_HOME/plugins/ (#10529 )	2026-04-15 14:25:40 -07:00
model_normalize.py	feat: add Ollama Cloud as built-in provider	2026-04-16 02:22:09 -07:00
model_switch.py	fix(opencode): strip /v1 from base_url on mid-session /model switch to Anthropic-routed models (#11286 )	2026-04-16 19:41:41 -07:00
models.py	fix(models): add glm-5.1 to opencode-go catalogs	2026-04-16 16:49:22 -07:00
nous_subscription.py	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
pairing.py	chore: fix 154 f-strings, simplify getattr/URL patterns, remove dead code (#3119 )	2026-03-25 19:47:58 -07:00
platforms.py	feat(gateway): unify QQBot branding, add PLATFORM_HINTS, fix streaming, restore missing setup functions	2026-04-14 00:11:49 -07:00
plugins.py	feat(plugins): add dispatch_tool() to PluginContext (#10763 )	2026-04-15 22:23:01 -07:00
plugins_cmd.py	fix: no auto-activation + unified hermes plugins UI with provider categories	2026-04-10 19:15:50 -07:00
profiles.py	fix: improve profile creation UX — seed SOUL.md + credential warning (#8553 )	2026-04-12 12:22:34 -07:00
providers.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
runtime_provider.py	feat(gemini): add Google Gemini CLI OAuth provider via Cloud Code Assist (free + paid tiers) (#11270 )	2026-04-16 16:49:00 -07:00
setup.py	fix(models): add glm-5.1 to opencode-go catalogs	2026-04-16 16:49:22 -07:00
skills_config.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
skills_hub.py	fix(skills): cache GitHub repo trees to avoid rate-limit exhaustion on install	2026-04-12 16:39:04 -07:00
skin_engine.py	fix(cli): handle null/non-dict display config in skin initialization	2026-04-16 06:35:31 -07:00
status.py	feat: ungate Tool Gateway — subscription-based access with per-tool opt-in	2026-04-16 12:36:49 -07:00
tips.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
tools_config.py	feat(image_gen): multi-model FAL support with picker in hermes tools (#11265 )	2026-04-16 20:19:53 -07:00
uninstall.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
web_server.py	dashboard: show GATEWAY_HEALTH_URL instead of PID for remote gateways	2026-04-16 16:48:14 -07:00
webhook.py	refactor: replace inline HERMES_HOME re-implementations with get_hermes_home()	2026-04-07 10:40:34 -07:00