hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-14 14:12:44 +00:00


Teknium cb38ce28cb refactor(codex): drop SDK responses.stream() helper; consume events directly (#33042)

* refactor(codex): drop SDK responses.stream() helper; consume events directly

The OpenAI Python SDK's high-level `client.responses.stream(...)` helper does post-hoc typed reconstruction from the terminal `response.completed.response.output` field. The chatgpt.com Codex backend has been observed (today, gpt-5.5) to ship `response.output = null` on terminal frames, which crashes the SDK with `TypeError: 'NoneType' object is not iterable` mid-iteration.

Carlton's #32963 patched the symptom by wrapping the helper in try/except and recovering from the same per-event accumulator the SDK was supposed to populate.

This PR removes the helper from the call path entirely: we now use `client.responses.create(stream=True)` (raw AsyncIterable of SSE events) and assemble the final response object ourselves from `response.output_item.done` events as they arrive. The terminal event's `output` field is never read for content. Same strategy OpenClaw uses for the same backend.

This makes Hermes structurally immune to the bug class, not patched. The next time OpenAI ships a shape change to chatgpt.com's terminal frame, our consumer keeps working because it doesn't read that frame for content — only for usage/status/id.

Changes
- `agent/codex_runtime.py`: new `_consume_codex_event_stream()` shared consumer; `run_codex_stream()` uses `responses.create(stream=True)`; `run_codex_create_stream_fallback()` collapses into a thin alias since the primary path now does what the fallback used to do.
- `agent/auxiliary_client.py`: `_CodexCompletionsAdapter` uses the same consumer; old null-output recovery helpers deleted as unreferenced.
- Tests migrated: fixtures that mocked `responses.stream` now mock `responses.create` returning a raw iterable. New regression test asserts the auxiliary path returns streamed items even when the terminal event's `output` is literally `null`.

Validation
- Live: tested against fresh OAuth on `chatgpt.com/backend-api/codex` with `gpt-5.5` — response built correctly with `response.output=null` on the terminal frame, all events consumed, usage/reasoning tokens propagated.
- `tests/run_agent/test_run_agent_codex_responses.py` + `tests/agent/test_auxiliary_client.py`: 242 passed.

* test+fix(codex): migrate streaming tests, raise on truncated streams

CI surfaced 10 test failures across tests/run_agent/test_streaming.py and tests/run_agent/test_codex_xai_oauth_recovery.py — both files had their own `responses.stream(...)` mocks I missed in the first sweep.

agent/codex_runtime.py: _consume_codex_event_stream() now raises "Codex Responses stream did not emit a terminal response" when the stream ends without any terminal frame AND no usable content. This preserves the signal callers used to get from the SDK's high-level helper, which they distinguished from "completed with empty body" in error handling.

Tests migrated:
- test_streaming.py: text-delta callback, activity-touch, and remote-protocol-error tests all switch from mocking responses.stream to responses.create returning an iterable of events.
- test_codex_xai_oauth_recovery.py: prelude-error tests are recast as wire-error-event tests (the new path raises _StreamErrorEvent directly when the wire emits type=error, which is strictly better than the old two-phase "SDK RuntimeError → retry → fallback"). The retry-on-transport-error test moves from responses.stream side-effect to responses.create side-effect.

Verified live against chatgpt.com Codex with gpt-5.5 — AIAgent.chat() through the full codex_responses path returns correctly, 319/319 targeted tests passing.

2026-05-27 00:30:06 -07:00
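The consumer strategy described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code in `agent/codex_runtime.py`: the names `consume_event_stream` and `StreamErrorEvent` are hypothetical stand-ins, events are modelled as plain dicts rather than SDK objects, the `item`, `response` and `message` keys are assumed event fields, and only `response.completed` is treated as a terminal frame.

```python
from typing import Any, Dict, Iterable, List, Optional


class StreamErrorEvent(RuntimeError):
    """Hypothetical stand-in for the error raised on a wire-level `error` event."""


def consume_event_stream(events: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Build a response from streamed events without reading the terminal `output`."""
    items: List[Dict[str, Any]] = []
    terminal: Optional[Dict[str, Any]] = None

    for event in events:
        kind = event.get("type")
        if kind == "error":
            # Surface wire errors immediately instead of retrying through a helper.
            raise StreamErrorEvent(str(event.get("message", "stream error")))
        if kind == "response.output_item.done":
            # Content is accumulated per item as it arrives.
            items.append(event["item"])
        elif kind == "response.completed":
            # The terminal frame is kept only for id/status/usage metadata.
            terminal = event.get("response") or {}

    if terminal is None and not items:
        # Truncated stream: no terminal frame and nothing usable.
        raise RuntimeError("Codex Responses stream did not emit a terminal response")

    meta = terminal or {}
    return {
        "id": meta.get("id"),
        "status": meta.get("status"),
        "usage": meta.get("usage"),
        "output": items,  # never taken from the terminal frame's `output`
    }
```

Feeding this sketch a stream whose terminal frame carries `"output": None` still returns the items collected from the `response.output_item.done` events, which is the property the regression test in the commit is described as asserting.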
lsp	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355)	2026-05-17 02:29:41 -07:00
secret_sources	perf(cli): cut hermes startup 63% — flip head-to-head vs codex (#31968)	2026-05-25 03:06:39 -07:00
transports	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
__init__.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
account_usage.py	chore: ruff auto-fix PLR6201 — tuple → set in membership tests (#23937)	2026-05-11 11:13:25 -07:00
agent_init.py	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
agent_runtime_helpers.py	fix(credential-pool): correct pool rotation when weekly usage limit is reached	2026-05-25 06:32:30 -07:00
anthropic_adapter.py	fix(security): close TOCTOU window when saving Claude Code OAuth credentials (#21152)	2026-05-24 17:45:12 -07:00
async_utils.py	fix(async): close unscheduled coroutines in all threadsafe bridges (#26584)	2026-05-15 14:00:01 -07:00
auxiliary_client.py	refactor(codex): drop SDK responses.stream() helper; consume events directly (#33042)	2026-05-27 00:30:06 -07:00
azure_identity_adapter.py	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
background_review.py	fix(background-review): allow pinned skills to be improved	2026-05-23 22:57:42 -07:00
bedrock_adapter.py	chore(deps): lazy-install boto3/botocore for bedrock adapter	2026-05-17 02:31:18 -07:00
browser_provider.py	fix(browser): self-review pass — dead-import, log levels, future-proofing	2026-05-17 04:04:15 -07:00
browser_registry.py	fix(browser): self-review pass — dead-import, log levels, future-proofing	2026-05-17 04:04:15 -07:00
chat_completion_helpers.py	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
codex_responses_adapter.py	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
codex_runtime.py	refactor(codex): drop SDK responses.stream() helper; consume events directly (#33042)	2026-05-27 00:30:06 -07:00
context_compressor.py	fix(compressor): ABC compliance — total_tokens, api_mode, logger consistency	2026-05-23 17:38:19 -07:00
context_engine.py	fix(compressor): ABC compliance — total_tokens, api_mode, logger consistency	2026-05-23 17:38:19 -07:00
context_references.py	fix(agent): fall back when rg is blocked for @folder references	2026-04-20 01:56:41 -07:00
conversation_compression.py	fix(cli): synchronize HERMES_SESSION_ID across environment and contextvar during session switches	2026-05-23 17:46:55 -07:00
conversation_loop.py	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
copilot_acp_client.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
credential_persistence.py	fix: avoid persisting borrowed credential secrets (#31416)	2026-05-25 00:32:08 -07:00
credential_pool.py	fix(anthropic): API-key path skips OAuth autodiscovery + prunes stale entries	2026-05-25 17:41:40 -07:00
credential_sources.py	docs(auth): replace stale 'hermes login' references with 'hermes auth add'	2026-05-26 15:41:11 -07:00
curator.py	feat(curator): hint at `hermes curator pin` in the rename block (#23212)	2026-05-10 06:44:53 -07:00
curator_backup.py	fix(skills): prune dependency/venv dirs from all skill scanners (#30042)	2026-05-21 14:18:02 -07:00
display.py	feat(cli): show todo progress as done/total fraction	2026-05-23 21:03:51 -07:00
error_classifier.py	fix(codex-responses): gracefully recover from invalid_encrypted_content (salvage #10144) (#33035)	2026-05-26 22:01:17 -07:00
file_safety.py	fix(security): block read_file on project-local .env files	2026-05-25 03:40:47 -07:00
gemini_cloudcode_adapter.py	fix(agent/gemini-cloudcode): seed delta defaults for reasoning-only stream chunks	2026-05-14 08:03:56 -07:00
gemini_native_adapter.py	fix(auxiliary): evict async wrappers on poisoned client (follow-up to #23482)	2026-05-11 11:13:20 -07:00
gemini_schema.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010)	2026-04-28 06:46:45 -07:00
google_code_assist.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010)	2026-04-28 06:46:45 -07:00
google_oauth.py	docs(auth): replace stale 'hermes login' references with 'hermes auth add'	2026-05-26 15:41:11 -07:00
i18n.py	feat(i18n): localize all gateway commands + web dashboard, add 8 new locales (16 total) (#22914)	2026-05-10 07:14:14 -07:00
image_gen_provider.py	fix(image_gen): cache xAI ephemeral URL responses to disk (#26942) (#31759)	2026-05-24 18:10:47 -07:00
image_gen_registry.py	fix(plugins): filter resolution by is_available() in web + image_gen registries	2026-05-13 22:31:28 -07:00
image_routing.py	fix(agent): consult supports_vision override in auto-mode routing	2026-05-20 23:27:10 -07:00
insights.py	Merge branch 'main' into feat/dashboard-skill-analytics	2026-04-20 05:25:49 -07:00
iteration_budget.py	refactor(run_agent): extract OpenAI proxy, safe stdio, IterationBudget	2026-05-16 17:59:32 -07:00
lmstudio_reasoning.py	feat(agent): add lmstudio integration	2026-04-28 12:27:36 -07:00
manual_compression_feedback.py	fix(compression): include system prompt + tool schemas in token estimates (#18265)	2026-04-30 23:03:54 -07:00
markdown_tables.py	fix(cli): vertical fallback for markdown tables wider than terminal (#23948)	2026-05-11 16:49:13 -07:00
memory_manager.py	🐛 fix(memory): require newline after context tag	2026-05-18 10:53:08 -07:00
memory_provider.py	docs(agent): remove stale BuiltinMemoryProvider references from memory module docstrings	2026-05-05 13:33:49 -07:00
message_sanitization.py	refactor(run_agent): extract message sanitization to agent/message_sanitization.py	2026-05-16 17:41:09 -07:00
model_metadata.py	chore(models): drop retired grok-4-1-fast from metadata, tests, docs	2026-05-25 14:51:43 -07:00
models_dev.py	fix(xai): resolve Grok Build context for OAuth	2026-05-22 13:05:36 -07:00
moonshot_schema.py	fix(moonshot): strip $ref siblings and collapse tuple items in tool schemas (#27104)	2026-05-16 13:02:19 -07:00
nous_rate_guard.py	codebase: add encoding='utf-8' to all bare open() calls (PLW1514)	2026-05-08 14:27:40 -07:00
onboarding.py	docs(onboarding): lead OpenClaw residue banner with migrate, warn that cleanup breaks OpenClaw (#17507)	2026-04-29 08:08:36 -07:00
plugin_llm.py	feat(plugins): run any LLM call from inside a plugin via ctx.llm (#23194)	2026-05-10 07:09:28 -07:00
portal_tags.py	feat(nous): unified client=hermes-client-v<version> tag on every Portal request (#24779)	2026-05-12 20:49:20 -07:00
process_bootstrap.py	refactor(run_agent): extract OpenAI proxy, safe stdio, IterationBudget	2026-05-16 17:59:32 -07:00
prompt_builder.py	feat(security): promptware defense — shared threat patterns + memory load-time scan + tool-result delimiters (#32269)	2026-05-25 14:52:24 -07:00
prompt_caching.py	fix(cache): kill long-lived prefix layout — system prompt is now byte-static within a session (#24778)	2026-05-12 20:46:04 -07:00
rate_limit_tracker.py	refactor: remove dead code — 1,784 lines across 77 files (#9180)	2026-04-13 16:32:04 -07:00
redact.py	fix(debug): redact BlueBubbles webhook secrets	2026-05-24 15:43:48 -07:00
retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
shell_hooks.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
skill_bundles.py	feat(skills): add skill bundles — alias /<name> loads multiple skills (#28373)	2026-05-18 21:38:05 -07:00
skill_commands.py	fix(skills): load symlinked skill slash commands	2026-05-18 00:34:29 -07:00
skill_preprocessing.py	fix: treat inline-shell timeout guard as timeout	2026-05-18 19:36:04 -07:00
skill_utils.py	fix(skills): load Linux-tagged skills on Termux (android sys.platform)	2026-05-21 19:08:38 -07:00
stream_diag.py	refactor(run_agent): extract stream diagnostics to agent/stream_diag.py	2026-05-16 18:28:17 -07:00
subdirectory_hints.py	fix(subdirectory_hints): prevent loading AGENTS.md outside workspace	2026-05-25 23:17:33 -07:00
system_prompt.py	fix(profiles): cross-profile soft guard on file-write tools + system-prompt hint (#31290)	2026-05-24 00:38:17 -07:00
think_scrubber.py	fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924) (#20184)	2026-05-05 04:33:38 -07:00
title_generator.py	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
tool_dispatch_helpers.py	feat(security): promptware defense — shared threat patterns + memory load-time scan + tool-result delimiters (#32269)	2026-05-25 14:52:24 -07:00
tool_executor.py	fix(cli): surface tool failures with specific error messages	2026-05-23 21:03:51 -07:00
tool_guardrails.py	fix: add recovery hints to loop guard warnings	2026-05-19 00:12:12 -07:00
tool_result_classification.py	fix: classify landed file mutations with diagnostics	2026-05-13 06:46:23 -07:00
trajectory.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
transcription_provider.py	feat(stt): add register_transcription_provider() plugin hook	2026-05-25 01:41:19 -07:00
transcription_registry.py	feat(stt): add register_transcription_provider() plugin hook	2026-05-25 01:41:19 -07:00
tts_provider.py	feat(tts): add register_tts_provider() plugin hook (closes #30398)	2026-05-24 18:04:54 -07:00
tts_registry.py	feat(tts): add register_tts_provider() plugin hook (closes #30398)	2026-05-24 18:04:54 -07:00
usage_pricing.py	fix(pricing): add deepseek-v4-pro to official docs pricing table	2026-05-12 16:32:57 -07:00
video_gen_provider.py	feat(video_gen): unified video_generate tool with pluggable provider backends (#25126)	2026-05-13 16:39:41 -07:00
video_gen_registry.py	feat(video_gen): unified video_generate tool with pluggable provider backends (#25126)	2026-05-13 16:39:41 -07:00
web_search_provider.py	fix(web): align _LEGACY_PREFERENCE with legacy 7-provider order + doc cleanup	2026-05-13 22:31:28 -07:00
web_search_registry.py	fix(web): align _LEGACY_PREFERENCE with legacy 7-provider order + doc cleanup	2026-05-13 22:31:28 -07:00