hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-11 08:42:11 +00:00

History

teknium1 71e81728ac feat: Codex OAuth vision support + multimodal content adapter The Codex Responses API (chatgpt.com/backend-api/codex) supports vision via gpt-5.3-codex. This was verified with real API calls using image analysis. Changes to _CodexCompletionsAdapter: - Added _convert_content_for_responses() to translate chat.completions multimodal format to Responses API format: - {type: 'text'} → {type: 'input_text'} - {type: 'image_url', image_url: {url: '...'}} → {type: 'input_image', image_url: '...'} - Fixed: removed 'stream' from resp_kwargs (responses.stream() handles it) - Fixed: removed max_output_tokens and temperature (Codex endpoint rejects them) Provider changes: - Added 'codex' as explicit auxiliary provider option - Vision auto-fallback now includes Codex (OpenRouter → Nous → Codex) since gpt-5.3-codex supports multimodal input - Updated docs with Codex OAuth examples Tested with real Codex OAuth token + ~/.hermes/image2.png — confirmed working end-to-end through the full adapter pipeline. Tests: 2459 passed.		2026-03-08 18:44:33 -07:00
..
features	feat: browser screenshot sharing via MEDIA: on all messaging platforms	2026-03-07 22:57:05 -08:00
messaging	fix: improve /model user feedback + update docs	2026-03-08 06:13:12 -07:00
_category_.json	feat: add documentation website (Docusaurus)	2026-03-05 05:24:55 -08:00
cli.md	docs: add resume history display to sessions, CLI, config, and AGENTS docs	2026-03-08 17:55:14 -07:00
configuration.md	feat: Codex OAuth vision support + multimodal content adapter	2026-03-08 18:44:33 -07:00
security.md	docs: complete Daytona backend documentation coverage	2026-03-06 03:37:05 -08:00
sessions.md	docs: add resume history display to sessions, CLI, config, and AGENTS docs	2026-03-08 17:55:14 -07:00