Enhance BatchRunner and AIAgent with new configuration options, default model now opus 4.6, default summarizer gemini flash 3

- Added `max_tokens`, `reasoning_config`, and `prefill_messages` parameters to `BatchRunner` and `AIAgent` for improved model response control. - Updated CLI to support new options for reasoning effort and prefill messages from a JSON file. - Modified example configuration files to reflect changes in default model and summary model. - Improved error handling for loading prefill messages and reasoning configurations in the CLI. - Updated documentation to include new parameters and usage examples.
2026-07-19 15:18:03 +00:00 · 2026-02-08 10:49:24 +00:00 · 2026-02-08 10:49:24 +00:00 · f12ea1bc02
commit f12ea1bc02
parent fa76a331b0
7 changed files with 324 additions and 40 deletions
--- a/cli-config.yaml.example
+++ b/cli-config.yaml.example
@ -7,7 +7,7 @@
 # =============================================================================
 model:
  # Default model to use (can be overridden with --model flag)
-  default: "anthropic/claude-sonnet-4"
+  default: "anthropic/claude-opus-4.6"
  
  # API configuration (falls back to OPENROUTER_API_KEY env var)
  # api_key: "your-key-here"  # Uncomment to set here instead of .env
@ -140,7 +140,7 @@ compression:
  
  # Model to use for generating summaries (fast/cheap recommended)
  # This model compresses the middle turns into a concise summary
-  summary_model: "google/gemini-2.0-flash-001"
+  summary_model: "google/gemini-3-flash-preview"

 # =============================================================================
 # Agent Behavior