feat(categories): change quick category default model from claude-haiku-4-5 to gpt-5.4-mini

GPT-5.4-mini provides stronger reasoning at comparable speed and cost. Haiku remains as the next fallback priority in the chain.

Changes:
- DEFAULT_CATEGORIES quick model: anthropic/claude-haiku-4-5 → openai/gpt-5.4-mini
- Fallback chain: gpt-5.4-mini → haiku → gemini-3-flash → minimax-m2.5 → gpt-5-nano
- OpenAI-only catalog: quick uses gpt-5.4-mini directly
- Think-mode: add gpt-5-4-mini and gpt-5-4-nano high variants
- Update all documentation references
2026-03-18 18:31:11 +09:00
parent 826284f3d9
commit 22fd976eb9
15 changed files with 66 additions and 41 deletions
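
The fallback chain in the commit message can be read as a first-match lookup over an ordered list. The sketch below is illustrative only: `QUICK_FALLBACK_CHAIN` and `resolveQuickModel` are hypothetical names, not identifiers from this repository, and the real resolution logic may differ. Model IDs are copied from the commit message without provider prefixes, with "haiku" expanded to `claude-haiku-4-5` as in the subject line.

```typescript
// Hypothetical sketch: ordered fallback chain for the `quick` category,
// in the priority order stated in the commit message.
const QUICK_FALLBACK_CHAIN: readonly string[] = [
  "gpt-5.4-mini",
  "claude-haiku-4-5",
  "gemini-3-flash",
  "minimax-m2.5",
  "gpt-5-nano",
];

// Returns the highest-priority model that the caller reports as available,
// or undefined when none of the chain's models can be used.
function resolveQuickModel(available: ReadonlySet<string>): string | undefined {
  return QUICK_FALLBACK_CHAIN.find((model) => available.has(model));
}

// Usage: with only Haiku and gpt-5-nano available, Haiku wins because it
// sits earlier in the chain.
const picked = resolveQuickModel(new Set(["gpt-5-nano", "claude-haiku-4-5"]));
// picked === "claude-haiku-4-5"
```

Keeping the chain as plain ordered data means a change of default, like the one in this commit, is a reordering of the list and needs no new branching logic.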
--- a/docs/guide/overview.md
+++ b/docs/guide/overview.md
@@ -41,7 +41,7 @@ We used to call this "Claude Code on steroids." That was wrong.

 This isn't about making Claude Code better. It's about breaking free from the idea that one model, one provider, one way of working is enough. Anthropic wants you locked in. OpenAI wants you locked in. Everyone wants you locked in.

-Oh My OpenCode doesn't play that game. It orchestrates across models, picking the right brain for the right job. Claude for orchestration. GPT for deep reasoning. Gemini for frontend. Haiku for quick tasks. All working together, automatically.
+Oh My OpenCode doesn't play that game. It orchestrates across models, picking the right brain for the right job. Claude for orchestration. GPT for deep reasoning. Gemini for frontend. GPT-5.4 Mini for quick tasks. All working together, automatically.

 ---

@@ -99,9 +99,9 @@ Use Hephaestus when you need deep architectural reasoning, complex debugging acr

 **Why this beats vanilla Codex CLI:**

- **Multi-model orchestration.** Pure Codex is single-model. OmO routes different tasks to different models automatically. GPT for deep reasoning. Gemini for frontend. Haiku for speed. The right brain for the right job.
+- **Multi-model orchestration.** Pure Codex is single-model. OmO routes different tasks to different models automatically. GPT for deep reasoning. Gemini for frontend. GPT-5.4 Mini for speed. The right brain for the right job.
 - **Background agents.** Fire 5+ agents in parallel. Something Codex simply cannot do. While one agent writes code, another researches patterns, another checks documentation. Like a real dev team.
- **Category system.** Tasks are routed by intent, not model name. `visual-engineering` gets Gemini. `ultrabrain` gets GPT-5.4. `quick` gets Haiku. No manual juggling.
+- **Category system.** Tasks are routed by intent, not model name. `visual-engineering` gets Gemini. `ultrabrain` gets GPT-5.4. `quick` gets GPT-5.4 Mini. No manual juggling.
 - **Accumulated wisdom.** Subagents learn from previous results. Conventions discovered in task 1 are passed to task 5. Mistakes made early aren't repeated. The system gets smarter as it works.

 ### Prometheus: The Strategic Planner
@@ -195,8 +195,8 @@ You can override specific agents or categories in your config:
    // General high-effort work
    "unspecified-high": { "model": "anthropic/claude-opus-4-6", "variant": "max" },

-    // Quick tasks: use the cheapest models
-    "quick": { "model": "anthropic/claude-haiku-4-5" },
+    // Quick tasks: use GPT-5.4-mini (fast and cheap)
+    "quick": { "model": "openai/gpt-5.4-mini" },

    // Deep reasoning: GPT-5.4
    "ultrabrain": { "model": "openai/gpt-5.4", "variant": "xhigh" },