diff --git a/README.md b/README.md index 856ffb7b5..ac8c31bf3 100644 --- a/README.md +++ b/README.md @@ -181,7 +181,7 @@ When Sisyphus delegates to a subagent, it doesn't pick a model. It picks a **cat | `quick` | Single-file changes, typos | | `ultrabrain` | Hard logic, architecture decisions | -Agent says what kind of work. Harness picks the right model. You touch nothing. +Agent says what kind of work. Harness picks the right model. `ultrabrain` now routes to GPT-5.4 xhigh by default. You touch nothing. ### Claude Code Compatibility diff --git a/docs/guide/agent-model-matching.md b/docs/guide/agent-model-matching.md index ff6ac0d2d..dfebed181 100644 --- a/docs/guide/agent-model-matching.md +++ b/docs/guide/agent-model-matching.md @@ -147,11 +147,11 @@ When agents delegate work, they don't pick a model name — they pick a **catego | Category | When Used | Fallback Chain | | -------------------- | -------------------------- | -------------------------------------------- | | `visual-engineering` | Frontend, UI, CSS, design | Gemini 3.1 Pro → GLM 5 → Claude Opus | -| `ultrabrain` | Maximum reasoning needed | GPT-5.3 Codex → Gemini 3.1 Pro → Claude Opus | +| `ultrabrain` | Maximum reasoning needed | GPT-5.4 → Gemini 3.1 Pro → Claude Opus | | `deep` | Deep coding, complex logic | GPT-5.3 Codex → Claude Opus → Gemini 3.1 Pro | | `artistry` | Creative, novel approaches | Gemini 3.1 Pro → Claude Opus → GPT-5.4 | | `quick` | Simple, fast tasks | Claude Haiku → Gemini Flash → GPT-5-Nano | -| `unspecified-high` | General complex work | GPT-5.4 → Claude Opus → GLM 5 → K2P5 | +| `unspecified-high` | General complex work | Claude Opus → GPT-5.4 → GLM 5 → K2P5 | | `unspecified-low` | General standard work | Claude Sonnet → GPT-5.3 Codex → Gemini Flash | | `writing` | Text, docs, prose | Gemini Flash → Claude Sonnet | @@ -190,7 +190,7 @@ See the [Orchestration System Guide](./orchestration.md) for how agents dispatch "categories": { "quick": { "model": "opencode/gpt-5-nano" }, "unspecified-low": { "model": "anthropic/claude-sonnet-4-6" }, - "unspecified-high": { "model": "openai/gpt-5.4-high" }, + "unspecified-high": { "model": "anthropic/claude-opus-4-6", "variant": "max" }, "visual-engineering": { "model": "google/gemini-3.1-pro", "variant": "high", diff --git a/docs/guide/orchestration.md b/docs/guide/orchestration.md index 27b2afd9c..6dd8d2caa 100644 --- a/docs/guide/orchestration.md +++ b/docs/guide/orchestration.md @@ -296,12 +296,12 @@ task({ category: "quick", prompt: "..." }); // "Just get it done fast" | Category | Model | When to Use | | -------------------- | ---------------------- | ----------------------------------------------------------- | | `visual-engineering` | Gemini 3.1 Pro | Frontend, UI/UX, design, styling, animation | -| `ultrabrain` | GPT-5.3 Codex (xhigh) | Deep logical reasoning, complex architecture decisions | +| `ultrabrain` | GPT-5.4 (xhigh) | Deep logical reasoning, complex architecture decisions | | `artistry` | Gemini 3.1 Pro (high) | Highly creative or artistic tasks, novel ideas | | `quick` | Claude Haiku 4.5 | Trivial tasks - single file changes, typo fixes | | `deep` | GPT-5.3 Codex (medium) | Goal-oriented autonomous problem-solving, thorough research | | `unspecified-low` | Claude Sonnet 4.6 | Tasks that don't fit other categories, low effort | -| `unspecified-high` | GPT-5.4 (high) | Tasks that don't fit other categories, high effort | +| `unspecified-high` | Claude Opus 4.6 (max) | Tasks that don't fit other categories, high effort | | `writing` | Gemini 3 Flash | Documentation, prose, technical writing | ### Skills: Domain-Specific Instructions diff --git a/docs/guide/overview.md b/docs/guide/overview.md index 2afcbd4c1..800586360 100644 --- a/docs/guide/overview.md +++ b/docs/guide/overview.md @@ -101,7 +101,7 @@ Use Hephaestus when you need deep architectural reasoning, complex debugging acr - **Multi-model orchestration.** Pure Codex is single-model. OmO routes different tasks to different models automatically. GPT for deep reasoning. Gemini for frontend. Haiku for speed. The right brain for the right job. - **Background agents.** Fire 5+ agents in parallel. Something Codex simply cannot do. While one agent writes code, another researches patterns, another checks documentation. Like a real dev team. -- **Category system.** Tasks are routed by intent, not model name. `visual-engineering` gets Gemini. `ultrabrain` gets GPT-5.3 Codex. `quick` gets Haiku. No manual juggling. +- **Category system.** Tasks are routed by intent, not model name. `visual-engineering` gets Gemini. `ultrabrain` gets GPT-5.4. `quick` gets Haiku. No manual juggling. - **Accumulated wisdom.** Subagents learn from previous results. Conventions discovered in task 1 are passed to task 5. Mistakes made early aren't repeated. The system gets smarter as it works. ### Prometheus: The Strategic Planner @@ -193,13 +193,13 @@ You can override specific agents or categories in your config: }, // General high-effort work - "unspecified-high": { "model": "openai/gpt-5.4", "variant": "high" }, + "unspecified-high": { "model": "anthropic/claude-opus-4-6", "variant": "max" }, // Quick tasks: use the cheapest models "quick": { "model": "anthropic/claude-haiku-4-5" }, - // Deep reasoning: GPT-5.3-codex - "ultrabrain": { "model": "openai/gpt-5.3-codex", "variant": "xhigh" }, + // Deep reasoning: GPT-5.4 + "ultrabrain": { "model": "openai/gpt-5.4", "variant": "xhigh" }, }, } ``` diff --git a/docs/reference/configuration.md b/docs/reference/configuration.md index 5d1c03586..5f06f30be 100644 --- a/docs/reference/configuration.md +++ b/docs/reference/configuration.md @@ -100,7 +100,7 @@ Here's a practical starting configuration: "unspecified-low": { "model": "anthropic/claude-sonnet-4-6" }, // unspecified-high — complex work - "unspecified-high": { "model": "openai/gpt-5.4-high" }, + "unspecified-high": { "model": "anthropic/claude-opus-4-6", "variant": "max" }, // writing — docs/prose "writing": { "model": "google/gemini-3-flash" }, @@ -225,12 +225,12 @@ Domain-specific model delegation used by the `task()` tool. When Sisyphus delega | Category | Default Model | Description | | -------------------- | ------------------------------- | ---------------------------------------------- | | `visual-engineering` | `google/gemini-3.1-pro` (high) | Frontend, UI/UX, design, animation | -| `ultrabrain` | `openai/gpt-5.3-codex` (xhigh) | Deep logical reasoning, complex architecture | +| `ultrabrain` | `openai/gpt-5.4` (xhigh) | Deep logical reasoning, complex architecture | | `deep` | `openai/gpt-5.3-codex` (medium) | Autonomous problem-solving, thorough research | | `artistry` | `google/gemini-3.1-pro` (high) | Creative/unconventional approaches | | `quick` | `anthropic/claude-haiku-4-5` | Trivial tasks, typo fixes, single-file changes | | `unspecified-low` | `anthropic/claude-sonnet-4-6` | General tasks, low effort | -| `unspecified-high` | `openai/gpt-5.4` (high) | General tasks, high effort | +| `unspecified-high` | `anthropic/claude-opus-4-6` (max) | General tasks, high effort | | `writing` | `google/gemini-3-flash` | Documentation, prose, technical writing | > **Note**: Built-in defaults only apply if the category is present in your config. Otherwise the system default model is used. @@ -283,12 +283,12 @@ Disable categories: `{ "disabled_categories": ["ultrabrain"] }` | Category | Default Model | Provider Priority | | ---------------------- | ------------------- | -------------------------------------------------------------- | | **visual-engineering** | `gemini-3.1-pro` | `gemini-3.1-pro` → `glm-5` → `claude-opus-4-6` | -| **ultrabrain** | `gpt-5.3-codex` | `gpt-5.3-codex` → `gemini-3.1-pro` → `claude-opus-4-6` | +| **ultrabrain** | `gpt-5.4` | `gpt-5.4` → `gemini-3.1-pro` → `claude-opus-4-6` | | **deep** | `gpt-5.3-codex` | `gpt-5.3-codex` → `claude-opus-4-6` → `gemini-3.1-pro` | | **artistry** | `gemini-3.1-pro` | `gemini-3.1-pro` → `claude-opus-4-6` → `gpt-5.4` | | **quick** | `claude-haiku-4-5` | `claude-haiku-4-5` → `gemini-3-flash` → `gpt-5-nano` | | **unspecified-low** | `claude-sonnet-4-6` | `claude-sonnet-4-6` → `gpt-5.3-codex` → `gemini-3-flash` | -| **unspecified-high** | `gpt-5.4` | `gpt-5.4` → `claude-opus-4-6` → `glm-5` → `k2p5` → `kimi-k2.5` | +| **unspecified-high** | `claude-opus-4-6` | `claude-opus-4-6` → `gpt-5.4` → `glm-5` → `k2p5` → `kimi-k2.5` | | **writing** | `gemini-3-flash` | `gemini-3-flash` → `claude-sonnet-4-6` | Run `bunx oh-my-opencode doctor --verbose` to see effective model resolution for your config. diff --git a/docs/reference/features.md b/docs/reference/features.md index a89510a78..488803590 100644 --- a/docs/reference/features.md +++ b/docs/reference/features.md @@ -108,12 +108,12 @@ By combining these two concepts, you can generate optimal agents through `task`. | Category | Default Model | Use Cases | | -------------------- | ------------------------------- | --------------------------------------------------------------------------------------------------------------------------- | | `visual-engineering` | `google/gemini-3.1-pro` | Frontend, UI/UX, design, styling, animation | -| `ultrabrain` | `openai/gpt-5.3-codex` (xhigh) | Deep logical reasoning, complex architecture decisions requiring extensive analysis | +| `ultrabrain` | `openai/gpt-5.4` (xhigh) | Deep logical reasoning, complex architecture decisions requiring extensive analysis | | `deep` | `openai/gpt-5.3-codex` (medium) | Goal-oriented autonomous problem-solving. Thorough research before action. For hairy problems requiring deep understanding. | | `artistry` | `google/gemini-3.1-pro` (high) | Highly creative/artistic tasks, novel ideas | | `quick` | `anthropic/claude-haiku-4-5` | Trivial tasks - single file changes, typo fixes, simple modifications | | `unspecified-low` | `anthropic/claude-sonnet-4-6` | Tasks that don't fit other categories, low effort required | -| `unspecified-high` | `openai/gpt-5.4` (high) | Tasks that don't fit other categories, high effort required | +| `unspecified-high` | `anthropic/claude-opus-4-6` (max) | Tasks that don't fit other categories, high effort required | | `writing` | `google/gemini-3-flash` | Documentation, prose, technical writing | ### Usage diff --git a/src/tools/AGENTS.md b/src/tools/AGENTS.md index b03dc5bbe..d7b0e840c 100644 --- a/src/tools/AGENTS.md +++ b/src/tools/AGENTS.md @@ -92,12 +92,12 @@ | Category | Model | Domain | |----------|-------|--------| | visual-engineering | gemini-3.1-pro high | Frontend, UI/UX | -| ultrabrain | gpt-5.3-codex xhigh | Hard logic | +| ultrabrain | gpt-5.4 xhigh | Hard logic | | deep | gpt-5.3-codex medium | Autonomous problem-solving | | artistry | gemini-3.1-pro high | Creative approaches | | quick | claude-haiku-4-5 | Trivial tasks | | unspecified-low | claude-sonnet-4-6 | Moderate effort | -| unspecified-high | gpt-5.4 high | High effort | +| unspecified-high | claude-opus-4-6 max | High effort | | writing | kimi-k2p5 | Documentation | ## HOW TO ADD A TOOL