The AI model routing ladder

AI tools are becoming less like a single chatbot and more like a set of modes. OpenAI's model selector explains different choices for quick answers, deeper reasoning, and more advanced work. Its Enterprise model guidance describes Auto, Instant, Thinking, and Pro-style options. Anthropic's Claude model documentation makes a similar distinction across capability, speed, reasoning, and long-context work.

That creates a practical question for knowledge workers: which mode should you use for which job? The answer is not "always use the smartest model." The answer is to route the task based on risk, ambiguity, and consequence.

The skill

The AI model routing ladder is a simple way to decide how much reasoning to request before you start. It keeps lightweight tasks fast and makes higher-risk tasks slower, more deliberate, and easier to review.

AI model routing ladder

Level 1: Fast
Use for: rewriting, formatting, brainstorming, simple summaries, low-risk drafts
Prompt style: "Give me a quick version."

Level 2: Focused
Use for: structured analysis, comparison, planning, meeting prep, source review
Prompt style: "Work step by step and show assumptions."

Level 3: Thinking
Use for: ambiguous decisions, tradeoffs, technical tasks, financial or legal-adjacent review, strategy
Prompt style: "Think carefully, test alternatives, and state uncertainty."

Level 4: Expert review
Use for: high-consequence work, executive recommendations, public content, customer promises, irreversible actions
Prompt style: "Review like a senior expert. Look for flaws, missing evidence, and failure modes."

A worked example: customer churn analysis

If you ask for "ideas to reduce churn," a fast model may be enough. If you ask whether the business should change pricing, that deserves a higher rung.

Task:
Analyze whether our recent churn spike is caused by onboarding, pricing, product gaps, or support delays.

Routing:
Level 3: Thinking

Why:
- Multiple possible causes
- Needs evidence from several sources
- Could affect customer communication and pricing decisions

Instructions:
- Compare at least three hypotheses
- State what evidence would support or weaken each one
- Separate facts from guesses
- End with confidence level and follow-up data needed

Do not:
- Recommend a pricing change without stronger evidence
- Write customer-facing messaging yet

The prompt

Use this before starting a task when you are unsure how much reasoning it needs:

Before answering, route this task.

Task:
{what I need}

Choose one:
1. Fast
2. Focused
3. Thinking
4. Expert review

Explain the choice in one sentence based on:
- Risk
- Ambiguity
- Consequence
- Need for evidence

Then answer using the chosen level.

If the task could affect customers, money, legal/compliance, public content, private data, or irreversible actions, choose Thinking or Expert review.

The review checklist

Risk: What happens if the answer is wrong?
Ambiguity: Is there one clear answer, or several plausible paths?
Evidence: Does the task need sources, files, data, or citations?
Consequence: Will the answer influence money, customers, policy, or public messaging?
Review: Does the output need a second pass for gaps and failure modes?

Why it works

Using a powerful model for every tiny task wastes time. Using a fast answer for high-consequence work creates quiet risk. The routing ladder gives you a lightweight judgment step before the model starts working.

It also improves prompts. When you say "think carefully" only for tasks that deserve it, your AI workflow becomes more deliberate: fast where speed matters, careful where correctness matters.

Try it today. Take three AI tasks from your week. Label each one Fast, Focused, Thinking, or Expert review. Then rewrite the prompt to match the level.

The AI model routing ladder

The skill

A worked example: customer churn analysis

The prompt

The review checklist

Why it works

Sources

Related posts

The AI definition of done

The AI research plan review