Chain-of-Thought Prompting: Make AI Think Step by Step
By Dorian Laurenceau
📅 Last reviewed: April 24, 2026. Updated with April 2026 findings and community feedback.
Have you ever asked an AI a complex question only to receive a wrong or shallow answer? The solution might be simpler than you think: ask the AI to think step by step.
Chain-of-Thought (CoT) prompting is one of the most powerful techniques in modern prompt engineering. It transforms how Large Language Models reason through problems-and the results are remarkable.
<!-- manual-insight -->
Reddit's honest verdict: CoT isn't magic anymore, but the habit still pays
Scroll through r/PromptEngineering or r/LocalLLaMA in 2026 and you'll notice something: nobody writes "Let's think step by step" in their prompts anymore. Not because it stopped working, but because the frontier models (Claude Opus 4.6, GPT-5.3, Gemini 3 Pro) now do it internally through extended thinking / reasoning modes. You pay for the tokens either way; you just don't see them.
That shift has created a two-speed reality that most tutorials miss:
- →On frontier reasoning models, explicit CoT instructions are often redundant, and occasionally harmful — you end up double-reasoning and the outputs get verbose without gaining accuracy. The practical advice from the Reddit community has been to trust the built-in thinking and optimise your prompt for the answer format you want, not the process.
- →On smaller / cheaper models (Llama 3.1 8B, Mistral Small, Haiku), the original Wei et al. trick is still the single cheapest accuracy boost you can get. The 2022 paper that introduced it — and the follow-up self-consistency paper — are still the right mental model for anything running on edge hardware or tight budgets.
My rule of thumb after shipping a few CoT pipelines: if the model has a "thinking" mode, let it do its job and measure. If it doesn't, ask for reasoning and structure it (numbered steps, a final Answer: line) so you can parse output reliably. Don't copy-paste the 2022 prompt wholesale into a 2026 model and assume you're getting 2022 gains.
Learn AI — From Prompts to Agents
What Is Chain-of-Thought Prompting?
Chain-of-Thought prompting is a technique that encourages AI models to break down complex problems into intermediate reasoning steps before arriving at a final answer.
Instead of jumping directly to a conclusion, the model "thinks aloud," showing its work just like a student solving a math problem on a whiteboard.
Standard Prompting vs. Chain-of-Thought
Standard prompting:
Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?
A: 11
Chain-of-Thought prompting:
Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?
A: Roger started with 5 balls. He bought 2 cans × 3 balls = 6 balls. So he now has 5 + 6 = 11 tennis balls.
The answer is the same, but the reasoning is explicit-and for harder problems, this transparency dramatically improves accuracy.
Why Does Chain-of-Thought Work?
CoT prompting works because it mirrors how humans solve complex problems: by decomposing them into manageable steps.
Key Benefits
- →Improved accuracy on complex tasks: Research shows CoT can boost performance by 10-20% on reasoning benchmarks
- →Transparency: You can verify how the AI reached its conclusion
- →Error detection: When reasoning steps are visible, mistakes are easier to spot
- →Better performance on multi-step problems: Math, logic puzzles, and analytical tasks see the biggest gains
The Research Behind It
The landmark 2022 paper by Wei et al. at Google demonstrated that Chain-of-Thought prompting enabled PaLM 540B to improve from 55% to 74% accuracy on the GSM8K math benchmark-a 19-point jump simply by changing how the prompt was structured.
"Chain-of-thought prompting allows models to decompose multi-step problems into intermediate steps, solving problems that cannot be solved with standard prompting." , Wei et al., 2022
Zero-Shot Chain-of-Thought: The Magic Phrase
You don't always need elaborate examples. Sometimes, a simple phrase is enough.
Zero-Shot CoT involves adding a trigger phrase to your prompt:
"Let's think step by step."
That's it. This single sentence can unlock reasoning capabilities in the model without providing any examples.
Example
Without Zero-Shot CoT:
Is 17 a prime number? A: Yes.
With Zero-Shot CoT:
Is 17 a prime number? Let's think step by step.
A: To check if 17 is prime, I need to see if it's divisible by any number other than 1 and itself.
- →17 ÷ 2 = 8.5 (not divisible)
- →17 ÷ 3 = 5.67 (not divisible)
- →17 ÷ 4 = 4.25 (not divisible)
- →I only need to check up to √17 ≈ 4.1
Since 17 is not divisible by 2, 3, or 4, 17 is a prime number.
The reasoning is now explicit and verifiable.
Self-Consistency: Taking CoT Further
What if the model's reasoning path leads to an error? Self-Consistency addresses this by generating multiple reasoning chains and selecting the most common answer.
How It Works
- →Prompt the model with the same question multiple times
- →Let it generate different reasoning paths
- →Take the majority vote on the final answer
This ensemble approach reduces the impact of occasional reasoning errors and improves reliability-especially for problems with definitive answers.
When to Use Chain-of-Thought Prompting
CoT is most effective for:
- →Mathematical reasoning, arithmetic, algebra, word problems
- →Logical deduction, puzzles, constraint satisfaction
- →Multi-step analysis, business decisions, comparisons
- →Commonsense reasoning, everyday scenarios requiring inference
When It's Less Useful
- →Simple factual recall ("What's the capital of France?")
- →Creative writing without analytical components
- →Very small models (under ~10B parameters see limited gains)
Key Takeaways
- →Chain-of-Thought prompting makes AI reason step-by-step, improving accuracy on complex tasks
- →Zero-Shot CoT works with just "Let's think step by step"
- →Self-Consistency uses multiple reasoning paths for more reliable answers
- →CoT is most powerful for math, logic, and analytical problems
Ready to Master Advanced Reasoning Techniques?
This article covered the what and why of Chain-of-Thought prompting. But knowing the concept is just the beginning.
In our Module 3, Chain-of-Thought & Reasoning, you'll learn:
- →How to design production-ready CoT prompt templates
- →Advanced techniques: Tree-of-Thoughts, Chain-of-Verification
- →Hands-on workshops to apply CoT to real business problems
- →How to evaluate and measure reasoning quality
- →When to combine CoT with other techniques for maximum impact
Module 3 — Chain-of-Thought & Reasoning
Master advanced reasoning techniques and Self-Consistency methods.
Dorian Laurenceau
Full-Stack Developer & Learning DesignerFull-stack web developer and learning designer. I spent 4 years as a freelance full-stack developer and 4 years teaching React, JavaScript, HTML/CSS and WordPress to adult learners. Today I design learning paths in web development and AI, grounded in learning science. I founded learn-prompting.fr to make AI practical and accessible, and built the Bluff app to gamify political transparency.
Weekly AI Insights
Tools, techniques & news — curated for AI practitioners. Free, no spam.
Free, no spam. Unsubscribe anytime.
→Related Articles
FAQ
What is Chain-of-Thought prompting?+
Chain-of-Thought (CoT) prompting is a technique where you ask AI to show its reasoning step-by-step before giving a final answer. This dramatically improves accuracy on math, logic, and complex reasoning tasks.
How do I use Chain-of-Thought prompting?+
Add 'Let's think step by step' or 'Show your reasoning' to your prompt. For few-shot CoT, provide examples that include the reasoning process, not just the answers.
Does Chain-of-Thought work with all AI models?+
CoT works best with larger models (GPT-4, Claude, Gemini). Smaller models may not benefit as much. The technique is most effective for math, logic, and multi-step reasoning problems.
What is Zero-Shot Chain-of-Thought?+
Zero-Shot CoT adds 'Let's think step by step' without providing examples. It's simpler than few-shot CoT and often works nearly as well for many reasoning tasks.