How Skill Acquisition Works

From Novice to Expert: The Science of Learning

Author

Affiliation

Andrew Ellis

Virtual Academy, Bern University of Applied Sciences

Published

23 Jun, 2025

Key Takeaways

Expertise requires progressive skill building through weak → strong methods
Cognitive effort is not a bug—it’s the feature that enables learning through prediction errors
AI bypasses the very struggles that build lasting capabilities
Expert methods actively harm novices by creating cognitive overload and dependency
Real understanding emerges from thousands of effortful practice cycles with prediction errors
Choose your tools wisely: Use AI to reduce irrelevant cognitive load, not to skip essential learning

Connecting to Your Experience

Think back to the exercise you just completed:

Without AI: You engaged in deep thinking and struggle
Single prompt: You got quick answers with low effort
Iterative AI: You maintained some cognitive engagement

Which mode built lasting understanding? Let’s explore the science behind skill acquisition to understand why struggle matters for learning.

The Journey from Novice to Expert

When we learn new skills, our brains undergo a remarkable transformation. Understanding this process helps us appreciate why offloading cognitive processing to AI can prevent genuine expertise development—and why expert-level tools can actively harm novice learning.

The Weak-to-Strong Methods Progression

According to John Anderson’s ACT-R theory (Anderson 1982, 2007), expertise develops through a predictable progression:

1. Weak Methods (Novice Stage)

Weak methods are domain-general problem-solving strategies that novices use when they lack specific knowledge:

Means-End Analysis: What's the difference between where I am and where I need to be?
Working Backward: Start from the goal and work toward current state
Analogy: This reminds me of... (often based on surface features)
Trial and Error: Let me try different things until something works
Hill Climbing: Always move toward what seems better

Example: Novice solving \(3x + 5 = 20\)

Using weak methods:

Goal is to find \(x\)
Current state: \(3x + 5 = 20\)
How to reduce difference?
Maybe subtract \(5\)?
Why? Um… to get \(x\) alone?
OK, so \(3x = 15\)
Now what reduces the difference?
Divide by \(3\)?
\(x = 5\)

Note the deliberative, step-by-step reasoning!

2. Proceduralization (Intermediate Stage)

Repeated patterns become procedures
Still conscious but more fluid
Reduced cognitive load
Faster with fewer errors
Example: Using multiplication tables from memory

During this stage, knowledge begins forming into production rules. These are IF-THEN condition-action pairs:

IF I see “3 × 4” THEN answer “12”
IF equation has form \(ax + b = c\) THEN subtract b from both sides
IF code won’t compile THEN check for syntax errors first

3. Compilation (Expert Stage)

Knowledge becomes automatic through many practice episodes
Pattern recognition dominates
Minimal cognitive load
Fast and accurate
Cannot explain steps anymore!

Production strength determines which rules fire: frequently successful rules become stronger and fire faster. This is why practice improves both speed AND accuracy—stronger productions outcompete weaker alternatives.

Example: Expert solving \(3x + 5 = 20\)

Expert sees equation and can instantly recognize → \(x = 5\)

When asked to explain:

“I just… know?”
“You subtract 5 and divide by 3”
“How did I know that? I’m not sure…”

The knowledge has compiled into automatic procedures.

Why Cognitive Effort Matters: The Biology of Learning

The Prediction Error Learning System

The human brain learns through a sophisticated prediction error system that can easily be bypassed when outsourcing cognitive processing. This biological reality explains why struggle is essential for learning.

Understanding Cognitive Load Theory

The Evolutionary Foundation of Learning

Cognitive Load Theory (CLT) (Sweller 2023) is specifically concerned with how humans acquire biologically secondary knowledge—the academic content that schools exist to teach. Understanding this distinction is crucial for grasping why AI can disrupt learning so profoundly.

Biologically Primary vs. Secondary Knowledge

Knowledge Type	Examples
Biologically Primary (we evolved to learn)	• Recognizing faces and voices • Understanding basic social interactions • Learning spoken language • Basic spatial navigation • Recognizing danger and safety
Biologically Secondary (we didn’t evolve to learn)	• Reading and writing • Mathematics and algebra • Programming and data analysis • Scientific reasoning • Historical analysis

Why This Distinction Matters

Primary knowledge can be learned through natural discovery and play
Secondary knowledge requires explicit instruction and guided practice
Schools exist precisely because secondary knowledge doesn’t develop naturally
AI disruption is particularly damaging for secondary knowledge acquisition

The Cognitive Architecture of Learning

CLT is based on how human cognition actually processes information, using an architecture that mirrors evolution by natural selection (Sweller 2023):

Two Pathways for Novel Information

Humans can acquire new biologically secondary information through two routes:

Discovery During Problem Solving: Working through challenges independently
Learning From Others: Receiving information through instruction

Crucially: Both these acquisition skills are themselves biologically primary—we evolved to be excellent at discovering things and learning from others. This is why humans dominate the mammalian world.

The Processing Bottleneck

Once information is acquired through either pathway, it must be processed through:

Working Memory:

Severely limited capacity (Miller’s 7±2, more recently Cowan’s 4±1 chunks)
Severely limited duration (15-30 seconds without rehearsal)
The bottleneck where all conscious learning occurs

Long-Term Memory:

No known limits of capacity or duration
Where expertise lives through domain-specific information storage
The goal of all instruction

Element Interactivity in Action

Consider solving: \((x + 3)(x - 2) = 0\)

For a novice, each element must be held in working memory: - What \(x\) represents - The meaning of parentheses - Addition and subtraction operations - Implied multiplication - The significance of “equals zero” - How to find solutions

Total: 7+ interacting elements = Working memory overload

For an expert, this is one chunk retrieved from long-term memory: - “Factored form, so \(x = -3\) or \(x = 2\)”

Total: 1 chunk = Minimal working memory load

With Offloading to AI: - Input → Answer

Total: 0 working memory processing = No learning, no long-term memory storage

How Expertise Actually Develops

As De Groot first demonstrated in 1965 (Groot and Groot 1978; Sweller 2024), expertise is domain-specific and results from enormous amounts of domain-specific information stored in long-term memory. This isn’t just “knowing more facts”—it’s having organized knowledge structures (schemas) that allow experts to:

Chunk multiple elements into single units
Recognize patterns automatically
Access procedures without conscious effort
Free up working memory for higher-level thinking

The AI Bypass Problem

How AI Disrupts Cognitive Architecture

Offloading cognitive processing to AI can short-circuit this entire learning system:

Bypasses working memory processing: No cognitive effort means no encoding
Prevents long-term memory storage: Information that isn’t processed isn’t stored
Eliminates schema construction: Students get answers without building knowledge structures
Blocks the novice→expert transition: No pathway from limited to unlimited knowledge

The fundamental problem: AI provides the benefits of expertise (instant access to organized knowledge) without requiring the process that creates expertise (working memory struggle → long-term memory storage).

Pedagogical Implications

The Expert-Novice Teaching Trap

The most dangerous mistake in education is assuming that novices should learn the way experts work. This leads to what Kirschner (Kirschner, Hendrick, and Heal 2022) calls confusing “the epistemology of the expert with the pedagogy for the learner.”

What Experts Do (Producing Knowledge):

Work forward from automated principles
Recognize deep patterns instantly
Apply strong, domain-specific methods
Create and discover new knowledge
Process multiple elements simultaneously

What Novices Need (Learning Knowledge):

Work backward using means-end analysis
Build basic pattern recognition through practice
Start with weak, general methods
Reconstruct and understand existing knowledge
Process elements sequentially to avoid overload

Why Expert Methods Actively Harm Novices

Giving novices expert-level tools isn’t just ineffective—it’s actively harmful to learning:

1. Cognitive Load Mismatch

Experts can handle multiple interacting elements because they’ve chunked them into patterns
Novices become overwhelmed when given the same complexity
Result: Cognitive overload prevents any learning from occurring

2. Production Rule Interference

Experts have strong, successful production rules that fire automatically
Novices need to build these rules through repeated weak-method practice
AI shortcuts prevent the repetition needed for rule strengthening
Result: Weak rules never develop, leaving permanent dependency

3. Schema Construction Bypass

Experts have rich schemas (organized knowledge structures) built through experience
Novices must construct these schemas element by element
AI answers provide the outcome without the construction process
Result: No schemas form, preventing transfer to new situations

4. Metacognitive Damage

Experts know when and why to apply different strategies
Novices develop this awareness through monitoring their own thinking
AI dependency eliminates the need for self-monitoring
Result: Students become unable to evaluate their own understanding

The Dependency Cascade

When novices use expert-level tools prematurely:

Initial success creates false confidence
Weak methods atrophy from disuse
Problem-solving skills deteriorate without practice
Dependency deepens as internal capabilities weaken
Transfer fails when AI isn’t available
Learned helplessness develops when facing novel challenges

This isn’t just “not learning”—it’s unlearning existing capabilities.

Teaching That Matches How Brains Learn

Principle 1: Respect the Developmental Sequence

Weak methods must come first—they’re not a bug, they’re a feature
Proceduralization requires extensive practice with deliberative processes
Compilation happens automatically after sufficient repetition

Principle 2: Calibrate Cognitive Load

Novices need simplified, sequential presentation
Experts can handle complex, parallel processing
The same material must be presented differently based on expertise level

Principle 3: Protect the Struggle Window

Allow sufficient time for independent struggle before AI consultation
Document thinking process before seeking external help
Multiple attempts required before accessing solutions

Principle 4: Scaffold, Don’t Substitute

Use AI to reduce irrelevant cognitive load, not eliminate all effort
Provide worked examples that show the process, not just answers
Gradually fade support as competence develops (scaffolding)

How Prediction Errors Drive Learning

Step 1: Prediction Formation

When you encounter a problem, your brain automatically predicts what should happen based on your current (prior) knowledge:

“If I subtract 5 from both sides, I should get closer to solving for \(x\)”
“If I apply this formula, I should get the right answer”
“This code structure should compile without errors”

Step 2: Reality Check

You attempt the solution and compare the actual outcome to your prediction:

Positive prediction error: “That worked better than expected!”
Negative prediction error: “That didn’t work as I thought it would”
Zero prediction error: “That went exactly as predicted”

Step 3: Neural Update

Your brain uses these errors to update its knowledge:

Dopamine release marks prediction errors as important learning moments
Eligibility traces tag the neural pathways involved for strengthening
Synaptic weights adjust based on the magnitude and direction of the error
Memory consolidation during sleep makes these changes permanent

Example: Learning Algebra Through Prediction Errors

Attempt 1: Student thinks “I need to eliminate the 5, so I’ll add 5 to both sides”

Prediction: This should help solve for x
Reality: Gets \(3x + 10 = 25\) (moving away from solution)
Prediction Error: Negative! This approach made things worse
Learning: Brain tags “adding same number” as unsuccessful strategy

Attempt 2: Student tries “I’ll subtract 5 from both sides”

Prediction: This might work better
Reality: Gets \(3x = 15\) (much cleaner!)
Prediction Error: Positive! This approach worked well
Learning: Brain strengthens “subtract to isolate” pathway

After many cycles: The “subtract to isolate” rule becomes automatic

Why Offloading Cognitive Processing to AI Eliminates the Learning Signal

AI provides perfect answers instantly:

No prediction phase (student doesn’t generate expectations)
No attempt phase (student doesn’t try solutions)
No error phase (no comparison between expected and actual)
Result: Zero prediction errors = zero learning signal

The brain literally has nothing to learn from because the prediction-error cycle never occurs.

This explains why students can watch hundreds of worked examples or get thousands of correct AI answers and still not learn. Learning requires errors, not perfection.

The Desirable Difficulties Principle

Learning requires what researchers call “desirable difficulties” (Bjork and Bjork 2011)—challenges that feel hard but promote long-term retention:

Retrieval Practice: Pulling information from memory strengthens it
Spaced Repetition: Forgetting and relearning creates durable knowledge
Interleaving: Mixing problem types improves discrimination
Generation: Creating answers (even wrong ones) beats passive consumption

The Compilation Process

Real expertise emerges through thousands of practice cycles:

Encounter problem → Make prediction → Attempt solution → Compare to prediction → Get feedback → Neural adjustment
                                                    ↓
                                            Neural pathway strengthens
                                                    ↓
                                            Pattern becomes automatic

Each cycle:

Strengthens neural connections through prediction errors
Reduces processing time through repetition
Frees working memory for higher-level thinking
Enables creative problem-solving through strong foundations

How expertise actually works: When you encounter a problem, multiple production rules compete. The rule with highest “utility” (based on past success through prediction errors) fires. Each successful use strengthens that rule, making it more likely to win future competitions. This is why experts seem to “just know”—their strongest productions fire automatically.

What We Lose When We Skip the Struggle

What We Lose

No Memory Formation: The brain doesn’t encode what it doesn’t process
No Pattern Development: Can’t recognize what we haven’t practiced
No Transfer Ability: Can’t apply unlearned principles to new contexts
No Metacognition: Don’t develop awareness of our own thinking
No Prediction Error Learning: The fundamental mechanism of learning is bypassed

The Expertise Illusion

Offloading cognitive processing to AI creates a dangerous illusion:

Surface Fluency: Perfect answers without understanding
False Confidence: Believing we “get it” without practice
Fragile Knowledge: Collapses without AI support
Limited Transfer: Can’t adapt to novel situations

Research Evidence: The Cost of Cognitive Offloading

Recent studies show alarming impacts of AI use on learning:

68% reduction in critical thinking among knowledge workers with high AI confidence (Lee et al. 2025)
Surface learning only for programming students using ChatGPT (Yang, Hsu, and Wu 2025)
Decreased problem-solving ability in math students using AI tools (Bastani et al. 2024)
“The fluency illusion”: Mistaking AI’s competence for your own

The Biological Reality of Learning

Learning isn’t just a cognitive process—it’s a biological one that physically changes your brain (Oakley et al. 2025):

Sleep Consolidation is Essential

Learning doesn’t stop when practice ends: - Sharp wave ripples during sleep replay daily experiences - Brain decides what to keep vs. discard - Memories transfer from hippocampus to cortex - New neural connections solidify

You can’t download sleep any more than you can download expertise!

The Grokking Phenomenon

Even machinel learning models show this pattern: - Long plateau with no apparent progress - Sudden jump to generalization - Previously dismissed as “overfitting” - Now recognized as deep pattern learning

The Memory Paradox

Flynn Effect Reversal

Since the 1970s-80s, when education moved away from memorization: - IQ scores began declining for the first time in history - “Why memorize when you can look it up?” became the mantra - Exactly when neuroscience was proving memorization builds critical thinking - We abandoned proven methods just as we discovered why they work (Oakley et al. 2025)

Building Real Expertise

Evidence-Based Strategies

Embrace the Struggle
- Your confusion is a necessary stage of learning
- Errors provide valuable prediction error signals
- Difficulty signals brain growth
Practice Retrieval
- Test yourself before checking answers
- Explain concepts without notes
- Teach others what you’re learning
Space Your Learning
- Return to concepts over days/weeks
- Allow forgetting between sessions
- Relearn in different contexts
Vary Your Practice
- Mix problem types
- Change contexts
- Seek novel applications
Protect Your Prediction Errors
- Make attempts before seeking help
- Notice when your expectations are wrong
- Use errors as learning opportunities, not failures

Connecting to Our Exercise

The exercise you just completed demonstrated these principles:

Without AI: You engaged in effortful retrieval and reasoning, generating prediction errors
With AI (single prompt): You practiced prompt engineering but bypassed deep thinking
With AI (iterative): You maintained some cognitive engagement through questioning

Notice how your understanding differed across modes? That’s the compilation process (or its absence) in action.

Reflection Questions

When have you experienced the satisfaction of mastering something difficult?
How might you redesign your learning to embrace rather than avoid cognitive effort?
What would change if we viewed struggle as evidence of growth rather than failure?
How can you protect the prediction error cycle in your own learning?

Quick Reference: How Theories Work Together

Weak Methods (Novices)	Strong Methods (Experts)
Means-end analysis	Pattern recognition
Working backward from goal	Forward chaining from principles
Trial and error	Automated procedures
Surface analogies	Deep structure
Step-by-step conscious	Chunked automatic

Stage	What’s Happening	Teaching Implications
Declarative	Following memorized rules	Provide clear worked examples
Compilation	Repeated sequences chunking	Many similar practice problems
Procedural	Automated expertise	Ready for complex applications

Watch for these signs of cognitive bypassing:

Smooth performance without ability to explain
Cannot handle problem variations
No improvement despite practice
Dependent on AI for basic tasks
Mistaking AI output for understanding
Immediate AI consultation without attempt
No prediction errors (always getting “right” answers)

References

Anderson, John R. 1982. “Acquisition of Cognitive Skill.” Psychological Review 89 (4): 369–406. https://doi.org/10.1037/0033-295X.89.4.369.

———. 2007. How Can the Human Mind Occur in the Physical Universe? Oxford University Press. https://doi.org/10.1093/acprof:oso/9780195324259.001.0001.

Bastani, Hamsa, Osbert Bastani, Alp Sungu, Haosen Ge, Özge Kabakcı, and Rei Mariman. 2024. “Generative AI Can Harm Learning.” SSRN Scholarly Paper. Rochester, NY. July 15, 2024. https://doi.org/10.2139/ssrn.4895486.

Bjork, Elizabeth Ligon, and Robert A. Bjork. 2011. “Making Things Hard on Yourself, but in a Good Way: Creating Desirable Difficulties to Enhance Learning.” In Psychology and the Real World: Essays Illustrating Fundamental Contributions to Society, 56–64. New York, NY, US: Worth Publishers.

Groot, Adriaan D. De, and Adrianus Dingeman de Groot. 1978. Thought and Choice in Chess. Walter de Gruyter. https://books.google.com?id=EI4gr42NwDQC.

Kirschner, Paul, Carl Hendrick, and Jim Heal. 2022. How Teaching Happens: Seminal Works in Teaching and Teacher Effectiveness and What They Mean in Practice. London: Routledge. https://doi.org/10.4324/9781003228165.

Lee, Hao-Ping (Hank), Advait Sarkar, Lev Tankelevitch, Ian Drosos, Sean Rintel, Richard Banks, and Nicholas Wilson. 2025. “The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects From a Survey of Knowledge Workers.” In. https://www.microsoft.com/en-us/research/publication/the-impact-of-generative-ai-on-critical-thinking-self-reported-reductions-in-cognitive-effort-and-confidence-effects-from-a-survey-of-knowledge-workers/.

Oakley, Barbara, Michael Johnston, Kenzen Chen, Eulho Jung, and Terrence Sejnowski. 2025. “The Memory Paradox: Why Our Brains Need Knowledge in an Age of AI.” May 12, 2025. https://doi.org/10.31234/osf.io/3xye5_v2.

Sweller, John. 2023. “The Development of Cognitive Load Theory: Replication Crises and Incorporation of Other Theories Can Lead to Theory Expansion.” Educational Psychology Review 35 (4): 95. https://doi.org/10.1007/s10648-023-09817-2.

———. 2024. “Cognitive Load Theory and Individual Differences.” Learning and Individual Differences 110 (February): 102423. https://doi.org/10.1016/j.lindif.2024.102423.

Yang, Tzu-Chi, Yi-Chuan Hsu, and Jiun-Yu Wu. 2025. “The Effectiveness of ChatGPT in Assisting High School Students in Programming Learning: Evidence from a Quasi-Experimental Research.” Interactive Learning Environments, January, 1–18. https://doi.org/10.1080/10494820.2025.2450659.

Reuse

CC BY 4.0

Citation

BibTeX citation:

@online{ellis2025,
  author = {Ellis, Andrew},
  title = {How {Skill} {Acquisition} {Works}},
  date = {2025-06-23},
  url = {https://virtuelleakademie.github.io/denken-statt-delegieren/workshop/skill-acquisition/},
  langid = {en}
}

For attribution, please cite this work as:

Ellis, Andrew. 2025. “How Skill Acquisition Works.” June 23, 2025. https://virtuelleakademie.github.io/denken-statt-delegieren/workshop/skill-acquisition/.