When Science Fails to Reproduce Itself
Open Science Collaboration (2015): 270 researchers attempted to replicate 100 psychology studies from top journals. Of 97 original studies claiming statistically significant results, only 36 replicated. Effect sizes in replications were half the magnitude of originals. The foundation of psychological science was shakier than anyone realized.
Replication effects were HALF the size of original claims
Journals prefer "positive" results. Studies that find nothing get filed away. Only exciting (often exaggerated) findings get published.
Researchers analyze data many ways until p < 0.05 appears, a practice known as p-hacking. With enough tries, random noise looks significant. "Torture the data until it confesses."
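A minimal simulation makes the point concrete. Each simulated "study" below runs twenty analyses of pure noise; the sample size, number of analysis paths, and seed are illustrative assumptions, and the p-value uses a normal approximation rather than a full t-test.

```python
import math
import random

def p_value(sample):
    """Two-sided p-value for 'mean = 0', normal approximation to the t-test."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    z = mean / math.sqrt(var / n)
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

random.seed(0)
trials = 2000
hits = 0
for _ in range(trials):
    # 20 "analysis paths", each testing an independent batch of pure noise
    ps = [p_value([random.gauss(0, 1) for _ in range(30)])
          for _ in range(20)]
    if min(ps) < 0.05:
        hits += 1

print(f"At least one p < 0.05 in {hits / trials:.0%} of null studies")
```

With twenty independent looks at noise, the chance of at least one "significant" result is about 1 - 0.95^20, roughly two thirds, even though every true effect is exactly zero.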
Small sample sizes can detect effects only if they're huge, or lucky. Real small effects get missed; flukes get magnified.
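The "flukes get magnified" half of this is the winner's curse, and it can be sketched in a few lines. The true effect size, sample size, and seed below are illustrative assumptions; the test is again a normal approximation.

```python
import math
import random

def estimate_and_p(sample):
    """Return (mean estimate, two-sided p) for 'mean = 0', normal approx."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    z = mean / math.sqrt(var / n)
    p = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
    return mean, p

random.seed(1)
true_effect = 0.2      # a real but small effect, in SD units
n = 20                 # an underpowered sample size
sig_effects = []
for _ in range(5000):
    mean, p = estimate_and_p([random.gauss(true_effect, 1) for _ in range(n)])
    if p < 0.05:
        sig_effects.append(mean)   # only "significant" studies get published

power = len(sig_effects) / 5000
avg_sig = sum(sig_effects) / len(sig_effects)
print(f"power = {power:.0%}; mean published estimate = {avg_sig:.2f} "
      f"(true effect = {true_effect})")
```

Most studies of this real effect come up non-significant, and the ones that clear p < 0.05 overestimate it severely: significance filtering selects the lucky, inflated draws.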
"Hypothesizing After Results are Known" (HARKing). Present exploratory findings as if you predicted them all along. Distorts true discovery rates.
Contextual differences matter. Effects that work in one culture, time, or setting may not transfer. Science assumes universality that doesn't exist.
Publish or perish. Novel findings get jobs; replications don't. The entire career system rewards flash over rigor.
The crisis sparked a "credibility revolution": positive structural changes to make science more reliable.
The replication crisis isn't just psychology. Cancer biology: only 11% of 53 "landmark" studies replicated. Economics: 40% of 18 experimental economics papers failed. Medicine: most clinical trials don't replicate in practice. We've been treating p < 0.05 as truth when it's just "probably not random noise." Science is self-correcting, but it took a crisis to trigger the correction.