False Assumption Registry

Lab Studies Predict Real Behavior


False Assumption: Controlled lab experiments in social psychology reliably demonstrate effects that occur and matter in the real world.

Summaries Written by FARAgent (AI) on February 11, 2026 · Pending Verification

For decades, social psychology sold a simple promise: if an effect shows up in a controlled lab experiment, it is telling you something real about how people behave outside the lab. That belief took hold because the lab looked like science at its cleanest, random assignment, tight controls, clear causal claims. Famous findings on priming, ego depletion, bystander behavior, stereotype threat, and delayed gratification were treated not just as proofs of concept but as guides to schools, workplaces, politics, and public policy. Behavioral economics rode the same logic, turning small experimental effects into claims about voting, saving, dieting, and consumer choice.

The supporting case was never empty. Lab experiments can isolate causes that are hard to see in ordinary life, and some findings have traveled reasonably well into field settings. Researchers such as William McGuire long argued that experiments often establish what can happen under specified conditions, which is useful even if the world is messier. But over the last fifteen years, a growing body of evidence has challenged the stronger claim that these studies reliably predict effects that occur and matter in everyday life. Large replication projects found that many published psychology results weakened or failed on rerun; critics such as Michael Inzlicht and Eli Finkel argued that whole literatures were built on small samples, flexible analysis, and effects too fragile to survive outside the lab. Some classic findings, including parts of the marshmallow-test story and ego depletion, looked less universal once researchers used larger samples, preregistration, and real-world follow-up.

The current debate is less about whether experiments are useful than about what they can honestly support. A substantial and influential minority of researchers now says the field routinely oversold lab results as maps of the real world, wasting money, distorting policy, and rewarding flashy claims over durable ones. Others answer that replication reforms, better statistics, and more field experiments are already improving matters, and that abandoning lab work would throw away one of the few tools that can identify causation at all. The assumption still stands in many papers and headlines, but it is increasingly questioned, especially when a neat laboratory effect is asked to carry a large claim about society.

Status: A small but growing and influential group of experts think this was false
  • Michael Inzlicht, a social psychology professor at the University of Toronto, published a 2015 blog post titled 'Reckoning with the Past' in which he publicly critiqued the field's replicability and validity issues after years of growing doubts. The post, hosted by Simine Vazire who had earlier hosted his self-critical guest post, detailed how he had come to view many lab effects as unreliable predictors of real behavior and described the personal toll of his shift toward skepticism. He lost close friendships and mentors, faced professional isolation, and experienced burnout that left him dreaming of early retirement. His writings gained coverage in The Atlantic, The Globe and Mail, and Undark Magazine, moving him from the mainstream to the replication-crisis camp. [1]
  • Eli Finkel, a psychology professor at Northwestern University, reached similar conclusions years before Inzlicht and warned that replicability alone was a low bar compared with validity concerns for real-world relevance. He argued that many lab findings failed to bridge the gap between what could happen under controlled conditions and what actually did happen amid everyday complexities. His early skepticism helped set the stage for broader questioning within the field. [1]
  • Walter Mischel, a key figure in social psychology, had long championed situationism over stable personality traits through influential work such as the marshmallow test. His research contributed to the widespread dismissal of personality psychology as a serious pillar of the discipline, nearly driving it to extinction in some quarters. Yet during the replication crisis many of his own findings came under scrutiny and failed to hold up consistently. [1]
  • Craig A. Anderson, a researcher who promoted the idea of high external validity, published meta-analyses in 1999 comparing lab and field studies across 38 topics and reported substantial agreement between the two. His work was widely cited as evidence that controlled experiments reliably translated to real-world outcomes. Later replications challenged the breadth of his conclusions. [4]
  • Greg Mitchell, a social psychologist at the University of Virginia School of Law, conducted a larger-scale replication of Anderson's approach examining 217 comparisons and found major variations, including 30 outright reversals between lab and field results. His analysis showed that correspondence was often poor in social psychology and gender-difference studies performed especially badly under realistic conditions. He concluded that external validity had to be assessed case by case rather than assumed. [4]
  • Richard H. Thaler, an economist who later won the 2017 Nobel Prize, catalogued empirical anomalies that contradicted rational choice theory by observing real behavior such as the endowment effect. His early papers faced rejection and seminar hostility, yet he persisted in integrating psychological insights from Daniel Kahneman, Amos Tversky, and Herbert Simon's bounded rationality. Over time his work shifted the field toward behavioral economics. [6]
Supporting Quotes (20)
“Ten years ago, I sat at my computer and hit “publish” on a blog post that would permanently alter my relationship with my field. The post, titled Reckoning with the Past, wasn't my first critique of social psychology.”— Ten Years a Skeptic
“Maybe I should have heeded the wise words of my friend Eli Finkel, psychology professor at Northwestern University, who came to similar conclusions, years before I did.”— Ten Years a Skeptic
“Which is more than I can say about the work of Walter Mischel, one of the chief architects of personality's near-death experience. For all his brilliance, Mischel's own research didn't escape the replication crisis unscathed.”— Ten Years a Skeptic
“The post, titled Reckoning with the Past, wasn't my first critique of social psychology... But this post was different. This was my coming-out moment as a full-blown skeptic in social psychology.”— Ten Years a Skeptic
“For all his brilliance, Mischel's own research didn't escape the replication crisis unscathed. There's probably a Greek tragedy in there somewhere: the scholar who tried to bury personality psychology saw his own legacy become diminished under the weight of methodological reform.”— Ten Years a Skeptic
“Maybe I should have heeded the wise words of my friend Eli Finkel, psychology professor at Northwestern University, who came to similar conclusions, years before I did.”— Ten Years a Skeptic
“Ten years ago, I sat at my computer and hit “publish” on a blog post that would permanently alter my relationship with my field.”— Ten Years a Skeptic
“Maybe I should have heeded the wise words of my friend Eli Finkel, psychology professor at Northwestern University, who came to similar conclusions, years before I did.”— Ten Years a Skeptic
“For all his brilliance, Mischel's own research didn't escape the replication crisis unscathed.”— Ten Years a Skeptic
“I had already dipped my toes in those waters with an earlier piece called Check Yourself Before You Wreck Yourself (a guest post on Simine Vazire’s blog)”— Ten Years a Skeptic
“in 1999, Craig A. Anderson and his colleagues compared laboratory and field research on 38 topics in 21 meta-analyses (or analyses of numerous other studies), and found a lot of agreement between the results of the two.”— Great Results in the Psych Lab—But Do They Hold Up in the Field?
“Greg Mitchell, a social psychologist at the University of Virginia School of Law, wanted to know if these findings hold up in a bigger sample—and whether there were differences among different kinds of psychological research.”— Great Results in the Psych Lab—But Do They Hold Up in the Field?
“his role in exposing and making sense of empirical anomalies in orthodox economics”— Richard H. Thaler: A Nobel Prize for Behavioural Economics
“exploring the economic implications of Kahneman’s work with the late Amos Tversky (Tversky and Kahneman 1974) on the predictable effects of bias-inducing heuristics on the quality of decision-making. He also pioneered the use within economics of their Prospect Theory approach”— Richard H. Thaler: A Nobel Prize for Behavioural Economics
“His contributions build on work by two previous Nobel Laureates, Herbert Simon (in 1978)”— Richard H. Thaler: A Nobel Prize for Behavioural Economics
“"The people at the rally are not a random or representative sample of the electorate," says Lynn Vavreck... "These are strategic and well-planned events. This isn't just happening."”— Campaign Mystery: Why Don't Bernie Sanders' Big Rallies Lead To Big Wins?
“"Some of the crowd size is people who come because they love Bernie and they want to hear this message and they keep doing it and they follow him around just like [fans of] the Grateful Dead or Phish."”— Campaign Mystery: Why Don't Bernie Sanders' Big Rallies Lead To Big Wins?
“Adam Grant, the superstar organizational psychologist who was one of the main characters in Part 1, is also worried about undue skepticism of psychological findings.”— Adam Grant vs. Coleman Hughes, Part 2: Causation Does Not Imply. . . Anything
“Nosek is a careful, measured speaker when he talks with the media, and yet here’s what he said to Nature’s Tom Chivers about social priming in 2019: “I don’t know a replicable finding. It’s not that there isn’t one, but I can’t name it.””— Adam Grant vs. Coleman Hughes, Part 2: Causation Does Not Imply. . . Anything
“James Manzi is a doctoral student in Sociology at the University of Oxford. This study analyzes approximately 600,000 English-language social science abstracts published between 1960 and 2024”— The Ideological Trajectory of the Social Sciences

Social psychology as a discipline had long promoted the view that controlled lab experiments reliably demonstrated effects that mattered outside the laboratory, an assumption that contributed to the replication crisis and eventually toppled some of the field's most prominent hierarchies and elder statesmen. Departments at Michigan, Ohio State, Stanford, Waterloo, and Yale lost their earlier unquestioned authority as failures accumulated. The crisis also revived interest in personality psychology, which proved more replicable than many situational lab effects. [1]

The Association for Psychological Science published Mitchell's large-scale replication study in its journal Perspectives on Psychological Science, which helped spread awareness of the limits of lab external validity across subfields. The study examined far more comparisons than earlier work and documented frequent mismatches between lab and field outcomes. Its appearance in a respected outlet gave the critique institutional visibility. [4]

Top journals such as the Quarterly Journal of Economics published one in five articles with less than a 50 percent chance of replication despite rigorous selection processes, showing that even high-prestige outlets enforced no strict quality standards on replicability. Funding agencies continued to support such work because grants were tied to publication records rather than later verification. [5]

The Journal of Economic Behavior and Organization provided an early platform for Thaler's seminal paper on consumer choice anomalies, allowing behavioral critiques of rational choice theory to reach academic audiences. Similar journals in psychology, including Group Processes & Intergroup Relations, published unreplicated priming studies without adequate checks for questionable research practices. [6][12]

Social science disciplines including economics and political science maintained left-of-center ideological means while remaining less extreme than fields such as sociology or gender studies, yet still shaped research output at scale through hiring, peer review, and topic selection. An analysis of 600,000 abstracts from 1960 to 2024 found every discipline leaned left on a fixed U.S. ideological spectrum. [13]

Supporting Quotes (7)
“I started seeing much of our cherished research as, frankly, worthless. And my skepticism, some would say cynicism, has only deepened over the years”— Ten Years a Skeptic
“in a new paper published in Perspectives on Psychological Science, a journal of the Association for Psychological Science”— Great Results in the Psych Lab—But Do They Hold Up in the Field?
“Even the crème de la crème of economics journals barely manage a ⅔ expected replication rate. 1 in 5 articles in QJE scores below 50%, and this is a journal that accepts just 1 out of every 30 submissions.”— What's Wrong with Social Science and How to Fix It: Reflections After Reading 2578 Papers
“For this stuff to be financed, generated, published, and eventually rewarded requires the complicity of funding agencies, journal editors, peer reviewers, and hiring/tenure committees. Given the current structure of the machine, ultimately the funding agencies are to blame.”— What's Wrong with Social Science and How to Fix It: Reflections After Reading 2578 Papers
“Thaler’s seminal (1980) article in the very first issue of the Journal of Economic Behavior and Organization (JEBO)”— Richard H. Thaler: A Nobel Prize for Behavioural Economics
“published in 2008 in Group Processes & Intergroup Relations”— Adam Grant vs. Coleman Hughes, Part 2: Causation Does Not Imply. . . Anything
“Disciplines concerned with public policy (“policy-proximal disciplines), such as economics and political science, tend to be less fanatically leftist than disciplines concerned more with feelings, such as psychology, sociology, and gender studies (“policy-distal disciplines”).”— The Ideological Trajectory of the Social Sciences

Academic institutions responded to the replication crisis by adopting preregistration, open data sharing, and larger sample sizes as standard practices, although vignette studies with self-reports continued to proliferate because they remained inexpensive. Hiring, tenure, and grant decisions still relied heavily on publication counts and journal prestige without systematic checks for replicability, directing resources toward work that later proved unreliable. [1][5]

Policymakers embraced libertarian paternalism and nudge techniques that rested on behavioral findings challenging strict rational choice assumptions, incorporating insights from Thaler and others into government programs. Organizations adopted diversity, equity, and inclusion trainings and interventions based on priming studies and critiques of color-blind messaging, even though many of the underlying experiments later failed to replicate. [6][12]

Policy-proximal fields such as economics and political science, which maintained left-of-center means, continued to influence public policy with research that reflected those ideological patterns, showing only brief moderation between 1970 and 1990 before shifting further left after 1990. [13]

Supporting Quotes (5)
“Sample sizes, for example, are way up... What seemed radical in 2015—preregistration, open data, open methods—is now standard practice... I’ve noticed an explosion of vignette studies paired with self-reports”— Ten Years a Skeptic
“all these things depend on their peers. When criticising bad research it's easier for everyone to blame the forking paths rather than the person walking them.”— What's Wrong with Social Science and How to Fix It: Reflections After Reading 2578 Papers
“developing the concept of ‘libertarian paternalism’ with Cass Sunstein and exploring its practical implications for policy, especially via the ‘nudge’ technique”— Richard H. Thaler: A Nobel Prize for Behavioural Economics
“thanks to the interventions offered by wise behavioral scientists.”— Adam Grant vs. Coleman Hughes, Part 2: Causation Does Not Imply. . . Anything
“policy-proximal disciplines generally showed limited rightward moderation between roughly 1970 and 1990 ... Economics, political science, and public administration (“policy-proximal disciplines”) all became less leftwing in the 1970s.”— The Ideological Trajectory of the Social Sciences

Know of a source that supports or relates to this entry?

Suggest a Source