False Assumption Registry

Stereotype Threat Impairs Performance


False Assumption: Reminding people of negative group stereotypes subtly causes them to underperform on relevant tasks like math.

Summaries Written by FARAgent (AI) on February 09, 2026 · Pending Verification

For years, the accepted line in psychology and education was that subtle cues about identity could depress performance. Claude Steele and Joshua Aronson’s 1995 study gave the idea its canonical form: if a hard test was framed as diagnostic of ability, Black students would do worse; if the same test was framed differently, the gap would shrink. The phrase "stereotype threat" spread fast because it fit the times and offered a clean mechanism for stubborn group gaps. By the 2000s it had become standard advice in schools, workplaces, and DEI training: avoid cues that might "trigger" stereotypes, reassure people they belong, and performance would improve.

Then the evidence began to look less tidy. Researchers tried to reproduce the classic effects with larger samples and stricter methods, and the results often came back weak, inconsistent, or null. Some early believers, including Michael Inzlicht, publicly acknowledged that the literature had serious problems, while skeptics such as Rob Kurzban had doubted the story much earlier. A major Registered Replication Report led by Andrea Stoevenbelt failed to recover a key effect, and meta-analytic claims that once looked persuasive were increasingly criticized for publication bias, small-study effects, and flexible analysis.

The idea has not vanished. It still appears in textbooks, corporate training, and popular explanations of achievement gaps, often in the old language about "identity threat" and "belonging cues." But a substantial body of experts now rejects the strong version of the claim, that merely reminding people of a negative stereotype reliably and substantially impairs performance on tasks like math. The current debate is narrower and less grand: whether stereotype threat exists only under limited conditions, whether its effects are much smaller than advertised, or whether the famous findings were mostly artifacts of a field that believed too much too soon.

Status: A significant portion of experts think this assumption was false
  • Claude Steele, a Stanford professor, proposed the idea of stereotype threat in the early 1990s and became its most eloquent advocate through a series of charismatic keynotes, including his magnetic 1999 address to the American Psychological Society. He framed the concept as a subtle but powerful force where reminding people of negative group stereotypes could impair their performance on relevant tasks. His work drew widespread academic acclaim and helped shape discussions on inequality for years. Steele presented the theory in good faith, drawing on lab experiments that appeared to show large effects from simple framing changes. The influence of his presentations extended beyond academia into policy circles. [2][8]
  • Michael Inzlicht, a social psychologist at the University of Toronto, studied stereotype threat as a PhD student, published early papers including his dissertation on the topic, edited a book devoted to it, and contributed to Supreme Court amicus briefs citing the research. He benefited immensely from the acclaim, securing jobs, grants, and tenure as the idea gained traction in the field. Years later, Inzlicht began voicing tentative doubts in blog posts and newsletters, arguing that the effects did not hold up under scrutiny. His shift from proponent to critic drew both attention and rebuttals from others in the field. The republication of one of his critical pieces reached roughly 10,000 readers and highlighted the growing debate. [2][7][8][11]
  • Joshua Aronson, who had been Steele's student and later supervised Inzlicht as a postdoc, co-authored the influential 1995 Stanford study that claimed Black students' performance gap with whites vanished when a test was reframed as problem-solving rather than intelligence testing. He continued as a good faith proponent of the idea that such environmental tweaks could address achievement gaps. Aronson's work helped propel the theory into broader discussions on race and gender differences. His role connected the original experiments to subsequent generations of researchers. The study itself relied on small samples and methods that later came under question. [2][8]
Supporting Quotes (19)
“I knew why. It’s not true.”— Psychology’s Greatest Misses (Part 1/3)
“The concept of stereotype threat, first proposed by Claude Steele in the early 1990s, posited that individuals who are part of a negatively stereotyped group can, in certain situations, experience anxiety about confirming those stereotypes, leading paradoxically to underperformance... I still remember seeing its most eloquent advocate, Stanford University’s Claude Steele, deliver a keynote address in 1999”— Revisiting Stereotype Threat
“In 1995, Steele and his student Joshua Aronson—who went on to become my postdoc supervisor years later—demonstrated that the notorious Black-white gap in academic performance could be partially closed”— Revisiting Stereotype Threat
“I too was swept up by this mania. I studied stereotype threat as a PhD student and published some of the first papers on the topic. My dissertation and very first publication suggested that subtle aspects of a room... Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim. I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court. My career benefitted immensely.”— Revisiting Stereotype Threat
“new data now reveal... this study (still a preprint) was preregistered... and involved over 1,500 participants. It replicated the exact procedures of a well-known stereotype threat study published in 2005... Conducted by multiple labs across the U.S. and Europe, and led by Andrea Stoevenbelt”— Revisiting Stereotype Threat
“When I started voicing tentative doubts about stereotype threat in a blogpost that went viral, some of the responses pushed back.”— Does Data Matter in Psychology?
“I’ll never forget how a prominent African American psychology professor struggled to make sense of my doubts because she had experienced stereotype threat personally.”— Does Data Matter in Psychology?
“The concept of stereotype threat, first proposed by Claude Steele in the early 1990s, posited that individuals who are part of a negatively stereotyped group can, in certain situations, experience anxiety about confirming those stereotypes, leading paradoxically to underperformance, thus confirming the disparaging stereotype.”— Revisiting Stereotype Threat
“I too was swept up by this mania. I studied stereotype threat as a PhD student and published some of the first papers on the topic. My dissertation and very first publication suggested that subtle aspects of a room—like how many men and women were in a math classroom—could be enough to evoke stereotype threat and undermine performance. Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim.”— Revisiting Stereotype Threat
“In 1995, Steele and his student Joshua Aronson—who went on to become my postdoc supervisor years later—demonstrated that the notorious Black-white gap in academic performance could be partially closed when negative stereotypes impugning Black people’s intelligence were made irrelevant.”— Revisiting Stereotype Threat
“I too was swept up by this mania. I studied stereotype threat as a PhD student and published some of the first papers on the topic. My dissertation and very first publication suggested that subtle aspects of a room—like how many men and women were in a math classroom—could be enough to evoke stereotype threat and undermine performance. Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim.”— Revisiting Stereotype Threat
“Conducted by multiple labs across the U.S. and Europe, and led by Andrea Stoevenbelt this study (still a preprint) was preregistered (meaning all methods and analyses were specified before the data were collected) and involved over 1,500 participants.”— Revisiting Stereotype Threat
“The concept of stereotype threat, first proposed by Claude Steele in the early 1990s, posited that individuals who are part of a negatively stereotyped group can, in certain situations, experience anxiety about confirming those stereotypes, leading paradoxically to underperformance, thus confirming the disparaging stereotype.”— Revisiting Stereotype Threat
“I too was swept up by this mania. I studied stereotype threat as a PhD student and published some of the first papers on the topic. My dissertation and very first publication suggested that subtle aspects of a room—like how many men and women were in a math classroom—could be enough to evoke stereotype threat and undermine performance. Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim.”— Revisiting Stereotype Threat
“Stereotype threat was the focus of my dissertation research. I edited an entire book on the topic. I have signed my name to an amicus brief to the Supreme Court of the United States citing it. Yet now I am not as certain”— The Replication Crisis Is My Crisis
“I too was swept up by this mania. I studied stereotype threat as a PhD student and published some of the first papers on the topic. My dissertation and very first publication suggested that subtle aspects of a room—like how many men and women were in a math classroom—could be enough to evoke stereotype threat and undermine performance. Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim. I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court.”— The Downfall of Stereotype Threat
“I wrote the piece below about 18 months ago. A few weeks ago, my friends Dominic Packer and Jay Van Bavel republished it in their newsletter, The Power of Us, under the title “The Downfall of Stereotype Threat.””— Is Stereotype Threat Just Vibes Now?
“Mary Murphy, a prominent stereotype threat researcher, wrote a lengthy public rebuttal on her own Substack, taking issue with my conclusions and the timing of the republication.”— Is Stereotype Threat Just Vibes Now?
“my friends Dominic Packer and Jay Van Bavel republished it in their newsletter, The Power of Us, under the title “The Downfall of Stereotype Threat.” That republication landed in the inbox of roughly 10,000 people”— Is Stereotype Threat Just Vibes Now?

The social psychology field embraced stereotype threat as one of its favorite topics by the mid-2000s, publishing dozens of studies, granting career advancement to researchers who pursued it, and featuring it prominently at conferences such as the Society for Personality and Social Psychology with countless posters and talks. The idea spread rapidly through academic channels because it aligned with prevailing views on how stereotypes could undermine achievement in measurable ways. Departments built research programs around it while career incentives like grants and tenure rewarded supporting findings. Critics later noted that the field was slow to address replication issues even as doubts accumulated. The theory became a staple in introductory psychology classes for years. [2][3][8]

The American Psychological Society, now known as the Association for Psychological Science, hosted Claude Steele's 1999 keynote that promoted the concept to a wide audience of psychologists and helped establish it as a dominant idea within the discipline. The organization provided a prestigious platform that amplified the original claims about test framing and performance. This event contributed to the theory's academic momentum at a key moment. Subsequent research built directly on the foundation laid there. The society's role reflected the broader enthusiasm in psychology for environmental explanations of group differences at the time. [2]

Sollah, a corporate training provider, incorporated ideas about how stereotypes and assumptions negatively impact workplace relationships and performance into its programs on inclusion and belonging. The company developed and sold eLearning modules, videos, and specific products such as TrainingBriefs Civility Matters and Inclusion 101 to organizations seeking to maximize workforce performance by addressing biases. These materials presented understanding and mitigating stereotype effects as essential tools for better collaboration and achievement. The trainings encouraged practices like mandatory sessions on civility, open communication, and team-building events. Such commercial dissemination extended the concept beyond academia into corporate settings. [10]

Supporting Quotes (5)
“Before long, stereotype threat was not only the darling of social psychology, but it also became the darling of the political left”— Revisiting Stereotype Threat
“It was psychology's darling theory for explaining and remedying achievement gaps that left progressives feeling good about themselves.”— Does Data Matter in Psychology?
“Before long, stereotype threat was not only the darling of social psychology, but it also became the darling of the political left who now had an answer to prevailing views of group differences held by the political right.”— Revisiting Stereotype Threat
“I still remember seeing its most eloquent advocate, Stanford University’s Claude Steele, deliver a keynote address in 1999 at the annual convention of what was then called the American Psychological Society.”— Revisiting Stereotype Threat
“Sollah Has a Plan for You! We have the in tools necessary to maximize your workforce for today and beyond. Interactive eLearning, facilitation guides, discussion cards, how-to books, and video situations... are ready to tackle the tough issues we each face in the workplace.”— Accelerate Inclusion & Belonging in Your Workplace

The assumption that reminding people of negative group stereotypes subtly causes them to underperform on relevant tasks such as math rested on a series of experiments beginning in 1995 that appeared to demonstrate clear effects from identity primes. Claude Steele and Joshua Aronson reported at Stanford that Black students performed worse when a test was presented as diagnostic of intelligence but closed the gap with white students when it was framed as a problem-solving exercise instead. The study used small samples and flexible analytical methods yet generated the sub-belief that racial and gender achievement gaps were malleable through simple environmental changes. It seemed credible at the time as groundbreaking lab evidence of situational forces at work. Subsequent work built on this foundation for more than two decades. [2][8]

A 2005 study by Johns, Schmader, and Martens claimed that teaching women about stereotype threat could mitigate the gender gap in math performance, and this intervention was cited as influential for years. The work appeared to offer a practical remedy rooted in the original theory. It aligned with the broader notion that high-performing members of stereotyped groups underperformed when reminded of negative stereotypes about their abilities. Researchers presented these findings as evidence that gaps could be addressed without altering underlying cognitive factors. The study later failed to replicate under more rigorous conditions. [2][8]

Over time the concept shifted from explaining objective Black-White or male-female testing gaps to descriptions of subjective feelings or vibes in uncomfortable environments, diluting some of its original empirical framing. Early studies had emphasized measurable performance decrements on standardized tasks. Meta-analyses later examined whether the effect held across contexts such as gender and mathematics. One analysis found the effect size may be near zero when corrected for publication bias. Another review indicated that publication bias and questionable research practices may have inflated apparent support from 14 percent to as high as 84 percent in the literature. [11][12][13]

Supporting Quotes (11)
“The idea was that you can cause people to perform worse on some task—math problems, or whatever—by reminding them, even subtly, that they are a member of a group that is stereotypically bad at that task.”— Psychology’s Greatest Misses (Part 1/3)
“In 1995, Steele and his student Joshua Aronson... demonstrated that the notorious Black-white gap in academic performance could be partially closed when negative stereotypes impugning Black people’s intelligence were made irrelevant. When Black students at Stanford University were told that a test was diagnostic of intellectual ability, they performed worse than their white counterparts. However, when this stereotype threat was ostensibly removed—by simply framing the test as a measure of problem-solving rather than intelligence—the performance gap between Black and white students nearly vanished.”— Revisiting Stereotype Threat
“It replicated the exact procedures of a well-known stereotype threat study published in 2005 by Mike Johns, Toni Schmader, and Andy Martens... The original study had found that women performed worse on math tests when reminded of gender stereotypes but performed on par with men when they were instead taught about stereotype threat.”— Revisiting Stereotype Threat
“Stereotype threat is the idea that people from negatively stereotyped groups choke under pressure when those stereotypes become salient. So, when high-performing Black students are reminded they're Black, they feel unnerved by negative academic-ability stereotypes and supposedly underperform on academic tests. Similarly, women become anxious when faced with the stereotype that “girls aren’t good at math” and are more likely to bomb math exams.”— Does Data Matter in Psychology?
“In 1995, Steele and his student Joshua Aronson—who went on to become my postdoc supervisor years later—demonstrated that the notorious Black-white gap in academic performance could be partially closed when negative stereotypes impugning Black people’s intelligence were made irrelevant.”— Revisiting Stereotype Threat
“When Black students at Stanford University were told that a test was diagnostic of intellectual ability, they performed worse than their white counterparts. However, when this stereotype threat was ostensibly removed—by simply framing the test as a measure of problem-solving rather than intelligence—the performance gap between Black and white students nearly vanished.”— Revisiting Stereotype Threat
“When Black students at Stanford University were told that a test was diagnostic of intellectual ability, they performed worse than their white counterparts. However, when this stereotype threat was ostensibly removed—by simply framing the test as a measure of problem-solving rather than intelligence—the performance gap between Black and white students nearly vanished.”— Revisiting Stereotype Threat
“Stereotype threat... offers situational explanations for things like the gender gap in science and math achievement... a meta-analysis published just last year suggests that stereotype threat, at least for some populations and under some conditions, might not be so robust after all.”— The Replication Crisis Is My Crisis
“When Black students at Stanford University were told that a test was diagnostic of intellectual ability, they performed worse than their white counterparts. However, when this stereotype threat was ostensibly removed—by simply framing the test as a measure of problem-solving rather than intelligence—the performance gap between Black and white students nearly vanished.”— The Downfall of Stereotype Threat
“Assumptions and stereotypes sure have gotten a lot of attention within the workplace as of late. But do you really understand them? Complete this short form to receive immediate access to a powerful primer that defines and highlights the impact of those types of biases on those around us.”— Accelerate Inclusion & Belonging in Your Workplace
“Mary's rebuttal left me wondering when stereotype threat stopped being about the Black-White or male-female testing gaps and started being about how people feel in uncomfortable rooms. Vibes, basically. Important vibes, yes, but vibes nonetheless.”— Is Stereotype Threat Just Vibes Now?

Stereotype threat proliferated in academia around 2005 as a dominant research area, driven by its alignment with political messaging that emphasized how stereotypes undermine achievement rather than biological explanations offered in works such as The Bell Curve. The idea spread through academic acclaim, keynotes, and media coverage that appealed especially to the political left. Conferences featured numerous posters on the topic, and researchers gained jobs, grants, and tenure for work that supported it. The theory also reached policy discussions through Supreme Court briefs citing the research on group differences. Its intuitive appeal to personal experiences helped it override emerging data questions for many. [1][2][3][7][8]

The replication crisis that began in the early 2010s raised broader doubts about methods in social psychology, with audits showing that only about a quarter of studies replicated reliably, including work on stereotype threat. This period prompted closer scrutiny of small samples, flexible analyses, and practices now recognized as p-hacking. Meta-analyses and bias tests began to question the strength of the original findings. Despite these challenges, the concept continued to appear in training materials and corporate inclusion programs. Newsletters and Substacks later amplified both critiques and rebuttals to thousands of readers. [8][11]

Career incentives within social psychology played a significant role in sustaining the research program even as questions mounted. Proponents gained prominent positions and funding by producing studies consistent with the assumption. The idea became a favored explanation among social-justice oriented scientists and influenced discussions in both academia and courts. One prominent African American psychology professor rejected early doubts because she had experienced the phenomenon personally. Publication bias analyses suggested that supportive results were far more likely to appear in print than null findings. [2][7][13]

Supporting Quotes (16)
“When I was in graduate school, it seemed to me that half the field of social psychology was studying what was called “stereotype threat.””— Psychology’s Greatest Misses (Part 1/3)
“stereotype threat... became the darling of the political left who now had an answer to prevailing views of group differences held by the political right. This is partly because shortly before stereotype threat took its turn in the spotlight, Charles Murray and Richard Herrnstein published The Bell Curve”— Revisiting Stereotype Threat
“psychologists will regularly ignore data when it goes against their own intuitions and personal experiences.”— Does Data Matter in Psychology?
“Before long, stereotype threat was not only the darling of social psychology, but it also became the darling of the political left who now had an answer to prevailing views of group differences held by the political right.”— Revisiting Stereotype Threat
“I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court. My career benefitted immensely.”— Revisiting Stereotype Threat
“Before long, stereotype threat was not only the darling of social psychology, but it also became the darling of the political left who now had an answer to prevailing views of group differences held by the political right.”— Revisiting Stereotype Threat
“Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim. I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court.”— Revisiting Stereotype Threat
“I still remember seeing its most eloquent advocate, Stanford University’s Claude Steele, deliver a keynote address in 1999 at the annual convention of what was then called the American Psychological Society.”— Revisiting Stereotype Threat
“Because my field became captivated by stereotype threat, this meant that I was quickly offered jobs, grants, tenure, and acclaim. I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court.”— Revisiting Stereotype Threat
“Since its inception in 1995, the concept has been a favorite among scientists with a social-justice bent... I have signed my name to an amicus brief to the Supreme Court”— The Replication Crisis Is My Crisis
“Before long, stereotype threat was not only the darling of social psychology, but it also became the darling of the political left who now had an answer to prevailing views of group differences held by the political right.”— The Downfall of Stereotype Threat
“And when some brave scientists decided to audit the field by closely replicating many studies, only about a quarter from social psychology could be successfully replicated.”— The Downfall of Stereotype Threat
“Download Our Newest Resource! ... TrainingBriefs® Civility Matters ... Inclusion 101™ Optimizing Our Diversity Potential ... TrainingBriefs® She’s Just Too Old! ... The Future of Work™ Generations in the Workplace.”— Accelerate Inclusion & Belonging in Your Workplace
“my friends Dominic Packer and Jay Van Bavel republished it in their newsletter, The Power of Us... Mary Murphy... wrote a lengthy public rebuttal on her own Substack”— Is Stereotype Threat Just Vibes Now?
“the average reported effect of stereotype threat is small, and that those reports may be inflated by publication bias”— Does stereotype threat influence performance of girls in stereotyped domains? A meta-analysis
“publication bias and other questionable research practices have been estimated to inflate the percentage of studies supporting gender stereotype threat from 14% to 84%”— Potential Publication Bias in the Stereotype Threat Literature

Researchers contributed findings on stereotype threat to amicus briefs delivered to the U.S. Supreme Court, where the work informed arguments about inequality, group performance gaps, and the value of certain admissions or related policies. The briefs presented the idea that subtle reminders of stereotypes could impair performance as relevant evidence in legal debates over how to address disparities. This use extended the theory's influence from laboratories into judicial considerations. The citations relied on the early studies that had not yet faced large-scale replication tests. Later critiques noted that the evidentiary basis cited in such documents rested on research now viewed as questionable by some. [2][7][8]

Interventions based on the assumption, such as reframing tests as non-diagnostic of intelligence or teaching students about stereotype threat itself, were proposed as practical ways to close gender and racial academic gaps. These approaches were justified by the claim that removing the threat would allow stereotyped groups to perform closer to their true ability. Educators and policymakers explored such environmental fixes in schools and testing situations. The strategies promised relatively simple solutions to persistent achievement differences. Subsequent replication attempts cast doubt on whether these interventions produced the expected effects. [2]

Organizations were encouraged to adopt practices including mandatory training on civility and inclusion, open communication channels, collaboration initiatives, recognition of achievements, growth opportunities, and team-building events to counteract the assumed negative impact of stereotypes on workplace performance. Training providers framed these measures as essential for maximizing workforce potential and fostering belonging. The programs presented understanding stereotype effects as a key component of addressing biases in professional settings. Such policies spread through corporate eLearning modules and videos aimed at broad audiences. They reflected the translation of the academic idea into applied diversity efforts. [10]

Supporting Quotes (5)
“I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court.”— Revisiting Stereotype Threat
“The idea was that awareness of the phenomenon of stereotype threat helped mitigate its effects, which was why this original paper was so influential: it offered a simple intervention to close the gender-gap in math performance.”— Revisiting Stereotype Threat
“I have signed my name to an amicus brief to the Supreme Court of the United States citing it.”— The Replication Crisis Is My Crisis
“I edited a book on stereotype threat and was asked to add my name and research to briefs delivered to the US Supreme Court.”— The Downfall of Stereotype Threat
“Here are a few ideas and strategies that you can use to cultivate a culture of belonging in our organization. Foster open communication & active listening. ... Embrace civility. A culture of belonging must be one that is inclusive. This involves actively creating policies and practices that promote a civil and respectful workplace, and providing training to all employees to increase their awareness and understanding of their co-workers. Encourage collaboration & teamwork.”— Accelerate Inclusion & Belonging in Your Workplace

The focus on stereotype threat diverted research attention and resources away from other potential causes of achievement gaps, directing funding and effort toward situational explanations and interventions that later showed weak or non-replicable effects. Social psychology produced a backlog of suspect findings that were taught in introductory classes for years. Careers were built on research programs that relied on small samples and flexible methods now recognized as problematic. This emphasis promised environmental remedies for inequality but left a legacy of questionable science according to critics. The field expended significant resources on studies and programs that failed to hold up under closer scrutiny. [2][3][7][8]

Misguided reliance on the assumption contributed to a distorted research agenda in which publication bias and questionable practices inflated the apparent strength of the evidence. Meta-analyses that corrected for these issues found effect sizes near zero in some domains such as gender and mathematics. The result was a body of literature that influenced policy and training while resting on foundations that a substantial body of experts now view as shaky. Critics argue this diverted attention from more robust findings on stereotype accuracy and other factors in group differences. The consequences included wasted resources and false hopes for easy fixes to complex problems. [12][13][14]

Supporting Quotes (7)
“Stereotype threat was more than an idea—it was a promise, a way to understand inequality and to imagine solutions.”— Revisiting Stereotype Threat
“explaining and remedying achievement gaps that left progressives feeling good about themselves.”— Does Data Matter in Psychology?
“Its failure forces us to confront uncomfortable questions about how science is done and what happens when beloved ideas turn out to be wrong.”— Revisiting Stereotype Threat
“Many of us engaged in practices that, in hindsight, were borderline dishonest. We abused experimenter degrees of freedom, engaged in questionable research practices, p-hacked, massaged our data—you pick the euphemism.”— Revisiting Stereotype Threat
“If stereotype threat is not real, not robust, what else was I taught in my introduction to psychology classes that is also suspect? Despite all our improvements that help us in the present and future, we still have a massive backlog of studies from the past that we need to reckon with.”— Revisiting Stereotype Threat
“offers situational explanations for things like the gender gap in science and math achievement and participation.”— The Replication Crisis Is My Crisis
“the entire field’s evidentiary basis was now suspect. After all, they were produced by methods that we now consider questionable.”— The Downfall of Stereotype Threat

The assumption came under sustained challenge during the replication crisis of the early 2010s, when audits of social psychology revealed that only about a quarter of studies replicated reliably and stereotype threat emerged as one of the casualties. Multiple attempts to reproduce the performance effects failed, and large-sample studies began to report null results. A bias-corrected meta-analysis suggested the true effect, if it exists at all, is far weaker than originally claimed. These developments prompted researchers like Michael Inzlicht to voice public doubts in blog posts and newsletters. The crisis exposed issues such as tiny samples, p-hacking, and flexible analyses in the foundational work. [2][3][7][8]

A pivotal moment arrived with the 2024 Registered Replication Report led by Andrea Stoevenbelt, which involved more than 1,500 participants across multiple labs and found no evidence of the stereotype threat effect when replicating a key 2005 mathematics study. The preregistered design was intended to address the methodological concerns that had plagued earlier research. This large-scale effort joined a series of smaller failed replications and meta-analyses that questioned the robustness of the original claims. The report added to evidence that the phenomenon may not operate as first believed. Proponents such as Mary Murphy offered public rebuttals challenging the conclusions and timing of such critiques. [2][8][11]

Republication of critical pieces on the theory's problems, including work by Michael Inzlicht and Dominic Packer with Jay Van Bavel, reached thousands of readers through newsletters and prompted further debate in academic circles. These discussions highlighted ongoing contention over whether the assumption holds in any meaningful form. Some experts maintain that certain forms of the effect persist under specific conditions, while others argue the evidence overall points to weakness or absence. The exchange of critiques and responses illustrated the contested nature of the idea without producing consensus. Growing scrutiny of publication bias and research practices continued to shape how the literature is evaluated. [11][13][14]

Supporting Quotes (11)
“There are now multiple failed replications, large-sample studies that found no effect, and at least one bias-corrected meta-analysis pointing to the same conclusion... Despite following these procedures to the letter, the replication found no effect.”— Revisiting Stereotype Threat
“There were warning signs: tiny sample sizes, flexible analyses, and implausibly large effect sizes... Many of us engaged in practices that, in hindsight, were borderline dishonest. We abused experimenter degrees of freedom, engaged in questionable research practices, p-hacked, massaged our data”— Revisiting Stereotype Threat
“This is perhaps clearest when we look at the various casualties of the replication crisis—and people’s responses to these failures.”— Does Data Matter in Psychology?
“Despite following these procedures to the letter, the replication found no effect. Women who were ostensibly in a threat condition didn’t perform any worse than those who were instead taught about threat.”— Revisiting Stereotype Threat
“There are now multiple failed replications, large-sample studies that found no effect, and at least one bias-corrected meta-analysis pointing to the same conclusion: if stereotype threat exists, it is far weaker and more inconsistent than we originally believed.”— Revisiting Stereotype Threat
“Despite following these procedures to the letter, the replication found no effect. Women who were ostensibly in a threat condition didn’t perform any worse than those who were instead taught about threat.”— Revisiting Stereotype Threat
“There are now multiple failed replications, large-sample studies that found no effect, and at least one bias-corrected meta-analysis pointing to the same conclusion: if stereotype threat exists, it is far weaker and more inconsistent than we originally believed.”— Revisiting Stereotype Threat
“Despite following these procedures to the letter, the replication found no effect. Women who were ostensibly in a threat condition didn’t perform any worse than those who were instead taught about threat.”— Revisiting Stereotype Threat
“a meta-analysis published just last year suggests that stereotype threat... might not be so robust after all. Conducting bias-tests of the original papers is also not comforting... rumor has it that another massive effort at replication of stereotype threat is now in the works.”— The Replication Crisis Is My Crisis
“Despite following these procedures to the letter, the replication found no effect. Women who were ostensibly in a threat condition didn’t perform any worse than those who were instead taught about threat.”— The Downfall of Stereotype Threat
“A few weeks ago, my friends Dominic Packer and Jay Van Bavel republished it in their newsletter, The Power of Us, under the title “The Downfall of Stereotype Threat.” That republication landed in the inbox of roughly 10,000 people, which is fine. What happened next is more interesting.”— Is Stereotype Threat Just Vibes Now?

Know of a source that supports or relates to this entry?

Suggest a Source