Marshmallow Test Predicts Life Success
Summaries Written by FARAgent (AI) on February 11, 2026 · Pending Verification
For decades, the marshmallow test was treated as a neat little window into character. Walter Mischel’s late 1960s experiments seemed to show that preschoolers who could wait for two treats instead of taking one right away had a valuable trait, usually called "delay of gratification" or plain self-control. That was a reasonable inference at the time. The children who waited longer often did look better later on by the measures researchers had, and the finding fit a broader American habit of mind: success comes to those who resist temptation, think long term, and keep their impulses in line.
The trouble began when that modest result hardened into a larger claim, that the number of minutes a child sat in front of a marshmallow could predict life success. The original studies were small and drawn from a narrow Stanford-affiliated sample, but the idea escaped the lab and became a parable for schools, parenting, and policy. It was cited as evidence that grit and willpower were the master variables, sometimes with much less caution than Mischel himself used. By the 2010s, critics were pointing out that children are not just displaying inner discipline in that room, they are also responding to trust, family stability, language, and class.
Replication work sharpened the problem. In 2018, Tyler Watts and colleagues, using a larger and more diverse sample, found that the famous correlations shrank substantially once family background and early cognitive measures were taken into account. More recent papers have pushed further, arguing that delay in the marshmallow task does not reliably predict adult functioning and that "state self-control" has been oversold. The current debate is not that waiting tells us nothing at all, but that growing evidence suggests the old slogan, wait longer now and prosper later, was too simple for the job it was asked to do.
- Walter Mischel devised the marshmallow test in the late 1960s at Stanford using preschoolers from the Bing Nursery School, most of them children of faculty. He followed the same children into adolescence and reported that those who waited longer for the second marshmallow scored higher on SAT tests and showed better social adjustment. Mischel published these links in 1990 and continued to describe delay of gratification as a measurable trait that shaped life outcomes. His work drew widespread attention and he remained a good-faith proponent until his death in 2018, even co-authoring a pre-registered replication that contradicted his earlier claims. [1][2][5][6][7]
- Tyler W. Watts led a 2018 conceptual replication at New York University that used more than 900 children from a large, diverse longitudinal study. After controlling for family income, mother's education, and cognitive skills, the predictive power of wait time shrank to almost nothing. Watts and his co-authors published the results in Psychological Science and followed up with further analyses into adulthood that reached the same conclusion. Their work has become a central reference for those questioning the original narrative. [2][3][6][7][10]
- Angela Duckworth promoted self-control and grit as major drivers of success in papers and public talks from the University of Pennsylvania. She argued these traits predicted outcomes as well as intelligence and could be developed through deliberate practice. Duckworth's framework kept the marshmallow test alive in education circles even as replication evidence mounted. She later distinguished grit from self-control but continued to treat both as trainable foundations of achievement. [9]
Stanford University hosted the original experiments at its Bing Nursery School, supplying a small, homogeneous group of children whose parents were mostly professors. The university's researchers conducted the 1990 follow-up that tied wait times to SAT scores and other outcomes. That narrow sample helped the idea spread for decades before larger studies exposed its limits. Stanford also provided institutional backing for later grit research that kept the broader narrative alive. [6][7]
The Association for Psychological Science awarded the original work a Golden Goose Award in 2015 and published favorable coverage in its Observer magazine. The organization helped cement the test's status in textbooks and introductory courses. Even after replication failures appeared, the APS continued to treat the study as a landmark rather than a cautionary tale. [3]
The National Institute of Child Health and Human Development collected the large, diverse dataset that Watts used for his 2018 replication. That data allowed researchers to apply modern statistical controls and show that the original correlations largely disappeared. The institute's longitudinal study therefore became the instrument that quietly dismantled the claim it had once seemed to support. [3]
The marshmallow test looked like a clean measure of self-control. A child sat in a room with one treat and the promise of two if she waited up to twenty minutes. Longer wait times seemed to reveal a stable trait that would pay dividends in school, health, and career. Early studies reported sizable correlations with later SAT scores, lower obesity rates, and fewer behavior problems. A thoughtful observer in the 1990s could reasonably conclude that the ability to delay gratification was an important, perhaps causal, ingredient of success. [1][5][8]
Those conclusions rested on a small, unrepresentative sample of Stanford children whose families were unusually stable and well-educated. The 1990 follow-up tracked fewer than ninety participants and found bivariate links between wait time and adolescent outcomes. Mischel himself noted that larger samples might show smaller effects and that home environment mattered. Yet the core story endured that wait time alone predicted life success. [2][3][7]
Later work revealed that the apparent predictive power came mostly from socioeconomic status and cognitive ability. When Watts and colleagues examined more than nine hundred children from the NICHD study, the raw correlation was modest and vanished after standard controls. A 2021 follow-up of the original Bing cohort into their forties found no reliable links to net worth, education, health, or behavior once background factors were accounted for. Growing evidence suggests the test measured privilege more than an independent capacity for self-control. [4][6][10]
The broader belief that trait self-control operates by frequent exercise of momentary willpower also proved shaky. High-trait individuals reported fewer temptations and needed less effortful restraint, not more. Measures of self-control had drifted to include neatness and reliability, capturing conscientiousness rather than pure impulse control. These findings have led an influential minority of researchers to argue that the original framing overstated the causal role of delay of gratification. [8][11]
The test became an icon of popular psychology almost immediately. It appeared in introductory textbooks, TED talks, and Sesame Street segments featuring Cookie Monster struggling to wait. YouTube videos of children confronting marshmallows racked up millions of views and reinforced the idea that a simple choice revealed destiny. Media coverage treated the findings as settled science rather than preliminary correlational work. [5][6][7]
Policymakers and educators seized on the narrative. Schools began incorporating self-control exercises and marshmallow-style activities into curricula. Early-intervention programs aimed at teaching delay of gratification received funding on the assumption that it would improve long-term outcomes. The 2015 Golden Goose Award lent official prestige just as replication doubts were beginning to surface. [2][3][6]
Academic journals and review articles kept the idea alive by citing the original studies as evidence that early self-regulation predicted adult success. Duckworth's work on grit added another layer, suggesting that willpower-like traits could be cultivated and measured. The intuitive appeal of the story, combined with its simplicity, made it resistant to correction even after larger studies appeared. [9][10]
Schools across the country introduced self-control training and marshmallow-style exercises based on the belief that delay of gratification predicted academic and behavioral success. Teachers were encouraged to test and strengthen this capacity in young children. These programs assumed that a trainable skill would translate directly into better life outcomes. [6]
West Point and several large urban districts began using grit and self-control measures for cadet selection, student retention, and teacher evaluation. The assumption that these traits operated independently of background factors justified their inclusion in high-stakes decisions. Longitudinal claims from the marshmallow test helped justify the expense and effort. [9]
Early childhood interventions focused narrowly on teaching delay of gratification received public support and philanthropic funding. The programs were sold as low-cost ways to close achievement gaps and reduce later social problems. Growing evidence now suggests these targeted efforts produced only short-lived gains that faded once children returned to their normal environments. [3][5][10]
The belief led educators and parents to misread children's behavior as a failure of willpower rather than a response to unstable environments. Low-income children often waited less because past experience taught them that promises of future rewards could not be trusted. This misdiagnosis reinforced stereotypes about poor self-control among disadvantaged families while downplaying the role of economic security. [7]
Resources were poured into narrow interventions that taught delay of gratification but showed little lasting effect on academic or behavioral outcomes. A meta-analysis of eighty-five trials found that forty to fifty percent of the initial gains in cognitive and social-emotional skills faded within a year or two. Those funds could have supported broader approaches addressing family stability and cognitive development. [5]
The narrative shaped public discussion in ways that understated the importance of structural factors. Policymakers and affluent parents came to believe that teaching willpower could overcome disadvantages, reducing pressure for more expensive social investments. The misconception persists in self-help literature and education policy even as replication evidence accumulates. [1][7]
The assumption began to lose credibility in 2018 when Watts and colleagues published their replication in Psychological Science. Using a diverse sample of more than nine hundred children, they found only weak associations that disappeared after controlling for family background and cognitive skills. The study was larger, more representative, and applied statistical methods unavailable to the original researchers. [2][3][7]
A pre-registered follow-up of the original Bing cohort into their forties reached similar conclusions. Even with advanced statistics and hypotheses set in advance, wait times showed no reliable links to education, wealth, health, or behavior. Several original collaborators, including Mischel before his death, helped design the disconfirming work. [6][10]
Subsequent research has shown that high self-control individuals experience fewer temptations and rely less on effortful restraint than previously assumed. These findings challenge the idea that training momentary willpower will strengthen the underlying trait. While not every expert has abandoned the original interpretation, a substantial and growing body of evidence now treats the marshmallow test as a historically interesting but unreliable predictor of life success. [8][11][14]
- [1]
- [2]
-
[3]
A New Approach to the Marshmallow Test Yields Complicated Findingsreputable_journalism
- [4]
-
[5]
What the marshmallow test got wrong about child psychology | Psyche Ideasreputable_journalism
-
[6]
New Study Disavows Marshmallow Test's Predictive Powers - UCLA Anderson Reviewreputable_journalism
-
[7]
Why Rich Kids Are So Good at the Marshmallow Testreputable_journalism
-
[8]
Willpower is overratedpeer_reviewed
- [9]
- [10]
-
[11]
The fable of state self-controlpeer_reviewed
- [12]
-
[13]
Florida education goals violate civil rights: complaintreputable_journalism
-
[14]
The marshmallow test: Bunkum or a true predictor of future success?reputable_journalism
-
[15]
An iconic psychology test just took a massive hit to its credibilityreputable_journalism
-
[16]
Professor replicates famous marshmallow test, makes new observationsreputable_journalism
-
[17]
The Marshmallow Test as a Screening Instrumentpeer_reviewed
- Affirmative Action Causes No Reverse DiscriminationAcademia Education Psychology Public Policy
- Airport Profiling is Racial DiscriminationAcademia Education Psychology Public Policy
- Anti-Bias Training WorksAcademia Education Psychology Public Policy
- Black-White IQ Gap is 100% EnvironmentalAcademia Education Psychology Public Policy
- Flynn Effect Shows Real IQ GainsAcademia Education Psychology Public Policy