Confidence Freeze: Early Success Induces a Metastable Decoupling of Metacognition and Behaviour

cs.LG Zhipeng Zhang, Hongshun He · Mar 22, 2026

What it does

Why it matters

Using a multi-reversal bandit task (N=332 across 3 experiments), the authors show that brief exposure to 90% success rates (vs. 60%) induces lock-in behavior where participants endure ~6 consecutive losses while reporting plummeting...

Main concern

The paper presents an intriguing mechanistic account of maladaptive persistence with solid effect sizes (Cohen's d ≈ 0. 7 for interventions, d = 0.

Community signal

0 up · 0 down

AI Review AI reviewed

Plain-language introduction

This paper investigates why humans persist with failing strategies despite negative feedback, proposing 'confidence freeze'—a metastable state where early success decouples metacognitive confidence from behavior. Using a multi-reversal bandit task (N=332 across 3 experiments), the authors show that brief exposure to 90% success rates (vs. 60%) induces lock-in behavior where participants endure ~6 consecutive losses while reporting plummeting confidence, suggesting a dynamic mechanism rather than stable individual traits.

Critical review

Verdict

Bottom line

The paper presents an intriguing mechanistic account of maladaptive persistence with solid effect sizes (Cohen's d ≈ 0.7 for interventions, d = 0.97 for policy stickiness). However, the statistical evidence for the central claim—early success suppressing loss sensitivity—rests on a barely significant interaction (p = 0.048) that falls within the 'uncanny valley' of p-values. The experimental design is elegant, but confidence ratings sampled only every 3 trials provide coarse temporal resolution for a phenomenon emphasized as dynamic. The reproducibility claim is weakened by data being promised 'upon publication' rather than available now, and key test statistics (e.g., manipulation check t-values) are reported as placeholders (X.XX) in the provided text.

“the loss-streak × group interaction was significant (β=-0.07, p=.048)”

Zhang & He, Confidence Freeze · Results, Mixed-effects Modelling

“t(97)=X.XX, p<.001”

Zhang & He, Confidence Freeze · Results, Manipulation Check

What holds up

The convergent evidence across behavioral, metacognitive, and computational measures strengthens the core claim. The finding that the same individuals transition in and out of lock-in states across reversals is methodologically important—it directly contradicts trait-based explanations. The computational modeling results showing elevated policy stickiness (φ: 0.76 vs. 0.16) with intact learning rates (α actually higher in high-success group) support the specific mechanism of belief-action decoupling rather than general learning deficits. The dual-pathway intervention finding—environmental (explicit trajectory) and cognitive (prompts) showing equivalent efficacy—is conceptually valuable for applications.

“high-success group exhibited significantly higher policy stickiness... (φ: M_high=0.76, M_normal=0.16, t(198)=6.88, p<0.001, Cohen's d=0.97)”

Zhang & He, Confidence Freeze · Results, Computational Modeling

“freeze index in prompted high-success participants dropped to 14%, significantly lower than unprompted controls (33%)”

Zhang & He, Confidence Freeze · Results, Experiment 3

Main concerns

First, the statistical threshold for the key interaction (p = 0.048) is concerning given the recent emphasis on p < 0.005 for new discoveries and the well-documented inflation of false positives near p = 0.05. This marginal significance, combined with moderate sample sizes per experiment (~99-123), warrants replication before mechanistic claims are accepted. Second, the sparse sampling of confidence (every 3 trials) risks missing the actual dynamics of the freeze state—participants could have switched confidence levels between ratings, making the 'freeze index' potentially noisy. Third, the paper lacks formal model comparison metrics (AIC/BIC) for the computational models; claiming the stickiness parameter captures the effect requires showing it outperforms simpler alternatives. Fourth, multiple comparisons across experiments and thresholds (Δ = 1, 2, 3) are conducted without clear correction procedures.

“loss-streak × group interaction (β=-0.07, p=.048)”

Zhang & He, Confidence Freeze · Results, Mixed-effects Modelling

“Confidence ratings were collected every three trials using a 1-7 scale”

Zhang & He, Confidence Freeze · Methods, Task Overview

“freeze detection... recomputed for drops of 1, 2, and 3 confidence points”

Zhang & He, Confidence Freeze · Results, Robustness of the Freeze Effect

Evidence and comparison

The evidence supports the broad phenomenon—early success induces persistence—but the specific 'confidence-freeze' mechanism requires stronger validation. The claim that confidence and behavior 'decouple' relies on measuring confidence at sparse intervals and inferring dissociation from group-level patterns. The paper appropriately compares to related work on exploration-exploitation trade-offs and sunk-cost fallacies, positioning confidence freeze as a dynamic alternative to trait-based accounts. However, the comparison to normative Bayesian updating (mentioned in Discussion) lacks formal hierarchical model fitting to participants' actual switching data; it remains a verbal theory rather than a quantified benchmark. The interventions in Experiments 2 and 3 are well-designed, but neither includes a no-intervention control group from the same subject pool, limiting causal claims about intervention efficacy.

“These findings align with Bayesian accounts of evidence accumulation in volatile environments”

Zhang & He, Confidence Freeze · Discussion, Trajectory evidence

“persistence length decreased from 3.2 to 2.1 trials; freeze index from 38% to 18%”

Zhang & He, Confidence Freeze · Results, Experiment 2

Reproducibility

Reproducibility is seriously compromised. The Data Availability statement indicates materials 'will be made publicly available on OSF upon publication'—meaning the data, code, and analysis scripts are not currently accessible for independent verification. This violates current standards for computational reproducibility. Additionally, the manuscript contains placeholder statistics (e.g., 't(97)=X.XX') that suggest incomplete reporting. Critical hyperparameters for the reinforcement learning model (initial values, fitting procedure, bounds on parameters) are not specified in the extract provided. The between-subject design across three experiments means effect sizes could be sensitive to unmeasured sampling differences. Without open data and code, independent reproduction is currently impossible.

“All behavioural data and analysis scripts will be made publicly available on OSF upon publication”

Zhang & He, Confidence Freeze · Data Availability

“t(97)=X.XX, p<.001”

Zhang & He, Confidence Freeze · Results, Manipulation Check

Abstract

Humans must flexibly arbitrate between exploring alternatives and exploiting learned strategies, yet they frequently exhibit maladaptive persistence by continuing to execute failing strategies despite accumulating negative evidence. Here we propose a ``confidence-freeze'' account that reframes such persistence as a dynamic learning state rather than a stable dispositional trait. Using a multi-reversal two-armed bandit task across three experiments (total N = 332; 19,920 trials), we first show that human learners normally make use of the symmetric statistical structure inherent in outcome trajectories: runs of successes provide positive evidence for environmental stability and thus for strategy maintenance, whereas runs of failures provide negative evidence and should raise switching probability. Behaviour in the control group conformed to this normative pattern. However, individuals who experienced a high rate of early success (90\% vs.\ 60\%) displayed a robust and selective distortion after the first reversal: they persisted through long stretches of non-reward (mean = 6.2 consecutive losses) while their metacognitive confidence ratings simultaneously dropped from 5 to 2 on a 7-point scale.

Challenge the Review

Pick a starting point or write your own. Challenges run in the background, so you can keep reading while the AI investigates.

Challenges are public to read, but only signed-in members can post them. Your challenge text is stored with your account for moderation, but usernames are not shown in the public thread.

No challenges yet. Disagree with the review? Ask the AI to revisit a specific claim.