Automation bias is the tendency for humans to favor suggestions and outputs from automated decision-making systems over contradictory information from non-automated sources, even when the non-automated information is correct. The term describes a specific form of cognitive bias in which people assign greater authority and trust to automated recommendations than the situation warrants, often accepting them without independent verification. Automation bias affects both novice and expert users and has been documented across a wide range of domains, including aviation, healthcare, criminal justice, military operations, and autonomous driving.
The concept was formally introduced in the mid-1990s through research on cockpit automation by Kathleen Mosier, Linda Skitka, and colleagues, who defined automation bias as the use of automated cues "as a heuristic replacement for vigilant information seeking and processing." As artificial intelligence and machine learning systems become more prevalent in everyday decision-making, automation bias has become a growing concern in the fields of AI safety, fairness, and human-computer interaction.
Imagine you have a magic toy that tells you the color of things. Most of the time it gets the answer right, so you start trusting it completely. One day, the toy says a red ball is blue. Even though you can see the ball is red with your own eyes, you believe the toy instead because it has been right so many times before. That is automation bias: trusting a machine so much that you stop thinking for yourself, even when you have good reasons to disagree.
Concerns about human over-reliance on automated systems predate modern AI. In 1983, Lisanne Bainbridge published "Ironies of Automation" in the journal Automatica, a paper that has since attracted over 1,800 citations. Bainbridge identified a fundamental paradox: the more tasks are automated, the less practice human operators get at performing those tasks manually, which means they are less prepared for the rare but important moments when they need to intervene. She also observed that increased automation decreases cognitive workload under normal conditions but simultaneously increases the opportunities for monitoring errors.
The term "automation bias" was introduced by Mosier, Skitka, Heers, and Burdick in a 1996 study examining pilot behavior in simulated cockpit environments. Their research showed that pilots relied on automated aids as a shortcut for information processing and made systematic errors as a result. In a follow-up study published in 1999 in the International Journal of Human-Computer Studies, Skitka, Mosier, and Burdick demonstrated that participants working with a highly (but not perfectly) reliable automated aid performed worse on monitoring tasks than participants working without automation. The automated group committed both errors of omission and commission that the non-automated group avoided.
In 2010, Raja Parasuraman and Dietrich Manzey published a major review in Human Factors titled "Complacency and Bias in Human Use of Automation: An Attentional Integration." This paper proposed an integrated theoretical model showing that automation bias and automation-induced complacency arise from the dynamic interaction of personal, situational, and automation-related characteristics, with attention as the central mechanism. Their framework positioned automation bias and complacency as "different manifestations of overlapping automation-induced phenomena" rather than entirely separate constructs.
Research has identified several psychological and situational factors that contribute to automation bias, summarized in the two tables below.
| Psychological factor | Description |
|---|---|
| Cognitive miser tendency | Humans naturally prefer the path of least cognitive effort. Accepting an automated recommendation requires less mental work than independently evaluating all available information. |
| Perceived superiority of automation | Users tend to view automated systems as analytically superior to human judgment, leading them to defer to machine outputs even when their own assessment conflicts. |
| Effort reduction under task sharing | When sharing decision-making responsibility with an automated system, people reduce their own cognitive investment in the task. |
| Anchoring effect | An automated recommendation serves as an anchor, biasing subsequent human judgment toward that recommendation even after it has been shown to be incorrect. |
| Situational factor | Description |
|---|---|
| High workload and time pressure | When operators face multiple simultaneous tasks or time constraints, they are more likely to rely on automated suggestions rather than conducting independent analysis. |
| System reliability history | A system with a strong track record of accuracy builds user trust, which then persists even in situations where the system produces errors (a pattern called "learned carelessness"). |
| Display design and salience | Automated recommendations that are displayed prominently on a screen are more likely to be followed without question, regardless of their accuracy. |
| Lack of transparency | When users do not understand how an automated system generates its recommendations, they are less able to evaluate whether a given output is trustworthy. |
| Team dynamics | Research shows that teams do not outperform individuals in detecting automation failures. Group settings can actually reinforce automation bias through diffusion of responsibility. |
Automation bias produces two distinct categories of errors, first defined by Mosier and Skitka in 1996.
Omission errors occur when an automated system fails to alert the user to a problem and the user, relying on the system's silence as an indication that everything is normal, fails to detect the issue independently. In these cases, the human operator does not take a necessary action because the automation did not prompt them to do so. Studies have reported omission error rates as high as 55% in some experimental settings.
Commission errors occur when an automated system provides an incorrect recommendation and the user follows it despite the availability of contradictory information from other valid sources. The user actively takes an inappropriate action because they accept the automated suggestion over their own training, experience, or direct observations. Commission errors are considered especially concerning because they involve the active dismissal or ignoring of correct information.
| Error type | Definition | Example |
|---|---|---|
| Omission error | Failure to act because automation did not provide an alert | A pilot misses an engine malfunction because the automated monitoring system does not flag it |
| Commission error | Following an incorrect automated recommendation despite contradictory evidence | A physician changes a correct diagnosis to match an incorrect suggestion from a clinical decision support system |
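The taxonomy can be restated as a simple classification rule. The following Python sketch is purely illustrative: the `Trial` structure and `classify_error` function are hypothetical names introduced here, not drawn from the cited studies.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Trial:
    """One monitoring trial: the true state, what the automation indicated, what the operator did."""
    fault_present: bool        # ground truth: does a problem actually exist?
    automation_alerted: bool   # did the automated aid flag the problem or recommend action?
    operator_acted: bool       # did the human take the corrective action?

def classify_error(trial: Trial) -> Optional[str]:
    """Classify a trial under the omission/commission taxonomy (illustrative sketch)."""
    if trial.fault_present and not trial.automation_alerted and not trial.operator_acted:
        # The automation stayed silent, and the operator treated silence as "all clear".
        return "omission error"
    if not trial.fault_present and trial.automation_alerted and trial.operator_acted:
        # The automation gave a spurious recommendation, and the operator followed it anyway.
        return "commission error"
    return None  # correct behavior, or an error not attributable to reliance on the automation

# Example: the aid misses an engine fault, and the pilot, trusting the silence, misses it too.
print(classify_error(Trial(fault_present=True, automation_alerted=False, operator_acted=False)))
```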
Automation bias and automation complacency are closely related but distinct concepts. Automation complacency refers to insufficient monitoring of an automated system's output, typically driven by a belief that the system is reliable enough that close monitoring is unnecessary. NASA's Aviation Safety Reporting System defines complacency as "self-satisfaction that may result in non-vigilance based on an unjustified assumption of satisfactory system state."
The key distinction is one of mechanism. Automation bias involves trusting the content of a decision-support system's output (accepting what it recommends), while complacency involves inadequate attention to and monitoring of the system's behavior (not watching it closely enough). In practice, both phenomena frequently co-occur and reinforce each other. Parasuraman and Manzey (2010) argued that the two concepts represent different aspects of the same underlying attentional dysfunction and proposed treating them within a single integrative framework.
A related concept is the "automation irony" identified by Bainbridge (1983): the more reliable and capable an automated system becomes, the less the human operator practices the skills needed to handle system failures. This creates a paradoxical situation in which the humans who are supposed to serve as a safety net for automation failures are progressively less capable of performing that role.
Aviation was the first domain in which automation bias was systematically studied, and it remains one of the most extensively documented.
Clinical decision support systems (CDSS) are designed to improve diagnostic accuracy, but research has shown that they also introduce new types of errors through automation bias.
Algorithmic risk assessment tools are increasingly used in the criminal justice system, raising concerns about automation bias in judicial decision-making.
A 2004 study by Mary Cummings documented cases in which automation bias contributed to fatal military decisions, including friendly-fire incidents during operations in Iraq. In these cases, operators relied on automated targeting systems without adequately verifying the identity of targets.
A 2021 U.S. drone strike in Kabul, Afghanistan, was later determined to have killed ten civilians, including seven children, rather than the intended target. Investigations pointed to over-reliance on automated surveillance and targeting systems as a contributing factor.
The rise of driver-assistance and autonomous driving technologies has created new contexts for automation bias.
The proliferation of large language models (LLMs) such as ChatGPT, Claude, and Gemini has introduced automation bias into new areas of everyday life. Because LLMs generate fluent, confident-sounding text, users may accept their outputs without critical evaluation, even when those outputs contain factual errors (hallucinations).
Research published in 2024 in International Data Privacy Law (Oxford Academic) examined the specific risks of automation bias with generative LLMs, noting that the conversational and authoritative tone of LLM responses makes users especially likely to accept them uncritically. A 2025 clinical trial found that physicians who had received AI training still demonstrated automation bias when using LLMs for diagnostic reasoning, over-relying on LLM-generated differential diagnoses.
In 2012, Goddard, Roudsari, and Wyatt published a systematic review in the Journal of the American Medical Informatics Association (JAMIA) that examined the frequency of automation bias across research fields, the factors that mediate its effects, and the interventions that can mitigate it.
Researchers and system designers have proposed a range of interventions to reduce automation bias. No single strategy has been shown to eliminate the problem entirely, but several approaches have demonstrated partial effectiveness. These fall broadly into design-oriented strategies and training or procedural strategies, summarized in the two tables below.
| Design strategy | Description | Evidence |
|---|---|---|
| Reduced display prominence | Making automated recommendations less visually dominant on the screen so they do not anchor the user's attention | Mixed results; some studies confirm the effect, others find display prominence has limited impact |
| Transparency and explainability | Providing users with information about how the system generated its recommendation (see interpretability) | Users who understand system reasoning adjust their reliance accordingly, but overly technical explanations can backfire by reinforcing misplaced trust |
| Confidence indicators | Showing the system's estimated confidence or reliability for each recommendation | Can help calibrate trust, but confidence indicators themselves can become a source of bias if users do not understand probability well |
| Variable reliability cues | Designing systems whose reliability indicators change over time rather than presenting a constant level of confidence | Reduces complacency by preventing users from developing a fixed expectation of system accuracy |
| Framing as support rather than directive | Presenting automated outputs as suggestions or second opinions rather than definitive answers (illustrated, together with confidence indicators, in the sketch after this table) | Encourages users to treat automated input as one factor among several rather than the final word |
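As a rough illustration of how two of these design strategies, confidence indicators and support-style framing, might be combined, consider the following sketch. The function and message format are hypothetical and are not taken from any cited system.

```python
def present_recommendation(label: str, confidence: float) -> str:
    """Render an automated suggestion with an explicit confidence cue and
    non-directive framing (hypothetical format, for illustration only)."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be a probability between 0 and 1")
    return (
        f"Suggested interpretation: {label} (estimated confidence {confidence:.0%}).\n"
        "This is offered as a second opinion, not a determination. "
        "Compare it against your own assessment and other sources before acting."
    )

print(present_recommendation("benign nodule", 0.82))
```

The intent is that the output reads as one input among several, so an authoritative-sounding answer does not displace the user's own judgment.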
| Training or procedural strategy | Description | Evidence |
|---|---|---|
| Error exposure training | Deliberately introducing system errors during training so users experience automation failures firsthand | More effective than simply telling users that errors can occur; reduces commission errors but may not reduce omission errors |
| Accountability structures | Making users explicitly accountable for the accuracy of their decisions, regardless of whether they used automated assistance | Mosier et al. (1996) found that pilots with an internalized sense of accountability were significantly more likely to verify automated outputs |
| Verification checklists | Requiring users to complete a structured verification process before acting on automated recommendations | Effective but increases time pressure and task complexity, which can introduce its own set of problems |
| Cross-referencing protocols | Establishing procedures that require users to compare automated recommendations against at least one independent source of information | Effective in principle, but compliance decreases under workload and time pressure |
The human-in-the-loop paradigm, in which human experts review and approve automated outputs before they are acted upon, is widely recommended as a safeguard against automation bias. However, research has shown that simply placing a human in the loop does not automatically solve the problem. If the human reviewer is subject to automation bias (as most humans are), they may rubber-stamp automated outputs without meaningful scrutiny. Effective human-in-the-loop oversight therefore depends on procedures that require genuine, independent evaluation rather than passive approval.
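One way to make review more than a rubber stamp is to require the reviewer to commit to an independent assessment before the automated output is revealed, and to force explicit resolution when the two disagree. The following Python sketch is a schematic workflow under those assumptions, not a standard interface; the names and steps are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class ReviewRecord:
    case_id: str
    human_assessment: str    # recorded before the automated output is shown
    automated_output: str
    final_decision: str
    agreed: bool

def human_in_the_loop_review(case_id: str, automated_output: str,
                             ask_human: Callable[[str], str]) -> ReviewRecord:
    """Schematic review gate: independent judgment first, then comparison,
    then explicit resolution of any disagreement."""
    # Step 1: the reviewer commits to their own answer before seeing the automation.
    human_assessment = ask_human(f"Case {case_id}: enter your own assessment")
    # Step 2: only now is the automated recommendation revealed and compared.
    agreed = human_assessment.strip().lower() == automated_output.strip().lower()
    if agreed:
        final_decision = automated_output
    else:
        # Step 3: disagreement must be resolved explicitly (e.g. by consulting an
        # independent source or a second reviewer), not silently overwritten.
        final_decision = ask_human(
            f"Your assessment ({human_assessment!r}) differs from the automated output "
            f"({automated_output!r}). Enter the final decision"
        )
    return ReviewRecord(case_id, human_assessment, automated_output, final_decision, agreed)

# Usage example (interactive): record = human_in_the_loop_review("case-042", "no abnormality detected", input)
```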
Research suggests that the effectiveness of automated decision aids drops sharply when system reliability falls below approximately 70%. Below this threshold, users tend to distrust and ignore the system, which can lead to underuse rather than overuse. Above this threshold, users tend to trust the system increasingly, with the risk of automation bias growing as reliability increases.
This creates a nuanced problem for system designers: a system that is reliable enough to be useful (above 70%) is also reliable enough to induce automation bias. Systems that are extremely reliable (above 95%) may produce the strongest automation bias because users have so few experiences of system failure that they stop expecting errors entirely. This pattern has been described as the "first-failure effect," in which the first experienced system failure causes a sharp decline in trust that is followed by a slow recovery back to high trust levels.
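The interaction between reliability and verification can be illustrated with a toy model, offered here as an illustrative assumption rather than a result from the cited research: suppose the automation is correct with probability r, the human verifies a given output with probability v, and an independent check catches an automation error with probability h. Team accuracy is then r + v × (1 − r) × h, so the measurable benefit of verifying shrinks as r grows, even though the remaining unverified errors may be the most consequential ones.

```python
def team_accuracy(r: float, h: float, v: float) -> float:
    """Toy model of human-automation team accuracy (illustrative assumption only).
    r: probability the automation is correct
    h: probability an independent human check catches an automation error
    v: probability the human actually performs that check
    When the human defers (probability 1 - v), the team is right only if the automation is."""
    return r + v * (1.0 - r) * h

for r in (0.70, 0.90, 0.99):
    diligent = team_accuracy(r, h=0.8, v=1.0)    # always verifies
    complacent = team_accuracy(r, h=0.8, v=0.2)  # rarely verifies
    print(f"reliability {r:.2f}: always-verify {diligent:.3f}, "
          f"rarely-verify {complacent:.3f}, gap {diligent - complacent:.3f}")
```

In this toy model the accuracy gap between a diligent and a complacent reviewer falls from roughly 0.19 at 70% reliability to under 0.01 at 99%, which is one way to see how very reliable systems can erode the habit of verification described above.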