This subcategory includes the broad set of risks associated with the failure of an AI system to fulfill its intended purpose. The literature identifies four main situations in which an AI may fail to perform as expected or desired.
First, the AI system can fail if it lacks the inherent capability or skill required to perform a task or if this skill is poorly developed. The consequences may be particularly harmful in situations where an AI is required to reason at a human level about important moral issues but does not possess this capability. Cultural, individual, and temporal differences in ideas of what is "right" or "ethical" compound the challenge of endowing AI with appropriate and adaptable ethical standards that are fit for all purposes.
Second, the AI system can fail when it is not robust to out-of-distribution (OOD) situations: data or conditions that were not anticipated during its training phase. These failures may occur because the training data did not confer a particular skill on the AI, or because the skill was learned in a fragile way that does not generalize to unpredictable and complex real-world environments.
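A minimal sketch of this failure mode, using an illustrative toy model rather than any real system: a linear model fit to a nonlinear relationship looks accurate on inputs like those it was trained on, but its error grows sharply on inputs outside the training range.

```python
import numpy as np

rng = np.random.default_rng(0)

# Fit a linear model to a nonlinear function (y = x^2) using data drawn
# only from a narrow training range. The fit is accurate there, but the
# learned "skill" does not generalize outside that range.
X_train = rng.uniform(0.0, 1.0, size=200)
y_train = X_train ** 2

A = np.column_stack([X_train, np.ones_like(X_train)])
(a, b), *_ = np.linalg.lstsq(A, y_train, rcond=None)

def predict(x):
    return a * x + b

X_in = rng.uniform(0.0, 1.0, size=500)   # in-distribution inputs
X_ood = rng.uniform(5.0, 6.0, size=500)  # out-of-distribution inputs

err_in = np.mean((predict(X_in) - X_in ** 2) ** 2)
err_ood = np.mean((predict(X_ood) - X_ood ** 2) ** 2)

print(f"in-distribution MSE:     {err_in:.4f}")   # small
print(f"out-of-distribution MSE: {err_ood:.1f}")  # orders of magnitude larger
```

The same pattern (low training-region error, large extrapolation error) is what OOD evaluations of deployed systems try to surface before the real world does.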
Third, the AI system can fail or become unstable when it is unfit to handle unusual changes or perturbations in input data. These unusual changes could be due to environmental noise, invalid inputs, or adversarial inputs from a malicious attacker.
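For a linear classifier this worst-case perturbation has a closed form, and the same sign-based construction underlies the fast gradient sign method (FGSM) used against neural networks. A minimal sketch with illustrative weights (not drawn from any real system):

```python
import numpy as np

# A fixed linear classifier: sign(w.x + b). For linear models, the
# worst-case perturbation of size eps moves each feature by eps in the
# direction that most lowers the score, i.e. x_adv = x - eps * sign(w).
w = np.array([2.0, -1.0, 0.5])
b = 0.1

def classify(x):
    return 1 if np.dot(w, x) + b > 0 else -1

x = np.array([0.2, 0.3, -0.1])  # a benign input near the decision boundary
eps = 0.2                        # small, bounded perturbation budget

x_adv = x - eps * np.sign(w)

print(classify(x))      # prints 1: original prediction
print(classify(x_adv))  # prints -1: flipped by a small targeted perturbation
```

Random environmental noise of the same magnitude usually leaves the prediction unchanged; it is the targeted direction of an adversarial perturbation that makes small changes so damaging.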
Fourth, the AI system can fail as a result of oversights, undetected bugs, or errors in the design process. A common design oversight is a lack of comprehensive technical safeguards to prevent unintended downstream uses or consequences. Critical design choices about the algorithm, optimization techniques, and model architecture can also directly influence whether a system is able to consistently perform its intended function, leading to possible harm.
Excerpt from the MIT AI Risk Repository full report
AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.
Incident volume relative to governance coverage (each dot is one of 24 subdomains)
Entity: Who or what caused the harm
Intent: Whether the harm was intentional or accidental
Timing: Whether the risk is pre- or post-deployment
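The three causal dimensions above can be sketched as a small data structure. The enumerated values here are illustrative assumptions for the sketch, not the repository's canonical schema:

```python
from dataclasses import dataclass
from enum import Enum

# Assumed value sets for each causal dimension (illustrative only).
class Entity(Enum):
    HUMAN = "human"
    AI = "ai"
    OTHER = "other"

class Intent(Enum):
    INTENTIONAL = "intentional"
    UNINTENTIONAL = "unintentional"
    OTHER = "other"

class Timing(Enum):
    PRE_DEPLOYMENT = "pre-deployment"
    POST_DEPLOYMENT = "post-deployment"
    OTHER = "other"

@dataclass(frozen=True)
class CausalClassification:
    entity: Entity  # who or what caused the harm
    intent: Intent  # whether the harm was intentional or accidental
    timing: Timing  # whether the risk arises pre- or post-deployment

# e.g. an AI system unintentionally causing harm after deployment:
incident = CausalClassification(Entity.AI, Intent.UNINTENTIONAL,
                                Timing.POST_DEPLOYMENT)
print(incident.timing.value)  # prints "post-deployment"
```

Encoding the taxonomy this way makes each incident record a fixed triple, which is what allows incidents to be aggregated and compared across subdomains.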
A nurse at St. Rose Dominican Hospital declined to follow an AI sepsis alert recommending IV fluids for an elderly patient with kidney problems; the override potentially prevented life-threatening complications.
Developers: Unknown Sepsis Alert Model Developer, Unknown Healthcare Technology
Deployers: St. Rose Dominican Hospital (Henderson, Nevada)
An Amazon delivery van became stranded on the Broomway, a dangerous medieval path in Essex, England, after the driver followed GPS directions to reach Foulness Island in 2026.
Developers: Unknown GPS/Satnav Developer
Deployers: Unknown GPS/Satnav Developer, Amazon
A Waymo autonomous vehicle struck a child who ran across the street from behind a parked SUV near a Santa Monica elementary school during drop-off hours on January 23, 2025, causing minor injuries.
Developers: Waymo
Deployers: Waymo
Vulnerabilities that can be exploited in AI systems, software development toolchains, and hardware, resulting in unauthorized access, data and privacy breaches, or system manipulation causing unsafe outputs or behavior. (324 shared governance docs)
AI systems that memorize and leak sensitive personal data or infer private information about individuals without their consent. Unexpected or unauthorized sharing of data and information can compromise user expectations of privacy, assist identity theft, or cause loss of confidential intellectual property. (260 shared governance docs)
Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately. (259 shared governance docs)
Unequal treatment of individuals or groups by AI, often based on race, gender, or other sensitive characteristics, resulting in unfair outcomes and unfair representation of those groups. (181 shared governance docs)
Authorizes the Secretary of Defense to establish AI Institutes focused on national security. Directs support for interdisciplinary AI research, partnerships, innovation ecosystems, and workforce development.
Instructs the Secretary of the Navy to develop a pilot program for generative AI and spatial computing in training. Requires an assessment of feasibility compared to other methods. Directs a report on program results within 90 days of the program's termination.
Establishes the Artificial Intelligence Futures Steering Committee by April 1, 2026, under the Secretary of Defense. Directs it to develop policies for AI adoption, assess AI trajectories, and analyze AI risks and adversary developments. Requires quarterly meetings and a report to U.S. Congress by January 31, 2027.