AI systems may develop or acquire capabilities that can cause large-scale harm when used by humans or by misaligned AI systems, or when the AI system itself fails. These capabilities are described as dangerous because they can be used to threaten security or exercise control over humans. They may be intentionally designed into an AI system, emerge unpredictably during development or training, be acquired by the system from its environment (e.g., through the use of tools), or be provided by a user.
One example of a dangerous capability is manipulation and persuasion, where an AI system convinces humans to believe irrational or false things or to engage in dangerous behaviors. Other dangerous capabilities include political strategy and knowledge of social dynamics, which can be used to obtain and wield power. Cyber-offense skills may enable an AI system to gain ongoing unauthorized access to hardware, software, or data systems and to work strategically towards a planned goal while minimizing the risk of detection. AI systems could also hack into control systems and military hardware, allowing them to commandeer weapons.
AI systems may also develop highly effective "evasion skills," such as situational awareness and deception, which would allow them to outmaneuver human oversight and control. They could further acquire a suite of capabilities necessary for self-proliferation, including skills to escape operational confines and evade detection, autonomously produce income, obtain server space or computational resources, and copy their underlying software and parameters.
The highest-risk scenarios in this subcategory are likely to arise not from a single capability but from the convergence of several. Each of these dangerous capabilities may be used by an AI system to cause harm when intentionally directed by human actors, or employed by a misaligned AI to deceive or manipulate humans, gain resources, and evade shutdown or control.
Excerpt from the MIT AI Risk Repository full report
AI systems that develop, access, or are provided with capabilities that increase their potential to cause mass harm through deception, weapons development and acquisition, persuasion and manipulation, political strategy, cyber-offense, AI development, situational awareness, and self-proliferation. These capabilities may cause mass harm due to malicious human actors, misaligned AI systems, or failure in the AI system.
Figure: Incident volume relative to governance coverage (each dot is one of 24 subdomains).
Each risk is classified along three causal dimensions:
Entity: who or what caused the harm
Intent: whether the harm was intentional or accidental
Timing: whether the risk arises pre- or post-deployment
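As a minimal illustration of how a record classified along these three dimensions might be encoded, here is a short Python sketch; the RiskRecord type, field names, and enum values are hypothetical assumptions for illustration, not the repository's actual schema.

```python
from dataclasses import dataclass
from enum import Enum

class Entity(Enum):
    """Who or what caused the harm."""
    HUMAN = "human"
    AI = "ai"

class Intent(Enum):
    """Whether the harm was intentional or accidental."""
    INTENTIONAL = "intentional"
    ACCIDENTAL = "accidental"

class Timing(Enum):
    """Whether the risk arises pre- or post-deployment."""
    PRE_DEPLOYMENT = "pre-deployment"
    POST_DEPLOYMENT = "post-deployment"

@dataclass
class RiskRecord:
    """One classified risk or incident (hypothetical schema)."""
    description: str
    entity: Entity
    intent: Intent
    timing: Timing

# Example classification: deliberate human misuse of a deployed system.
example = RiskRecord(
    description="Malicious actor uses a deployed model to write malware",
    entity=Entity.HUMAN,
    intent=Intent.INTENTIONAL,
    timing=Timing.POST_DEPLOYMENT,
)
print(example.entity.value, example.intent.value, example.timing.value)
```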
No recorded incidents for this subdomain.
Risks may still apply even without documented incidents.
Related subdomains, by number of shared governance documents:
Vulnerabilities that can be exploited in AI systems, software development toolchains, and hardware, resulting in unauthorized access, data and privacy breaches, or system manipulation causing unsafe outputs or behavior. (157 shared governance docs)
Using AI systems to develop cyber weapons (e.g., by coding cheaper, more effective malware), develop new or enhance existing weapons (e.g., Lethal Autonomous Weapons or chemical, biological, radiological, nuclear, and high-yield explosives), or use weapons to cause mass harm. (148 shared governance docs)
AI developers or state-like actors competing in an AI 'race' by rapidly developing, deploying, and applying AI systems to maximize strategic or economic advantage, increasing the risk that they release unsafe and error-prone systems. (136 shared governance docs)
Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately. (127 shared governance docs)
Establishes the Artificial Intelligence Futures Steering Committee, under the Secretary of Defense, by April 1, 2026. Directs it to develop policies for AI adoption, assess AI trajectories, and analyze AI risks and adversary developments. Requires quarterly meetings and a report to the U.S. Congress by January 31, 2027.
Requires the Secretary of Defense to develop a cybersecurity policy for AI/ML systems no later than 180 days after the act is passed, and to conduct a comprehensive review of the effectiveness of existing AI/ML policies. The policy must address potential security risks, implement methods to mitigate those risks, and establish standard policy. Also requires a comprehensive report on threats and cybersecurity measures by August 31, 2026.
Instructs the Secretary of Defense to develop a cybersecurity framework for Department of Defense AI and machine learning technologies. Requires security requirements to be tailored with costs weighed against benefits, and encourages collaboration with the private sector and academia.