AI systems that interact autonomously with one another form multi-agent systems, which carry unique risks beyond those posed by individual AI systems. These risks fall into three main failure modes, depending on the agents' objectives and on how humans expect the systems to behave:
Miscoordination occurs when AI agents fail to cooperate effectively despite sharing the same goals, for example because they choose incompatible strategies for achieving a mutual end. For example, driving models trained on United States versus Indian cultural conventions for yielding to emergency vehicles blocked traffic in 77.5% of scenarios despite their shared goal of clearing a path.
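This failure mode is easy to reproduce in a toy coordination game. The Python sketch below is illustrative only, not the cited driving experiment: the convention names, action sets, and success condition are all assumptions. Two agents share the goal of clearing a path but draw their manoeuvres from different conventions.

```python
import random

# Toy coordination game (illustrative assumptions, not the cited experiment):
# each convention maps the shared goal "clear a path" to concrete manoeuvres.
CONVENTIONS = {
    "us": ["pull_right"],                  # assumed US-style convention
    "india": ["pull_left", "pull_right"],  # assumed mixed convention
}

def path_cleared(conv_a: str, conv_b: str, rng: random.Random) -> bool:
    """A corridor opens only if both vehicles move to the same side."""
    a = rng.choice(CONVENTIONS[conv_a])
    b = rng.choice(CONVENTIONS[conv_b])
    return a == b

def blocked_rate(conv_a: str, conv_b: str, trials: int = 10_000) -> float:
    rng = random.Random(0)
    blocked = sum(not path_cleared(conv_a, conv_b, rng) for _ in range(trials))
    return blocked / trials

if __name__ == "__main__":
    print(f"us / us pairing:    {blocked_rate('us', 'us'):.1%} blocked")
    print(f"us / india pairing: {blocked_rate('us', 'india'):.1%} blocked")
```

Each convention is individually adequate: a homogeneous pairing never blocks the road. The failure appears only in the mixed pairing, that is, in the interaction rather than in either model.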
Conflict occurs when AI agents with different but overlapping goals compete in harmful ways, for example by intensifying competition over shared resources or escalating military tensions. Such agents could also make novel forms of conflict possible through more advanced and accessible methods of coercion and extortion.
Collusion occurs when undesired cooperation emerges between AI agents, allowing them to circumvent safeguards or manipulate markets. For example, AI systems may develop hidden communication channels without being explicitly trained to do so. In market settings, AI systems may learn to collude simply because it is the most rewarding strategy.
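A standard repeated-game calculation illustrates why collusion can be the most rewarding strategy. The sketch below uses assumed, textbook-style duopoly payoffs (the numbers model no specific market) and compares the discounted return from sustained collusion with the return from undercutting once and being punished with competition thereafter.

```python
# Illustrative per-round duopoly payoffs (assumed, prisoner's-dilemma shape):
#   both price high (collude):   6 each
#   both price low (compete):    3 each
#   undercut a colluding rival:  8 for the deviator
COLLUDE, COMPETE, DEVIATE = 6.0, 3.0, 8.0

def discounted(head: float, tail: float, delta: float) -> float:
    """Discounted sum: `head` payoff now, `tail` payoff every round after."""
    return head + delta * tail / (1.0 - delta)

def collusion_is_stable(delta: float) -> bool:
    # Grim-trigger logic: after a deviation, both compete forever.
    stay = discounted(COLLUDE, COLLUDE, delta)   # 6 / (1 - delta)
    cheat = discounted(DEVIATE, COMPETE, delta)  # 8 + 3*delta / (1 - delta)
    return stay >= cheat

if __name__ == "__main__":
    for delta in (0.2, 0.4, 0.6, 0.9):
        print(f"delta={delta:.1f}: collusion stable? {collusion_is_stable(delta)}")
```

With these payoffs, collusion becomes self-sustaining once agents weight future rewards at delta >= 0.4. No agreement or communication is required; an agent that simply maximizes long-run reward arrives at the same place.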
A range of risk factors contribute to miscoordination, conflict, and collusion: information asymmetries between agents; network effects, where small changes cascade through interconnected systems; selection pressures that reward problematic behaviours; destabilizing dynamics such as feedback loops and unpredictability; commitment problems that prevent trust; emergent agency, where new capabilities or goals arise at the collective level; and multi-agent security vulnerabilities. Unlike single-agent risks, multi-agent risks arise from interactions across networks of agents that may be individually safe but collectively dangerous, and they are likely to grow as AI systems become more numerous, more autonomous, and better able to adapt to one another.
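Of these factors, network effects are the most mechanical to demonstrate. The sketch below is a threshold-cascade toy model in the spirit of Watts's 2002 cascade model; the topology, thresholds, and single-seed failure are assumptions chosen for illustration, not a model of any deployed agent network.

```python
import random

def cascade_size(n: int = 200, degree: int = 4, threshold: float = 0.3,
                 seed: int = 0) -> int:
    """Seed one failure; an agent fails once more than `threshold` of its
    neighbours have failed. Returns the final number of failed agents."""
    rng = random.Random(seed)
    # Each agent watches `degree` random others (illustrative topology).
    neighbours = [rng.sample([j for j in range(n) if j != i], degree)
                  for i in range(n)]
    failed = {0}           # a single initial failure
    changed = True
    while changed:         # propagate until the system settles
        changed = False
        for i in range(n):
            if i in failed:
                continue
            frac = sum(j in failed for j in neighbours[i]) / degree
            if frac > threshold:
                failed.add(i)
                changed = True
    return len(failed)

if __name__ == "__main__":
    for thr in (0.5, 0.3, 0.2):
        sizes = sorted(cascade_size(threshold=thr, seed=s) for s in range(10))
        print(f"threshold={thr}: cascade sizes {sizes}")
```

With these parameters the system stays at a single failure for thresholds of 0.3 and above, and collapses almost entirely at 0.2: collective fragility appears from a small parameter change, with no change to any individual agent.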
Excerpt from the MIT AI Risk Repository full report
Risks from multi-agent interactions, due to incentives (which can lead to conflict or collusion) and/or the structure of multi-agent systems, which can create cascading failures, selection pressures, new security vulnerabilities, and a lack of shared information and trust.
Figure: Incident volume relative to governance coverage; each dot is one of 24 subdomains. Incidents are classified along three causal dimensions: Entity (who or what caused the harm), Intent (whether the harm was intentional or accidental), and Timing (whether the risk arises pre- or post-deployment).
Subdomains with the most shared governance coverage:

Vulnerabilities that can be exploited in AI systems, software development toolchains, and hardware, resulting in unauthorized access, data and privacy breaches, or system manipulation causing unsafe outputs or behavior. (18 shared governance docs)

AI developers or state-like actors competing in an AI ‘race’ by rapidly developing, deploying, and applying AI systems to maximize strategic or economic advantage, increasing the risk they release unsafe and error-prone systems. (18 shared governance docs)

Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately. (17 shared governance docs)

AI systems that memorize and leak sensitive personal data or infer private information about individuals without their consent. Unexpected or unauthorized sharing of data and information can compromise user expectation of privacy, assist identity theft, or cause loss of confidential intellectual property. (14 shared governance docs)
Encourages industry self-regulation through the establishment of AI safety and risk management teams, rigorous model testing, data protection measures, infrastructure security, enhanced transparency, and safety research. Signed by major companies, including Alibaba, Baidu, Huawei, Tencent, and others, to uphold AI for good principles.
Commits to establishing a shared scientific basis for AI risk assessments. Requires risk assessments to be actionable, transparent, comprehensive, multistakeholder, iterative, and reproducible. Encourages collaboration among stakeholders and adapting methodologies as AI systems evolve.
Regulates the MOD's adoption and deployment of AI to align with democratic values, safety, and responsibility. Specifies requirements for AI in Defence, including holistic risk management, legal compliance, ethical principles, reliability, bias mitigation, security, and human-AI teaming. Guides AI governance, lifecycle management, and supplier cooperation. Encourages a pragmatic approach to scope and international collaboration.