BackDeception
Deception
Risk Domain
Risks from multi-agent interactions, due to incentives (which can lead to conflict or collusion) and/or the structure of multi-agent systems, which can create cascading failures, selection pressures, new security vulnerabilities, and a lack of shared information and trust.
Entity— Who or what caused the harm
Intent— Whether the harm was intentional or accidental
Timing— Whether the risk is pre- or post-deployment
Part of Information Asymmetries
Other risks from Hammond2025 (42)
Miscoordination
7.6 Multi-agent risksAI systemUnintentionalPost-deployment
Miscoordination > Incompatible strategies
7.6 Multi-agent risksAI systemUnintentionalPost-deployment
Miscoordination > Credit Assignment
7.6 Multi-agent risksAI systemUnintentionalPost-deployment
Miscoordination > Limited Interactions
7.6 Multi-agent risksAI systemUnintentionalPost-deployment
Conflict
7.6 Multi-agent risksAI systemOtherPost-deployment
Conflict > Social Dilemmas
7.6 Multi-agent risksAI systemIntentionalPost-deployment