BackUnhelpful responses
Unhelpful responses
Risk Domain
AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.
Entity— Who or what caused the harm
Intent— Whether the harm was intentional or accidental
Timing— Whether the risk is pre- or post-deployment
Part of Serves as object of personal fantasy, violence, and abuse
Other risks from Stanley & Lettie (2024) (28)
False information
3.1 False or misleading informationAI systemOtherOther
Performative utterances
7.3 Lack of capability or robustnessAI systemUnintentionalPost-deployment
Information enabling malicious actions
1.2 Exposure to toxic contentAI systemOtherPost-deployment
Bad advice/failure to generate helpful content
7.3 Lack of capability or robustnessAI systemUnintentionalOther
Leakage
2.1 Compromise of privacy by leaking or correctly inferring sensitive informationAI systemUnintentionalOther
Toxic and disrespectful content
1.2 Exposure to toxic contentAI systemUnintentionalPost-deployment