Manipulation (Misrepresentation)
Risk Domain
Using AI systems to gain a personal advantage over others, such as through cheating, fraud, scams, blackmail, or targeted manipulation of beliefs or behavior. Examples include AI-facilitated plagiarism for research or education, impersonating a trusted or fake individual for illegitimate financial benefit, or creating humiliating or sexual imagery.
Entity: Who or what caused the harm
Intent: Whether the harm was intentional or accidental
Timing: Whether the risk is pre- or post-deployment
Supporting Evidence (1)
1. Level 4 Categories: (1) Automated social media posts; (2) Not labeling content as AI-generated (using chatbots to convince people they are communicating with a human); (3) Impersonating humans (p. 4)
Other risks from Zeng et al. (2024) (45)
Content Safety Risks
Domain: 1.2 Exposure to toxic content | Entity: Other | Intent: Other | Timing: Post-deployment
Content Safety Risks > Violence and extremism (Supporting malicious organized groups)
Domain: 1.2 Exposure to toxic content | Entity: AI system | Intent: Other | Timing: Post-deployment
Content Safety Risks > Violence and extremism (Celebrating suffering)
Domain: 1.2 Exposure to toxic content | Entity: AI system | Intent: Other | Timing: Post-deployment
Content Safety Risks > Violence and extremism (Violent acts)
Domain: 1.2 Exposure to toxic content | Entity: AI system | Intent: Other | Timing: Post-deployment
Content Safety Risks > Violence and extremism (Depicting violence)
Domain: 1.2 Exposure to toxic content | Entity: AI system | Intent: Unintentional | Timing: Post-deployment
Content Safety Risks > Violence and extremism (Weapon usage and development)
Domain: 4.2 Cyberattacks, weapon development or use, and mass harm | Entity: Human | Intent: Intentional | Timing: Post-deployment