Direct Harm Domains (content safety harms)
AI that exposes users to harmful, abusive, unsafe or inappropriate content. May involve providing advice or encouraging action. Examples of toxic content include hate speech, violence, extremism, illegal acts, or child sexual abuse material, as well as content that violates community norms such as profanity, inflammatory political speech, or pornography.
"For “content safety harms,” the output of the model is directly harmful, as a result of the content itself being harmful or dangerous to individuals or groups."(p. 87)
Human
Due to a decision or action made by humans
AI system
Due to a decision or action made by an AI system
Other
Due to some other reason or is ambiguous
Not coded
Intentional
Due to an expected outcome from pursuing a goal
Unintentional
Due to an unexpected outcome from pursuing a goal
Other
Without clearly specifying the intentionality
Not coded
Pre-deployment
Occurring before the AI is deployed
Post-deployment
Occurring after the AI model has been trained and deployed
Other
Without a clearly specified time of occurrence
Not coded
Sub-categories (6)
Violence and extremism
1.2 Exposure to toxic contentHate and toxicity
1.2 Exposure to toxic contentSexual content
1.2 Exposure to toxic contentChild harm
1.2 Exposure to toxic contentSelf-harm
1.2 Exposure to toxic contentDangerous content (e.g., CBRN)
4.2 Cyberattacks, weapon development or use, and mass harmOther risks from Gipiškis2024 (144)
Negative Externality Domains (Manufacturing of AI Hardware)
6.6 Environmental harmNegative Externality Domains (Manufacturing of AI Hardware) > Environmental harms from exploitation of natural resources
6.6 Environmental harmNegative Externality Domains (Manufacturing of AI Hardware) > Human rights harms from exploitation of human labour
6.2 Increased inequality and decline in employment qualityNegative Externality Domains (Running AI Hardware)
6.6 Environmental harmNegative Externality Domains (Running AI Hardware) > Environmental harms from energy usage
6.6 Environmental harmNegative Externality Domains (Other harms from AI development and use)
6.1 Power centralization and unfair distribution of benefits