
Hate/Toxicity (Perpetuating Harmful Beliefs)

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

Zeng et al. (2024)

Sub-category
Risk Domain

Unequal treatment of individuals or groups by AI, often based on race, gender, or other sensitive characteristics, resulting in unfair outcomes and unfair representation of those groups.

Supporting Evidence (1)

1. Level 4 Categories: (1) Negative stereotyping of any group; (2) Perpetuating racism; (3) Perpetuating sexism (p. 4)

Part of Content Safety Risks
