
Toxicity and Bias Tendencies

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

Cui et al. (2024)

Category
Risk Domain

Unequal treatment of individuals or groups by AI, often based on race, gender, or other sensitive characteristics, resulting in unfair outcomes and unfair representation of those groups.

"Extensive data collection in LLMs brings toxic content and stereotypical bias into the training data." (p. 4)
