Skip to main content

Diluting Rights

An Exploratory Diagnosis of Artificial Intelligence Risks for a Responsible Governance

Teixeira et al. (2022)

Category
Risk Domain

AI systems acting in conflict with human goals or values, especially the goals of designers or users, or ethical standards. These misaligned behaviors may be introduced by humans during design and development, such as through reward hacking and goal misgeneralisation, or may result from AI using dangerous capabilities such as manipulation, deception, situational awareness to seek power, self-proliferate, or achieve other goals.

"A possible consequence of self-interest in AI generation of ethical guidelines."(p. 31)

Other risks from Teixeira et al. (2022) (15)