Skip to main content
Home/Risks/Gipiškis2024/Deceptive behavior because of an incorrect world model

Deceptive behavior because of an incorrect world model

Sub-category
Risk Domain

AI systems that develop, access, or are provided with capabilities that increase their potential to cause mass harm through deception, weapons development and acquisition, persuasion and manipulation, political strategy, cyber-offense, AI development, situational awareness, and self-proliferation. These capabilities may cause mass harm due to malicious human actors, misaligned AI systems, or failure in the AI system.

"AI systems can create deceptive outputs because their learned world model is not an accurate model of the real world [210]."(p. 31)

Part of Agency (Deception)

Other risks from Gipiškis2024 (144)