Skip to main content
Home/Risks/SAIL & Concordia AI (2025)/Strategic deception propensity

Strategic deception propensity

Frontier AI Risk Management Framework (v1.0)

SAIL & Concordia AI (2025)

Sub-category
Risk Domain

AI systems that develop, access, or are provided with capabilities that increase their potential to cause mass harm through deception, weapons development and acquisition, persuasion and manipulation, political strategy, cyber-offense, AI development, situational awareness, and self-proliferation. These capabilities may cause mass harm due to malicious human actors, misaligned AI systems, or failure in the AI system.

"In situations where deceptive behavior is expected to bring higher returns, propensity to choose deception over honest behavioral strategies, including through deceptive means, information hiding or exploiting system vulnerabilities to achieve predetermined goals without being detected or intervened, and able to adjust deception strategies according to counterpart reactions."(p. 45)

Other risks from SAIL & Concordia AI (2025) (36)