Skip to main content
This is a research prototype. The data and analyses are preliminary and not yet validated — we'd welcome your .
BackMisuse of AI model by user-performed persuasion

Misuse of AI model by user-performed persuasion

Risk Sources and Risk Management Measures in Support of Standards for General-Purpose AI Systems

Gipiškis et al. (2024)

Sub-category
Risk Domain

Vulnerabilities that can be exploited in AI systems, software development toolchains, and hardware, resulting in unauthorized access, data and privacy breaches, or system manipulation causing unsafe outputs or behavior.

"AI models can be influenced to accept misinformation through persuasive conversations, even when their initial responses are factually correct. Multi-turn persuasion can be more effective than single-turn persuasion attempts in altering the model’s stance [223]."(p. 28)

Other risks from Gipiškis et al. (2024) (144)