Skip to main content

Propaganda

Trustworthy LLMs: A Survey and Guideline for Evaluating Large Language Models’ Alignment

Liu et al. (2024)

Sub-category
Risk Domain

Using AI systems to conduct large-scale disinformation campaigns, malicious surveillance, or targeted and sophisticated automated censorship and propaganda, with the aim of manipulating political processes, public opinion, and behavior.

LLMs can be leveraged, by malicious users, to proactively generate propaganda information that can facilitate the spreading of a target(p. 19)

Part of Resistance to Misuse

Other risks from Liu et al. (2024) (34)