Skip to main content
BackMalicious Use and Unleashing AI Agents
Home/Risks/Deng et al. (2023)/Malicious Use and Unleashing AI Agents

Malicious Use and Unleashing AI Agents

Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements

Deng et al. (2023)

Category

LMs, due to their remarkable capabilities, carry the same potential for malice as other technological products. For instance, they may be used in information warfare to generate deceptive information or unlawful content, thereby having a significant impact on individuals and society. As current LMs are increasingly built as agents to accomplish user objectives, they may disregard the moral and safety guidelines if operating without adequate supervision. Instead, they may execute user commands mechanically without considering the potential damage. They might interact unpredictably with humans and other systems, especially in open environments(p. 4)

Other risks from Deng et al. (2023) (6)