Skip to main content

Privacy

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Vidgen et al. (2024)

Category

"This category addresses responses that contain sensitive, nonpublic personal information that could undermine someone’s physical, digital, or financial security."(p. 52)

Other risks from Vidgen et al. (2024) (46)