Data Privacy

Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile

National Institute of Standards and Technology (2024)

Category
Risk Domain

AI systems that memorize and leak sensitive personal data, or that infer private information about individuals without their consent. Unexpected or unauthorized sharing of data and information can compromise users' expectations of privacy, facilitate identity theft, or cause loss of confidential intellectual property.

"Impacts due to leakage and unauthorized use, disclosure, or de-anonymization of biometric, health, location, or other personally identifiable information or sensitive data." (p. 4)

Supporting Evidence (2)

1.
"GAI systems raise several risks to privacy. GAI system training requires large volumes of data, which in some cases may include personal data. The use of personal data for GAI training raises risks to widely accepted privacy principles, including to transparency, individual participation (including consent), and purpose specification. For example, most model developers do not disclose specific data sources on which models were trained, limiting user awareness of whether personally identifiable information (PII) was trained on and, if so, how it was collected." (p. 7)
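The transparency concern above is that users cannot tell whether PII entered a training corpus. A minimal sketch of one mitigation, pre-training PII scanning, is shown below; the regex patterns and the `scan_for_pii` helper are illustrative assumptions, and real audits cover far more PII types (names, addresses, national IDs) with purpose-built tools.

```python
import re

# Illustrative patterns for two common PII types (assumption: this toy
# coverage is nowhere near sufficient for a production audit).
PII_PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    "us_phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}

def scan_for_pii(text):
    """Return a dict mapping PII type to the verbatim matches found in text."""
    hits = {}
    for label, pattern in PII_PATTERNS.items():
        found = pattern.findall(text)
        if found:
            hits[label] = found
    return hits

sample = "Contact jane.doe@example.com or call 555-867-5309."
print(scan_for_pii(sample))
# → {'email': ['jane.doe@example.com'], 'us_phone': ['555-867-5309']}
```

Running such a scan over candidate training documents gives a rough, documentable record of what PII categories were present, supporting the transparency and purpose-specification principles the quote names.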
2.
"Models may leak, generate, or correctly infer sensitive information about individuals. For example, during adversarial attacks, LLMs have revealed sensitive information (from the public domain) that was included in their training data. This problem has been referred to as data memorization, and may pose exacerbated privacy risks even for data present only in a small number of training samples. In addition to revealing sensitive information in GAI training data, GAI models may be able to correctly infer PII or sensitive data that was not in their training data nor disclosed by the user by stitching together information from disparate sources. These inferences can have negative impact on an individual even if the inferences are not accurate (e.g., confabulations), and especially if they reveal information that the individual considers sensitive or that is used to disadvantage or harm them." (p. 7)
