CBRN Information or Capabilities

BackConfabulation

Dangerous, Violent or Hateful Content

Home/Risks/National Institute of Standards and Technology (2024)/Confabulation

CBRN Information or Capabilities

Dangerous, Violent or Hateful Content

Home/Risks/National Institute of Standards and Technology (2024)/Confabulation

CBRN Information or Capabilities

Dangerous, Violent or Hateful Content

Confabulation

Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile

National Institute of Standards and Technology (2024)

Category

Risk Domain

3Misinformation

3.1False or misleading information

AI systems that inadvertently generate or spread incorrect or deceptive information, which can lead to inaccurate beliefs in users and undermine their autonomy. Humans that make decisions based on false beliefs can experience physical, emotional or material harms

"The production of confidently stated but erroneous or false content (known colloquially as “hallucinations” or “fabrications”) by which users may be misled or deceived."(p. 4)

Entity— Who or what caused the harm

Human

Due to a decision or action made by humans

AI system

Due to a decision or action made by an AI system

Other

Due to some other reason or is ambiguous

Intent— Whether the harm was intentional or accidental

Intentional

Due to an expected outcome from pursuing a goal

Unintentional

Due to an unexpected outcome from pursuing a goal

Other

Without clearly specifying the intentionality

Timing— Whether the risk is pre- or post-deployment

Pre-deployment

Occurring before the AI is deployed

Post-deployment

Occurring after the AI model has been trained and deployed

Other

Without a clearly specified time of occurrence

Supporting Evidence (2)

1.

"“Confabulation” refers to a phenomenon in which GAI systems generate and confidently present erroneous or false content in response to prompts. Confabulations also include generated outputs that diverge from the prompts or other input or that contradict previously generated statements in the same context. These phenomena are colloquially also referred to as “hallucinations” or “fabrications.”"(p. 6)

2.

"Risks from confabulations may arise when users believe false content – often due to the confident nature of the response – leading users to act upon or promote the false information. This poses a challenge for many real-world applications, such as in healthcare, where a confabulated summary of patient information reports could cause doctors to make incorrect diagnoses and/or recommend the wrong treatments. Risks of confabulated content may be especially important to monitor when integrating GAI into applications involving consequential decision making."(p. 6)

Other risks from National Institute of Standards and Technology (2024) (11)

CBRN Information or Capabilities

4.2 Cyberattacks, weapon development or use, and mass harm

OtherOtherPost-deployment

Dangerous, Violent or Hateful Content

1.2 Exposure to toxic content

AI systemOtherPost-deployment

Data Privacy

2.1 Compromise of privacy by leaking or correctly inferring sensitive information

AI systemUnintentionalPost-deployment

Environmental Impacts

6.6 Environmental harm

OtherUnintentionalPre-deployment

Harmful Bias or Homogenization

1.1 Unfair discrimination and misrepresentation

OtherUnintentionalOther

Human-AI Configuration

5.1 Overreliance and unsafe use

OtherUnintentionalPost-deployment

View all 11 risks from this paper →