General Evaluations (Incorrect outputs o…

BackGeneral Evaluations (Limited coverage of capabilities evaluations)

General Evaluations (Difficulty of ident…

Home/Risks/Gipiškis2024/General Evaluations (Limited coverage of capabilities evaluations)

General Evaluations (Incorrect outputs o…

General Evaluations (Difficulty of ident…

Home/Risks/Gipiškis2024/General Evaluations (Limited coverage of capabilities evaluations)

General Evaluations (Incorrect outputs o…

General Evaluations (Difficulty of ident…

General Evaluations (Limited coverage of capabilities evaluations)

Sub-category

Risk Domain

6Socioeconomic & Environmental

6.5Governance failure

Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately.

"GPAI model developers might run capabilities evaluations to determine whether it has dangerous or dual-use capabilities, and then decide whether it is safe to deploy. Such capabilities evaluations can fail to demonstrate all the capabilities of a model. For example, evaluations may miss certain capabilities that are difficult to assess, prohibitively costly to verify, or obscured by the model’s tendency to refuse responses due to safety training, even if it possesses some of these capabilities."(p. 16)

Entity— Who or what caused the harm

Human

Due to a decision or action made by humans

AI system

Due to a decision or action made by an AI system

Other

Due to some other reason or is ambiguous

Intent— Whether the harm was intentional or accidental

Intentional

Due to an expected outcome from pursuing a goal

Unintentional

Due to an unexpected outcome from pursuing a goal