Skip to main content
BackGeneral Evaluations (Limited coverage of capabilities evaluations)
Home/Risks/Gipiškis2024/General Evaluations (Limited coverage of capabilities evaluations)

General Evaluations (Limited coverage of capabilities evaluations)

Sub-category
Risk Domain

Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately.

"GPAI model developers might run capabilities evaluations to determine whether it has dangerous or dual-use capabilities, and then decide whether it is safe to deploy. Such capabilities evaluations can fail to demonstrate all the capabilities of a model. For example, evaluations may miss certain capabilities that are difficult to assess, prohibitively costly to verify, or obscured by the model’s tendency to refuse responses due to safety training, even if it possesses some of these capabilities."(p. 16)

Other risks from Gipiškis2024 (144)