Shared evaluation datasets, testing frameworks, and measurement tools for AI systems.
Also in Shared Infrastructure
Stage: Detection
Stakeholder: Third Party Researchers
Additional information: AI developers and researchers should refine detection by developing standardised benchmarks and improving their reliability and validity. Developers should enhance detection of control-undermining capabilities.
Reasoning
Develops shared evaluation benchmarks and techniques for ecosystem-wide adoption by researchers.
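To make the idea of ecosystem-wide evaluation infrastructure concrete, the sketch below shows one plausible shape for a shared benchmark harness: a common item format, a model-agnostic interface, and a standardised report. Every name in it (the item schema, `evaluate`, the toy model) is an illustrative assumption, not an existing tool or API.

```python
# Hypothetical sketch of a shared benchmark harness.
# All names here are illustrative, not an existing tool or API.
from typing import Callable, Dict, List

# A benchmark item pairs a prompt with a reference answer.
BENCHMARK: List[Dict[str, str]] = [
    {"id": "item-1", "prompt": "2 + 2 = ?", "reference": "4"},
    {"id": "item-2", "prompt": "Capital of France?", "reference": "Paris"},
]

def evaluate(model: Callable[[str], str], items: List[Dict[str, str]]) -> Dict[str, object]:
    """Run a model over benchmark items and return a standardised report."""
    results = []
    for item in items:
        prediction = model(item["prompt"])
        results.append({
            "id": item["id"],
            "correct": prediction.strip().lower() == item["reference"].strip().lower(),
        })
    accuracy = sum(r["correct"] for r in results) / len(results)
    return {"n_items": len(results), "accuracy": accuracy, "per_item": results}

if __name__ == "__main__":
    # Toy stand-in for a real model; any callable with the same signature works.
    toy_model = lambda prompt: "4" if "2 + 2" in prompt else "Paris"
    print(evaluate(toy_model, BENCHMARK))
```

Because the item format and report schema are shared, developers and third-party researchers could run the same benchmark against different systems and compare results directly, which is the kind of reliability and validity improvement the detection recommendation points to.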
Monitor critical capability levels (2.2.2 Testing & Evaluation)
Identify early warning signs and emergent capabilities (2.2.1 Risk Assessment)
Establish standardised benchmarks and reporting (3.2.1 Benchmarks & Evaluation)
Implement compute monitoring and anomaly detection (1.2.3 Monitoring & Detection)
Enhance hardware and supply chain oversight (2.3.3 Monitoring & Logging)
Lead efforts to establish shared criteria for AI LOC (3.2.2 Technical Standards)
Strengthening Emergency Preparedness and Response for AI Loss of Control Incidents
Somani, Elika; Friedman, Anjay; Wu, Henry; Lu, Marianne; Byrd, Christopher; van Soest, Henri; Zakaria, Sana (2025)
As artificial intelligence (AI) systems become increasingly embedded in essential infrastructure and services, the risks associated with unintended failures rise. Developing comprehensive emergency response protocols could help mitigate these significant risks. This report focuses on understanding and addressing AI loss of control (LOC) scenarios where human oversight fails to adequately constrain an autonomous, general-purpose AI.
Other (outside lifecycle)
Outside the standard AI system lifecycle
Other
Actor type not captured by the standard categories
Measure
Quantifying, testing, and monitoring identified AI risks