Foundational safety research, theoretical understanding, and scientific inquiry informing AI development.
Stage: Containment and Mitigation
Stakeholder: AI Developers
Additional information: AI developers and other stakeholders should further explore and advance research on containment methods. Existing research shows that current containment efforts face limitations, especially for self-replicating AI (Clymer, Wijk & Barnes 2024; Salib 2025; Pan et al. 2024). Investment should be directed toward containment technologies that can shut off models, restrict capabilities, limit harmful or unintended actions, and ensure retention of human control. This may also include research that uses AI models for containment and explores techniques such as sandboxing, model distillation and layered defence strategies.
Reasoning
Foundational research advances understanding of containment and layered-defence techniques that inform AI development.
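To make the layered-defence idea concrete, the following is a minimal sketch of a containment gate for model-issued actions: a capability allowlist, a content filter, a rate-based escalation to human review, and a kill switch. All names (ActionGate, ALLOWED_TOOLS, the thresholds) are illustrative assumptions, not part of any existing containment framework.

```python
# Illustrative layered-defence containment gate for model-issued actions.
# Every layer must approve an action before it is allowed through.

ALLOWED_TOOLS = {"search", "calculator"}            # layer 1: capability allowlist
BLOCKED_PATTERNS = ("rm -rf", "ssh ", "curl ")      # layer 2: content filter
REVIEW_THRESHOLD = 3                                # layer 3: escalate after N actions

class ActionGate:
    """Passes a model action only if every defensive layer approves it."""

    def __init__(self):
        self.actions_this_session = 0
        self.halted = False

    def halt(self):
        """Kill switch: once triggered, no further actions are allowed."""
        self.halted = True

    def check(self, tool: str, argument: str) -> str:
        if self.halted:
            return "halted"
        if tool not in ALLOWED_TOOLS:                          # layer 1
            return "blocked: tool not allowlisted"
        if any(p in argument for p in BLOCKED_PATTERNS):       # layer 2
            return "blocked: suspicious content"
        self.actions_this_session += 1
        if self.actions_this_session > REVIEW_THRESHOLD:       # layer 3
            return "escalated: human review required"
        return "allowed"
```

The point of the layering is that no single check is load-bearing: a request that slips past the allowlist can still be stopped by the content filter, the review threshold, or the human-operated kill switch.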
Monitor critical capability levels (2.2.2 Testing & Evaluation)
Identify early warning signs and emergent capabilities (2.2.1 Risk Assessment)
Establish standardised benchmarks and reporting (3.2.1 Benchmarks & Evaluation)
Implement compute monitoring and anomaly detection (1.2.3 Monitoring & Detection)
Enhance hardware and supply chain oversight (2.3.3 Monitoring & Logging)
Lead efforts to establish shared criteria for AI LOC (3.2.2 Technical Standards)

Strengthening Emergency Preparedness and Response for AI Loss of Control Incidents
Somani, Elika; Friedman, Anjay; Wu, Henry; Lu, Marianne; Byrd, Christopher; van Soest, Henri; Zakaria, Sana (2025)
As artificial intelligence (AI) systems become increasingly embedded in essential infrastructure and services, the risks associated with unintended failures rise. Developing comprehensive emergency response protocols could help mitigate these significant risks. This report focuses on understanding and addressing AI loss of control (LOC) scenarios where human oversight fails to adequately constrain an autonomous, general-purpose AI.
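The compute monitoring and anomaly detection mentioned above could, in its simplest form, look like the sketch below: flag a compute-usage reading whose z-score against a trailing window exceeds a threshold, as an early-warning signal for unexpected activity. The class name, window size, and threshold are illustrative assumptions.

```python
# Hypothetical sketch of compute-usage anomaly detection: flag a reading
# that deviates sharply from the trailing window of recent readings.
from collections import deque
import math

class ComputeMonitor:
    def __init__(self, window: int = 10, z_threshold: float = 3.0):
        self.readings = deque(maxlen=window)   # trailing window of readings
        self.z_threshold = z_threshold

    def observe(self, gpu_hours: float) -> bool:
        """Record a usage reading; return True if it looks anomalous."""
        anomalous = False
        if len(self.readings) >= 3:  # need a minimal baseline first
            mean = sum(self.readings) / len(self.readings)
            var = sum((x - mean) ** 2 for x in self.readings) / len(self.readings)
            std = math.sqrt(var)
            if std > 0 and abs(gpu_hours - mean) / std > self.z_threshold:
                anomalous = True
        self.readings.append(gpu_hours)
        return anomalous
```

A real deployment would monitor richer signals (chip telemetry, cluster scheduling logs, network egress) and feed alerts into the kind of emergency-response protocols the report above proposes; the z-score test stands in for whatever detector is actually used.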
Other (outside lifecycle): Outside the standard AI system lifecycle
Developer: Entity that creates, trains, or modifies the AI system
Unable to classify: Could not be classified to a specific AIRM function