Runtime behavior observation, anomaly detection, and activity logging.
Also in Non-Model
Stage: Detection; Stakeholder: Compute Providers; Additional information: Early detection could also be improved by robust real-time monitoring tools that log outputs, decisions, and compute usage to detect potential anomalies (Kaur et al. 2023; Greenblatt, Shlegeris et al. 2024). Governments should enhance awareness and information sharing among all stakeholders, including the tracking of compute resources.
Reasoning
An anomaly detection system monitors compute activity for suspicious behavior patterns.
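As an illustrative sketch only (not taken from the cited sources), such a monitor could flag compute-usage log entries that deviate strongly from a baseline. The event schema, field names, and z-score threshold below are all assumptions chosen for the example:

```python
from dataclasses import dataclass
from statistics import mean, stdev

@dataclass
class ComputeEvent:
    """One logged compute-usage record (hypothetical schema)."""
    timestamp: float   # seconds since epoch
    gpu_hours: float   # compute consumed in this interval

def flag_anomalies(events: list[ComputeEvent], z_threshold: float = 3.0) -> list[ComputeEvent]:
    """Return events whose usage is more than z_threshold standard
    deviations from the mean of the batch (simple z-score rule)."""
    usage = [e.gpu_hours for e in events]
    mu, sigma = mean(usage), stdev(usage)
    if sigma == 0:
        return []  # no variation, nothing stands out
    return [e for e in events if abs(e.gpu_hours - mu) / sigma > z_threshold]
```

For example, ten routine entries of 1.0 GPU-hour plus one spike of 100 GPU-hours would leave only the spike flagged. A production system would instead use rolling baselines per entity and more robust statistics, but the principle, comparing logged usage against expected behavior, is the same.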
Monitor critical capability levels (2.2.2 Testing & Evaluation)
Identify early warning signs and emergent capabilities (2.2.1 Risk Assessment)
Establish standardised benchmarks and reporting (3.2.1 Benchmarks & Evaluation)
Enhance hardware and supply chain oversight (2.3.3 Monitoring & Logging)
Lead efforts to establish shared criteria for AI LOC (3.2.2 Technical Standards)
Coordinate evaluations and safety testing (2.2.2 Testing & Evaluation)

Strengthening Emergency Preparedness and Response for AI Loss of Control Incidents
Somani, Elika; Friedman, Anjay; Wu, Henry; Lu, Marianne; Byrd, Christopher; van Soest, Henri; Zakaria, Sana (2025)
As artificial intelligence (AI) systems become increasingly embedded in essential infrastructure and services, the risks associated with unintended failures grow. Developing comprehensive emergency response protocols could help mitigate these risks. This report focuses on understanding and addressing AI loss of control (LOC) scenarios, in which human oversight fails to adequately constrain an autonomous, general-purpose AI.
Other (multiple stages)
Applies across multiple lifecycle stages
Infrastructure Provider
Entity providing compute, platforms, or tooling for AI systems
Measure
Quantifying, testing, and monitoring identified AI risks