Runtime monitoring, observability, performance tracking, and anomaly detection in production.
Stage: Detection
Stakeholder: National Government: Other Agencies
Additional information: Compute providers, national security agencies, and cybersecurity professionals could be trained to recognise LOC indicators and monitor developments in AI capabilities. Cloud providers could incorporate real-time compute monitoring and verification to flag high-risk users. Enhancing the information flow between AI developers, compute providers, and governments on AI R&D would also improve detection. Governments should consider requiring developers to track and report key metrics, such as compute usage for AI R&D, as well as to disclose extreme capabilities to AISIs (Mikton 2024).
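As a rough illustration of the real-time compute monitoring described above, the sketch below shows one way a cloud provider might flag users whose compute usage deviates sharply from their own baseline. Everything here (the UsageRecord shape, the flag_anomalous_users helper, the thresholds) is a hypothetical illustration, not a description of any provider's actual system.

```python
from dataclasses import dataclass
from statistics import mean, stdev

@dataclass
class UsageRecord:
    """Hypothetical per-user compute usage for one reporting window."""
    user_id: str
    gpu_hours: float

def flag_anomalous_users(history: dict[str, list[float]],
                         current: list[UsageRecord],
                         z_threshold: float = 3.0,
                         absolute_cap: float = 10_000.0) -> list[str]:
    """Flag users whose current usage looks anomalous.

    A user is flagged if their usage exceeds an absolute reporting cap,
    or sits more than `z_threshold` standard deviations above their own
    historical baseline. Both thresholds are illustrative placeholders.
    """
    flagged = []
    for record in current:
        past = history.get(record.user_id, [])
        if record.gpu_hours > absolute_cap:
            flagged.append(record.user_id)
        elif len(past) >= 2:
            mu, sigma = mean(past), stdev(past)
            if sigma > 0 and (record.gpu_hours - mu) / sigma > z_threshold:
                flagged.append(record.user_id)
    return flagged

# A user jumping from ~100 to 5,000 GPU-hours in one window gets flagged.
history = {"user-a": [90.0, 110.0, 105.0], "user-b": [400.0, 420.0, 390.0]}
current = [UsageRecord("user-a", 5_000.0), UsageRecord("user-b", 410.0)]
print(flag_anomalous_users(history, current))  # ['user-a']
```

In practice a provider would combine usage volume with other signals (workload type, verified identity, reporting obligations), but the per-user baseline comparison captures the core idea of flagging high-risk users.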
Reasoning
National government monitors AI-related cyber incidents at runtime to detect threats and anomalies.
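To make the runtime-detection idea concrete, here is a minimal sketch of anomaly detection over a stream of incident counts, using an exponentially weighted moving average as the baseline. The class name, parameters, and alert rule are hypothetical, chosen only to illustrate the technique.

```python
class IncidentRateMonitor:
    """Hypothetical runtime monitor for AI-related incident counts.

    Tracks an exponentially weighted moving average (EWMA) of incidents
    per interval and alerts when a new observation exceeds the smoothed
    baseline by a fixed multiple.
    """

    def __init__(self, alpha: float = 0.2, multiplier: float = 3.0):
        self.alpha = alpha            # weight given to each new observation
        self.multiplier = multiplier  # alert when count > multiplier * baseline
        self.baseline: float | None = None

    def observe(self, count: int) -> bool:
        """Ingest one interval's incident count; return True if anomalous."""
        if self.baseline is None:
            self.baseline = float(count)
            return False
        anomalous = count > self.multiplier * max(self.baseline, 1.0)
        # Only fold non-anomalous observations into the baseline, so a
        # genuine spike does not inflate the expected rate.
        if not anomalous:
            self.baseline = self.alpha * count + (1 - self.alpha) * self.baseline
        return anomalous

monitor = IncidentRateMonitor()
for count in [4, 5, 3, 6, 40, 5]:
    if monitor.observe(count):
        print(f"alert: {count} incidents this interval")  # fires at 40
```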
Related measures
Monitor critical capability levels (2.2.2 Testing & Evaluation)
Identify early warning signs and emergent capabilities (2.2.1 Risk Assessment)
Establish standardised benchmarks and reporting (3.2.1 Benchmarks & Evaluation)
Implement compute monitoring and anomaly detection (1.2.3 Monitoring & Detection)
Enhance hardware and supply chain oversight (2.3.3 Monitoring & Logging)
Lead efforts to establish shared criteria for AI LOC (3.2.2 Technical Standards)
Source
Strengthening Emergency Preparedness and Response for AI Loss of Control Incidents
Somani, Elika; Friedman, Anjay; Wu, Henry; Lu, Marianne; Byrd, Christopher; van Soest, Henri; Zakaria, Sana (2025)
As artificial intelligence (AI) systems become increasingly embedded in essential infrastructure and services, the risks associated with unintended failures rise. Developing comprehensive emergency response protocols could help mitigate these significant risks. This report focuses on understanding and addressing AI loss of control (LOC) scenarios where human oversight fails to adequately constrain an autonomous, general-purpose AI.
Other (multiple stages): Applies across multiple lifecycle stages
Governance Actor: Regulator, standards body, or oversight entity shaping AI policy
Measure: Quantifying, testing, and monitoring identified AI risks