BackMonitor systems and their uses

This page is still being polished. If you have thoughts, please share them via the feedback form.

Data on this page is preliminary and may change. Please do not share or cite these figures publicly.

Monitor systems and their uses

Schuett (2023)|LLM classified

Mitigation Taxonomy

2Organisation

2.3Operations & Security

2.3.3Monitoring & Logging

Runtime monitoring, observability, performance tracking, and anomaly detection in production.

Also in Operations & Security

2.3.1 Deployment Management2.3.2 Access & Security Controls2.3.4 Incident Response

Definitionp. 18-20

AGI labs should closely monitor deployed systems, including how they are used and what impact they have on society.

LLM Classification Details

Reasoning

Organization monitors deployed systems' runtime behavior and societal impacts post-deployment.

Code: 2.3.3Version: v0.5Classified: Jan 22, 2026

Other mitigations from Schuett (2023) (49)

Alignment Techniques

AGI labs should implement state-of-the-art safety and alignment techniques.

1.1 Model

Lifecycle:Other (multiple stages)Actor:DeveloperAIRM:Unable to classify

API access to powerful models

AGI labs should strongly consider only deploying powerful models via an application programming interface (API).

2.3.1 Deployment Management

Lifecycle:DeployActor:DeployerAIRM:Manage

Avoid Capabilities Jumps

AGI labs should not deploy models that are much more capable than any existing models.*

2.3.1 Deployment Management

Lifecycle:Other (multiple stages)Actor:DeployerAIRM:Manage

Avoiding Hype

AGI labs should avoid releasing powerful models in a way that is likely to create hype around AGI (e.g. by overstating results or announcing them in attention-grabbing ways).

2.1.3 Policies & Procedures

Lifecycle:DeployActor:DeployerAIRM:Manage

Background checks

AGI labs should perform rigorous background checks before hiring/appointing members of the board of directors, senior executives, and key employees.*

2.3.2 Access & Security Controls

Lifecycle:Other (outside lifecycle)Actor:DeveloperAIRM:Govern

Board risk committee

AGI labs should have a board risk committee, i.e. a permanent committee within the board of directors which oversees the lab’s risk management practices.*

2.1.1 Leadership Oversight

Lifecycle:Other (outside lifecycle)Actor:DeveloperAIRM:Govern

View all 49 mitigations from this source →

Source Document

Towards best practices in AGI safety and governance: A survey of expert opinion

Schuett, Jonas; Dreksler, Noemi; Anderljung, Markus; McCaffary, David; Heim, Lennart; Bluemke, Emma; Garfinkel, Ben (2023)

A number of leading AI companies, including OpenAI, Google DeepMind, and Anthropic, have the stated goal of building artificial general intelligence (AGI) - AI systems that achieve or exceed human performance across a wide range of cognitive tasks. In pursuing this goal, they may develop and deploy AI systems that pose particularly significant risks. While they have already taken some measures to mitigate these risks, best practices have not yet emerged. To support the identification of best practices, we sent a survey to 92 leading experts from AGI labs, academia, and civil society and received 51 responses. Participants were asked how much they agreed with 50 statements about what AGI labs should do. Our main finding is that participants, on average, agreed with all of them. Many statements received extremely high levels of agreement. For example, 98% of respondents somewhat or strongly agreed that AGI labs should conduct pre-deployment risk assessments, dangerous capabilities evaluations, third-party model audits, safety restrictions on model usage, and red teaming. Ultimately, our list of statements may serve as a helpful foundation for efforts to develop best practices, standards, and regulations for AGI labs.

View source DOI: 10.48550/arXiv.2305.07153

Classification

AI Lifecycle Stage

Operate and Monitor

Running, maintaining, and monitoring the AI system post-deployment

Responsible Actor

Deployer

Entity that integrates and deploys the AI system for end users

Developer

NIST AI RMF Function

Measure

Quantifying, testing, and monitoring identified AI risks

Risk Domains

Primary

4 Malicious Actors & Misuse

Other

6 Socioeconomic & Environmental