This page is still being polished. If you have thoughts, please share them via the feedback form.
Data on this page is preliminary and may change. Please do not share or cite these figures publicly.
Runtime monitoring, observability, performance tracking, and anomaly detection in production.
Also in Operations & Security
AGI labs should closely monitor deployed systems, including how they are used and what impact they have on society.
Reasoning
Organization monitors deployed systems' runtime behavior and societal impacts post-deployment.
Alignment Techniques
AGI labs should implement state-of-the-art safety and alignment techniques.
1.1 ModelAPI access to powerful models
AGI labs should strongly consider only deploying powerful models via an application programming interface (API).
2.3.1 Deployment ManagementAvoid Capabilities Jumps
AGI labs should not deploy models that are much more capable than any existing models.*
2.3.1 Deployment ManagementAvoiding Hype
AGI labs should avoid releasing powerful models in a way that is likely to create hype around AGI (e.g. by overstating results or announcing them in attention-grabbing ways).
2.1.3 Policies & ProceduresBackground checks
AGI labs should perform rigorous background checks before hiring/appointing members of the board of directors, senior executives, and key employees.*
2.3.2 Access & Security ControlsBoard risk committee
AGI labs should have a board risk committee, i.e. a permanent committee within the board of directors which oversees the lab’s risk management practices.*
2.1.1 Leadership OversightTowards best practices in AGI safety and governance: A survey of expert opinion
Schuett, Jonas; Dreksler, Noemi; Anderljung, Markus; McCaffary, David; Heim, Lennart; Bluemke, Emma; Garfinkel, Ben (2023)
A number of leading AI companies, including OpenAI, Google DeepMind, and Anthropic, have the stated goal of building artificial general intelligence (AGI) - AI systems that achieve or exceed human performance across a wide range of cognitive tasks. In pursuing this goal, they may develop and deploy AI systems that pose particularly significant risks. While they have already taken some measures to mitigate these risks, best practices have not yet emerged. To support the identification of best practices, we sent a survey to 92 leading experts from AGI labs, academia, and civil society and received 51 responses. Participants were asked how much they agreed with 50 statements about what AGI labs should do. Our main finding is that participants, on average, agreed with all of them. Many statements received extremely high levels of agreement. For example, 98% of respondents somewhat or strongly agreed that AGI labs should conduct pre-deployment risk assessments, dangerous capabilities evaluations, third-party model audits, safety restrictions on model usage, and red teaming. Ultimately, our list of statements may serve as a helpful foundation for efforts to develop best practices, standards, and regulations for AGI labs.
Operate and Monitor
Running, maintaining, and monitoring the AI system post-deployment
Deployer
Entity that integrates and deploys the AI system for end users
Measure
Quantifying, testing, and monitoring identified AI risks
Primary
4 Malicious Actors & MisuseOther