Establishes a framework for managing and mitigating risks from frontier AI models at Meta, focusing on potential catastrophic outcomes. It involves threat modeling, risk assessment, and evaluations to determine access to, and further development of, such models.
Analysis summaries, actor details, and coverage mappings were LLM-classified and may contain errors.
This is an internal corporate policy document establishing Meta's own framework for managing frontier AI risks. It contains voluntary commitments and internal governance processes rather than legally binding obligations with external enforcement.
The document has good coverage of 11 subdomains, with a strong focus on malicious actors (4.1, 4.2, 4.3), AI system security (2.2), competitive dynamics (6.4), governance failure (6.5), and AI safety failures (7.1, 7.2, 7.3). Coverage is concentrated in the security, misuse-prevention, and AI safety domains, with particular emphasis on catastrophic risks from frontier AI models.
This is an internal corporate policy document that governs Meta's own operations as an AI developer and information services company. The primary sectors governed are Information (where Meta operates) and Scientific Research and Development Services (Meta's FAIR lab). The document does not regulate external sectors; rather, it establishes internal governance for Meta's frontier AI development activities.
The document comprehensively covers all stages of the AI lifecycle with particular emphasis on evaluation, deployment, and monitoring. It describes detailed processes for planning (threat modeling, reference class identification), evaluation and mitigation throughout development, deployment decisions based on risk thresholds, and post-deployment monitoring.
The document explicitly focuses on frontier AI models and systems, defined as highly capable general-purpose generative AI models. It does not use the terms 'foundation models' or 'GPAI', but its description aligns with those concepts. The framework addresses open-weight releases and incorporates compute considerations implicitly, via capability-based thresholds rather than explicit FLOP thresholds.
Meta
Meta is the author and proposer of this framework, as evidenced by references to 'our Frontier AI Framework' and 'our wider AI governance program' throughout the document. Meta is developing frontier AI models and establishing internal governance processes.
Meta leadership teams; Multi-disciplinary review teams
Enforcement is conducted internally by Meta's leadership and multi-disciplinary teams who review risk assessments and make decisions about model development and release. There is no external enforcement body.
Meta internal teams; External subject matter experts
Monitoring is conducted primarily by Meta's internal teams throughout the development lifecycle, with external experts involved in specific evaluations and threat modeling exercises. The framework also references engagement with governments and the wider AI community.
Meta's internal research teams; Meta's product teams; FAIR Lab
The framework applies to Meta's own frontier AI development activities, spanning both research and product teams, and governs internal processes for developing, evaluating, and releasing frontier AI models.
11 subdomains (7 Good, 4 Minimal)