Safety cases, assurance plans, and documented evidence of safety claims.
Reasoning
Evaluating and documenting AI security and resilience is a way of compiling evidence that supports safety claims and demonstrates deployment readiness.
2.2.1 Risk Assessment: Apply established security measures to assess the likelihood and magnitude of vulnerabilities and threats such as backdoors, compromised dependencies, data breaches, eavesdropping, man-in-the-middle attacks, reverse engineering, autonomous agents, model theft or exposure of model weights, AI inference, bypass, extraction, and other baseline security concerns.
2.2.2 Testing & Evaluation: Benchmark GAI system security and resilience related to content provenance against industry standards and best practices. Compare GAI system security features and content provenance methods against the industry state of the art.
2.2.1 Risk Assessment: Conduct user surveys to gauge user satisfaction with AI-generated content and user perceptions of content authenticity. Analyze user feedback to identify concerns and current literacy levels related to content provenance and understanding of labels on content.
2.3.3 Monitoring & Logging: Identify metrics that reflect the effectiveness of security measures, such as data provenance, the number of unauthorized access attempts, inference, bypass, extraction, penetrations, or provenance verification.
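To make the metrics item above concrete, here is a minimal sketch of deriving such counts from an audit log. The event schema and names (`unauthorized_access_attempt`, `provenance_verification`) are illustrative assumptions, not part of the NIST profile; a real deployment would pull from its own logging pipeline.

```python
"""Sketch: deriving security-effectiveness metrics from an audit log."""

from collections import Counter

# Hypothetical audit-log events, one dict per logged security event.
events = [
    {"type": "unauthorized_access_attempt", "endpoint": "/v1/weights"},
    {"type": "provenance_verification", "result": "pass"},
    {"type": "extraction_attempt", "endpoint": "/v1/generate"},
    {"type": "provenance_verification", "result": "fail"},
]

def security_metrics(events):
    """Count occurrences of each tracked event type over a reporting window."""
    counts = Counter(e["type"] for e in events)
    verifications = [e for e in events if e["type"] == "provenance_verification"]
    passed = sum(1 for e in verifications if e["result"] == "pass")
    return {
        "unauthorized_access_attempts": counts["unauthorized_access_attempt"],
        "extraction_attempts": counts["extraction_attempt"],
        "provenance_verification_pass_rate": passed / len(verifications)
        if verifications else None,
    }

print(security_metrics(events))
```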
2.3.3 Monitoring & LoggingMeasure reliability of content authentication methods, such as watermarking, cryptographic signatures, digital fingerprints, as well as access controls, conformity assessment, and model integrity verification, which can help support the effective implementation of content provenance techniques. Evaluate the rate of false positives and false negatives in content provenance, as well as true positives and true negatives for verification.
2.2.2 Testing & EvaluationMeasure the rate at which recommendations from security checks and incidents are implemented. Assess how quickly the AI system can adapt and improve based on lessons learned from security incidents and feedback
2.3.3 Monitoring & LoggingPerform AI red-teaming to assess resilience against: Abuse to facilitate attacks on other systems (e.g., malicious code generation, enhanced phishing content), GAI attacks (e.g., prompt injection), ML attacks (e.g., adversarial examples/prompts, data poisoning, membership inference, model extraction, sponge examples).
2.2.2 Testing & EvaluationVerify fine-tuning does not compromise safety and security controls.
2.2.2 Testing & EvaluationRegularly assess and verify that security measures remain effective and have not been compromised.
2.2.2 Testing & EvaluationLegal and regulatory requirements involving AI are understood, managed, and documented.
2.1.3 Policies & ProceduresLegal and regulatory requirements involving AI are understood, managed, and documented. > Align GAI development and use with applicable laws and regulations, including those related to data privacy, copyright and intellectual property law.
2.1.3 Policies & ProceduresThe characteristics of trustworthy AI are integrated into organizational policies, processes, procedures, and practices.
2.1.3 Policies & ProceduresThe characteristics of trustworthy AI are integrated into organizational policies, processes, procedures, and practices. > Establish transparency policies and processes for documenting the origin and history of training data and generated data for GAI applications to advance digital content transparency, while balancing the proprietary nature of training approaches.
2.1.3 Policies & ProceduresThe characteristics of trustworthy AI are integrated into organizational policies, processes, procedures, and practices. > Establish policies to evaluate risk-relevant capabilities of GAI and robustness of safety measures, both prior to deployment and on an ongoing basis, through internal and external evaluations.
2.1.3 Policies & ProceduresProcesses, procedures, and practices are in place to determine the needed level of risk management activities based on the organization’s risk tolerance.
2.1.3 Policies & ProceduresArtificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile (NIST AI 600-1)
US National Institute of Standards and Technology (NIST) (2024)
This document is a cross-sectoral profile of, and companion resource for, the AI Risk Management Framework (AI RMF 1.0) for Generative AI, pursuant to President Biden's Executive Order (EO) 14110 on Safe, Secure, and Trustworthy Artificial Intelligence. The AI RMF was released in January 2023 and is intended for voluntary use and to improve the ability of organizations to incorporate trustworthiness considerations into the design, development, use, and evaluation of AI products, services, and systems.
Verify and Validate: Testing, evaluating, auditing, and red-teaming the AI system.
Deployer: Entity that integrates and deploys the AI system for end users.
Measure: Quantifying, testing, and monitoring identified AI risks.