Runtime monitoring, observability, performance tracking, and anomaly detection in production.
Organizational monitoring of pre-trained models during regular maintenance and system operations post-deployment.
Apply explainable AI (XAI) techniques (e.g., analysis of embeddings, model compression/distillation, gradient-based attributions, occlusion/term reduction, counterfactual prompts, word clouds) as part of ongoing continuous improvement processes to mitigate risks related to unexplainable GAI systems. [2.3.3 Monitoring & Logging]
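As a minimal sketch of one technique named above (occlusion/term reduction): remove one input token at a time and measure the change in some scalar output score. `score_fn` is an assumed stand-in for whatever signal your system exposes (e.g., a moderation score or the log-probability of a reference answer); it is not a real library call.

```python
from typing import Callable, List, Tuple

def occlusion_attribution(tokens: List[str],
                          score_fn: Callable[[str], float]) -> List[Tuple[str, float]]:
    """Estimate each token's influence by deleting it and re-scoring."""
    base = score_fn(" ".join(tokens))
    attributions = []
    for i in range(len(tokens)):
        reduced = tokens[:i] + tokens[i + 1:]       # occlude token i
        delta = base - score_fn(" ".join(reduced))  # drop in score without it
        attributions.append((tokens[i], delta))
    return attributions

# Usage with a toy scoring function standing in for a real model signal:
toy_score = lambda text: float(len(text))           # placeholder only
print(occlusion_attribution("why did the filter fire".split(), toy_score))
```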
Document how pre-trained models have been adapted (e.g., fine-tuned, or retrieval-augmented generation) for the specific generative task, including any data augmentations, parameter adjustments, or other modifications. Access to un-tuned (baseline) models supports debugging the relative influence of the pre-trained weights compared to the fine-tuned model weights or other system updates. [2.2.4 Assurance Documentation]
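As an illustration of the debugging point above, one hedged approach is to run the baseline and adapted models over a fixed probe set and rank prompts by how much the outputs diverge. `baseline_generate` and `tuned_generate` are hypothetical hooks into your own serving stack, not real library calls.

```python
from difflib import SequenceMatcher

def divergence_report(prompts, baseline_generate, tuned_generate):
    """Return (prompt, similarity) pairs, most-divergent prompts first."""
    rows = []
    for p in prompts:
        a, b = baseline_generate(p), tuned_generate(p)
        rows.append((p, SequenceMatcher(None, a, b).ratio()))
    return sorted(rows, key=lambda r: r[1])  # lowest similarity first

# probes = ["Summarise our refund policy", ...]   # fixed probe set
# print(divergence_report(probes, baseline_generate, tuned_generate)[:10])
```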
Document the sources and types of training data and their origins; potential biases present in the data related to the GAI application and its content provenance; and the architecture and training process of the pre-trained model, including information on hyperparameters, training duration, and any fine-tuning or retrieval-augmented generation processes applied. [2.2.4 Assurance Documentation]
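Where this documentation is kept alongside the model, a machine-readable record can help. The sketch below is illustrative only; the field names are assumptions, not a NIST-prescribed schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class ModelProvenanceRecord:
    data_sources: List[str]                  # origins of training data
    known_biases: List[str]                  # biases identified in the data
    architecture: str                        # e.g. "decoder-only transformer"
    hyperparameters: Dict[str, str]          # training hyperparameters
    training_duration: str                   # e.g. "14 days on 64 GPUs"
    adaptations: List[str] = field(default_factory=list)  # fine-tuning, RAG, etc.
```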
Evaluate user-reported problematic content and integrate feedback into system updates. [2.3.3 Monitoring & Logging]
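A minimal sketch of one way to operationalize this, assuming user reports arrive as (content_id, reason) pairs: aggregate the reports and surface the most-reported outputs for review before the next system update. The threshold is illustrative.

```python
from collections import Counter

def triage_reports(reports, min_reports=3):
    """Return content ids with at least `min_reports` user reports."""
    counts = Counter(content_id for content_id, _reason in reports)
    return [cid for cid, n in counts.most_common() if n >= min_reports]

# triage_reports([("out-17", "inaccurate"), ("out-17", "offensive"), ...])
```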
Implement content filters to prevent the generation of inappropriate, harmful, false, illegal, or violent content related to the GAI application, including for child sexual abuse material (CSAM) and non-consensual intimate imagery (NCII). These filters can be rule-based or leverage additional machine learning models to flag problematic inputs and outputs. [1.2.1 Guardrails & Filtering]
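A hedged sketch of the two filter styles described above, a rule-based pass followed by an ML pass; `classifier_score` is a hypothetical hook for an additional moderation model, and the pattern shown is a placeholder, not a production blocklist.

```python
import re

BLOCK_PATTERNS = [re.compile(r"\bhow to build a bomb\b", re.I)]  # placeholder rule

def filter_text(text, classifier_score, threshold=0.8):
    """Return (allowed, reason). Apply to both inputs and outputs."""
    for pattern in BLOCK_PATTERNS:              # stage 1: rule-based filter
        if pattern.search(text):
            return False, f"rule:{pattern.pattern}"
    if classifier_score(text) >= threshold:     # stage 2: ML model flags content
        return False, "classifier"
    return True, "ok"

# allowed, reason = filter_text(user_prompt, classifier_score=my_model.score)
```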
Implement real-time monitoring processes for analyzing generated content performance and trustworthiness characteristics related to content provenance to identify deviations from the desired standards and trigger alerts for human intervention. [2.3.3 Monitoring & Logging]
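A minimal sketch, not a full observability stack: track a rolling window of one trustworthiness metric (here, the share of outputs with verifiable provenance, an assumed metric) and trigger a human-intervention alert when it falls below a desired standard; `alert` is a placeholder for a real paging or queueing hook.

```python
from collections import deque

class ProvenanceMonitor:
    def __init__(self, alert, window=500, floor=0.98):
        self.window = deque(maxlen=window)   # rolling window of pass/fail
        self.floor = floor                   # desired standard (org-defined)
        self.alert = alert                   # human-intervention hook

    def record(self, provenance_ok: bool):
        self.window.append(provenance_ok)
        rate = sum(self.window) / len(self.window)
        if len(self.window) == self.window.maxlen and rate < self.floor:
            self.alert(f"provenance rate {rate:.3f} below floor {self.floor}")

# monitor = ProvenanceMonitor(alert=print)
# monitor.record(provenance_ok=verify_signature(output))  # per generation
```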
Leverage feedback and recommendations from organizational boards or committees related to the deployment of GAI applications and content provenance when using third-party pre-trained models. [2.1.1 Leadership Oversight]
Use human moderation systems where appropriate to review generated content in accordance with human-AI configuration policies established in the Govern function, aligned with socio-cultural norms in the context of use, and for settings where AI models are demonstrated to perform poorly. [2.3.3 Monitoring & Logging]
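One illustrative way to encode such routing in code, with the domain list, confidence signal, and queue all assumptions for the sketch:

```python
POOR_PERFORMANCE_DOMAINS = {"medical-advice", "legal-advice"}  # placeholder list

def route(output, domain, confidence, review_queue):
    """Hold output for human moderation in known-weak settings."""
    if domain in POOR_PERFORMANCE_DOMAINS or confidence < 0.5:
        review_queue.append((domain, output))   # send to human review queue
        return "pending_human_review"
    return "auto_published"
```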
Use organizational risk tolerance to evaluate acceptable risks and performance metrics, and decommission or retrain pre-trained models that perform outside of defined limits. [2.2.1 Risk Assessment]
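A hedged sketch of turning risk tolerance into operational limits: compare live metrics against declared bounds and recommend decommissioning or retraining when a model drifts outside them. The metric names, limits, and severity rule are illustrative, not prescribed by NIST AI 600-1.

```python
LIMITS = {"hallucination_rate": 0.05, "toxicity_rate": 0.01}  # org risk tolerance

def evaluate_model(metrics: dict) -> str:
    """Return a recommended action based on defined performance limits."""
    breaches = {k: v for k, v in metrics.items()
                if v > LIMITS.get(k, float("inf"))}
    if not breaches:
        return "within_tolerance"
    # Illustrative policy: severe breaches decommission, mild ones retrain.
    worst = max(v / LIMITS[k] for k, v in breaches.items())
    return "decommission" if worst >= 2.0 else "retrain"

print(evaluate_model({"hallucination_rate": 0.08, "toxicity_rate": 0.004}))
```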
Legal and regulatory requirements involving AI are understood, managed, and documented. [2.1.3 Policies & Procedures]
> Align GAI development and use with applicable laws and regulations, including those related to data privacy, copyright, and intellectual property law. [2.1.3 Policies & Procedures]
The characteristics of trustworthy AI are integrated into organizational policies, processes, procedures, and practices. [2.1.3 Policies & Procedures]
> Establish transparency policies and processes for documenting the origin and history of training data and generated data for GAI applications to advance digital content transparency, while balancing the proprietary nature of training approaches. [2.1.3 Policies & Procedures]
> Establish policies to evaluate risk-relevant capabilities of GAI and robustness of safety measures, both prior to deployment and on an ongoing basis, through internal and external evaluations. [2.1.3 Policies & Procedures]
Processes, procedures, and practices are in place to determine the needed level of risk management activities based on the organization’s risk tolerance. [2.1.3 Policies & Procedures]
Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile (NIST AI 600-1)
US National Institute of Standards and Technology (NIST) (2024)
This document is a cross-sectoral profile of, and companion resource for, the AI Risk Management Framework (AI RMF 1.0) for Generative AI, pursuant to President Biden’s Executive Order (EO) 14110 on Safe, Secure, and Trustworthy Artificial Intelligence. The AI RMF was released in January 2023 and is intended for voluntary use and to improve the ability of organizations to incorporate trustworthiness considerations into the design, development, use, and evaluation of AI products, services, and systems.
Operate and Monitor: Running, maintaining, and monitoring the AI system post-deployment.
Deployer: Entity that integrates and deploys the AI system for end users.
Manage: Prioritising, responding to, and mitigating AI risks.