Red teaming, capability evaluations, adversarial testing, and performance verification.
3.1 Governments should require and organisations should test AI systems thoroughly to ensure that they reliably adhere, in operation, to the underpinning ethical and moral principles and have been trained with data which are curated and are as ‘error-free’ as practicable, given the circumstances.

3.2 Governments are encouraged to adjust regulatory regimes and/or promote industry self-regulatory regimes for allowing market-entry of AI systems in order to reasonably reflect the positive exposure that may result from the public operation of such AI systems. Special regimes for intermediary and limited admissions to enable testing and refining of the operation of the AI system can help to expedite the completion of the AI system and improve its safety and reliability.

3.3 In order to ensure and maintain public trust in final human control, governments should consider implementing rules that ensure comprehensive and transparent investigation of such adverse and unanticipated outcomes of AI systems that have occurred through their usage, in particular if these outcomes have lethal or injurious consequences for the humans using such systems. Such investigations should be used for considering adjusting the regulatory framework for AI systems in particular to develop a more rounded understanding of how such systems should gracefully handover to their human operators.
Reasoning
This mitigation spans multiple L1/L2 categories: organizational testing (2.2.2), government regulation (3.1.1), self-regulatory mechanisms (3.3.3), and investigation procedures. No single focal activity is dominant enough to support a confident L3 classification.
Ethical Purpose and Societal Benefit
Organisations that develop, deploy or use AI systems and any national laws that regulate such use should require the purposes of such implementation to be identified and ensure that such purposes are consistent with the overall ethical purposes of beneficence and non-maleficence, as well as the other principles of the Policy Framework for Responsible AI.
3.2.2 Technical Standards | Ethical Purpose and Societal Benefit > Overarching principles
2.1.3 Policies & Procedures | Ethical Purpose and Societal Benefit > Work and automation
2.2.1 Risk Assessment | Ethical Purpose and Societal Benefit > Environmental impact
2.2.1 Risk Assessment | Ethical Purpose and Societal Benefit > Weaponised AI
3.1.3 International Agreements | Ethical Purpose and Societal Benefit > The weaponisation of false or misleading information
1.2.1 Guardrails & Filtering
Verify and Validate
Testing, evaluating, auditing, and red-teaming the AI system
Governance Actor
Regulator, standards body, or oversight entity shaping AI policy
Measure
Quantifying, testing, and monitoring identified AI risks
Primary
7 AI System Safety, Failures & Limitations