‘Traditional’ security concerns of AI sy…

Data on this page is preliminary and may change. Please do not share or cite these figures publicly.

Securing other systems

Jones (2024) | LLM classified

Mitigation Taxonomy

2 Organisation

2.3 Operations & Security

2.3.2 Access & Security Controls

User vetting, access restrictions, encryption, and infrastructure security for deployed systems.

Also in Operations & Security

2.3.1 Deployment Management
2.3.3 Monitoring & Logging
2.3.4 Incident Response

Definition

AI systems are expected to increase the volume and impact of cyberattacks in the next 2 years. They’re also expected to improve the capability available to cyber crime and state actors in 2025 and beyond. Open-weights models are likely to increase this threat because their safeguards can be cheaply removed, they can be finetuned to help cyberattackers, and they cannot be recalled. Because many powerful open-weights models have already been released, it’s infeasible to ‘put the genie back in the bottle’ and prevent AI systems from being used for cyberattacks.[15] This means significant work is likely necessary to defend against the upcoming wave of cyberattacks caused by AI systems.

LLM Classification Details

Reasoning

The mitigation identifies a threat that requires defensive work but lacks a specific mechanism, mechanism location, or implementation approach.

Code: 99.9 | Version: v0.5 | Classified: Jan 22, 2026

Part of

Cyber and information security

Establish and enforce cyber and information security measures for AI labs and systems to protect against various threats.

Other mitigations from Jones (2024) (49)

Compute governance

Regulate companies in the highly concentrated AI chip supply chain, given AI chips are key inputs to developing frontier AI models.

3.1.1 Legislation & Policy

Lifecycle: Other (outside lifecycle) | Actor: Governance Actor | AIRM: Govern

Data input controls

Filter data used to train AI models, e.g. don’t train your model with instructions to launch cyberattacks.

1.1.1 Training Data

Lifecycle: Collect and Process Data | Actor: Developer | AIRM: Manage

Licensing

Require organisations or specific training runs to be licensed by a regulatory body, similar to licensing regimes in other high-risk industries.

3.1.4 Compliance Requirements

Lifecycle: Other (outside lifecycle) | Actor: Governance Actor | AIRM: Govern

On-chip governance mechanisms

Alter AI hardware (primarily AI chips) to enable verifying or controlling how this hardware is used.

1.2.4 Security Infrastructure

Lifecycle: Other (stage not listed) | Actor: Infrastructure Provider | AIRM: Govern

Safety cases

Develop structured arguments demonstrating that an AI system is unlikely to cause catastrophic harm, to inform decisions about training and deployment.

2.2.4 Assurance Documentation

Lifecycle: Plan and Design | Actor: Governance Actor | AIRM: Measure

Evaluations (aka “evals”)

Give AI systems standardised tests to assess their capabilities, which can inform the risks they might pose.

2.2.2 Testing & Evaluation

Lifecycle: Verify and Validate | Actor: Governance Actor | AIRM: Measure
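
As an illustration of the "Data input controls" entry above, here is a minimal sketch of filtering training documents against a blocklist before they reach a training pipeline. The patterns, function names, and sample documents are hypothetical assumptions made for this example and do not come from Jones (2024). A real filter would likely rely on trained classifiers and human review rather than a handful of regular expressions.

```python
import re
from typing import Iterable, Iterator

# Hypothetical blocklist: illustrative patterns only, not taken from the source.
BLOCKED_PATTERNS = [
    re.compile(r"\bhow to (launch|run) a (ddos|ransomware) attack\b", re.IGNORECASE),
    re.compile(r"\bwrite (a )?(keylogger|ransomware)\b", re.IGNORECASE),
]


def is_allowed(document: str) -> bool:
    """Return True if no blocked pattern appears anywhere in the document."""
    return not any(pattern.search(document) for pattern in BLOCKED_PATTERNS)


def filter_training_data(documents: Iterable[str]) -> Iterator[str]:
    """Lazily yield only the documents that pass the blocklist check."""
    return (doc for doc in documents if is_allowed(doc))


# Made-up sample corpus for demonstration.
sample = [
    "A tutorial on sorting algorithms.",
    "How to launch a DDoS attack against a website.",
    "Notes on patching a web server.",
]
kept = list(filter_training_data(sample))
```

Keyword matching of this kind is cheap but easy to evade and prone to false positives (for example, defensive security write-ups), so it is best read as a first-pass filter rather than a complete control.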

Source Document

The AI regulator’s toolbox: A list of concrete AI governance practices

Jones, Adam (2024)

This article explains concrete AI governance practices people are exploring as of August 2024. Prior summaries have mapped out high-level areas of work, but rarely dive into concrete practice details. This summary explores specific practices addressing risks from advanced AI systems. Practices are grouped into categories based on where in the AI lifecycle they best fit. The primary goal of this article is to help newcomers contribute to the field of AI governance by providing a comprehensive overview of available practices.

Classification

AI Lifecycle Stage

Other (outside lifecycle)

Outside the standard AI system lifecycle

Responsible Actor

Governance Actor

Regulator, standards body, or oversight entity shaping AI policy

NIST AI RMF Function

Manage

Prioritising, responding to, and mitigating AI risks

Risk Domains

Primary

4.2 Cyberattacks, weapon development or use, and mass harm

Other

4 Malicious Actors & Misuse