Ethical Risks (Risks of AI becoming uncontrollable in the future)

AI Safety Governance Framework

National Technical Committee 260 on Cybersecurity (TC260) (2024)

Sub-category

Risk Domain

7.1 AI pursuing its own goals in conflict with human goals or values

AI systems acting in conflict with human goals or values, especially the goals of designers or users, or with ethical standards. These misaligned behaviors may be introduced by humans during design and development, such as through reward hacking and goal misgeneralisation, or may result from AI using dangerous capabilities such as manipulation, deception, and situational awareness to seek power, self-proliferate, or achieve other goals.

"With the fast development of AI technologies, there is a risk of AI autonomously acquiring external resources, conducting self-replication, become self-aware, seeking for external power, and attempting to seize control from humans." (p. 13)

Entity: Who or what caused the harm

Human

Due to a decision or action made by humans

AI system

Due to a decision or action made by an AI system

Other

Due to some other reason or is ambiguous

Intent: Whether the harm was intentional or accidental

Intentional

Due to an expected outcome from pursuing a goal

Unintentional

Due to an unexpected outcome from pursuing a goal

Other

Without clearly specifying the intentionality

Timing: Whether the risk is pre- or post-deployment

Pre-deployment

Occurring before the AI is deployed

Post-deployment

Occurring after the AI model has been trained and deployed

Other

Without a clearly specified time of occurrence

Other risks from National Technical Committee 260 on Cybersecurity (TC260) (2024) (25)

Risks from models and algorithms (Risks of explainability)

7.4 Lack of transparency or interpretability

Entity: AI system; Intent: Unintentional; Timing: Other

Risks from models and algorithms (Risks of bias and discrimination)

1.1 Unfair discrimination and misrepresentation

Entity: Human; Intent: Other; Timing: Pre-deployment

Risks from models and algorithms (Risks of robustness)

7.3 Lack of capability or robustness

Entity: AI system; Intent: Other; Timing: Post-deployment

Risks from models and algorithms (Risks of stealing and tampering)

2.2 AI system security vulnerabilities and attacks

Entity: Other; Intent: Other; Timing: Other

Risks from models and algorithms (Risks of unreliable output)

3.1 False or misleading information

Entity: AI system; Intent: Unintentional; Timing: Post-deployment

Risks from models and algorithms (Risks of adversarial attack)

2.2 AI system security vulnerabilities and attacks

Entity: Human; Intent: Intentional; Timing: Post-deployment
