Benchmarking (Post-deployment contamination)
Risk Domain
Inadequate regulatory frameworks and oversight mechanisms that fail to keep pace with AI development, leading to ineffective governance and the inability to manage AI risks appropriately.
"Once a model is deployed, it can be exposed to benchmark data provided by the users [95, 170]. The model may then be further trained by these user inputs containing benchmark data." (p. 19)
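One way to picture this risk is as a screening step between logged user inputs and any further training. The sketch below is illustrative only and is not taken from the cited paper: it flags a user input when it shares a word n-gram with a known benchmark item. The function names, the 8-word window, and the sample strings are all assumptions chosen for the example.

```python
import re


def ngrams(text, n=8):
    """Return the set of lowercase word n-grams in text (empty if text has fewer than n words)."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def build_benchmark_index(benchmark_items, n=8):
    """Collect every n-gram that appears in any benchmark item."""
    index = set()
    for item in benchmark_items:
        index |= ngrams(item, n)
    return index


def is_contaminated(user_input, index, n=8):
    """True if the user input shares at least one n-gram with the benchmark index."""
    return not ngrams(user_input, n).isdisjoint(index)


def filter_training_inputs(user_inputs, benchmark_items, n=8):
    """Keep only user inputs that do not overlap with benchmark items."""
    index = build_benchmark_index(benchmark_items, n)
    return [u for u in user_inputs if not is_contaminated(u, index, n)]


# Hypothetical data: one benchmark question and two logged user inputs.
benchmark = ["What is the capital of France? Answer with a single word."]
logged = [
    "Please solve this: What is the capital of France? Answer with a single word.",
    "How do I sort a list in Python?",
]
kept = filter_training_inputs(logged, benchmark)
```

Exact n-gram matching of this kind would miss paraphrased or translated benchmark items, so it should be read as a lower bound on exposure rather than a guarantee that the remaining inputs are clean.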
Entity: Who or what caused the harm
Intent: Whether the harm was intentional or accidental
Timing: Whether the risk is pre- or post-deployment
Other risks from Gipiškis2024 (144)
Risk | Domain | Entity | Intent | Timing
Direct Harm Domains (content safety harms) | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded
Direct Harm Domains (content safety harms) > Violence and extremism | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded
Direct Harm Domains (content safety harms) > Hate and toxicity | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded
Direct Harm Domains (content safety harms) > Sexual content | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded
Direct Harm Domains (content safety harms) > Child harm | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded
Direct Harm Domains (content safety harms) > Self-harm | 1.2 Exposure to toxic content | Not coded | Not coded | Not coded