Skip to main content
This is a research prototype. The data and analyses are preliminary and not yet validated — we'd welcome your .

Unrepresentative data

AI Risk Atlas

IBM (2025)

Sub-category
Risk Domain

AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.

"Unrepresentative data occurs when the training or fine-tuning data is not sufficiently representative of the underlying population or does not measure the phenomenon of interest."

Supporting Evidence (1)

1.
"If the data is not representative, then the model will not work as intended."

Other risks from IBM (2025) (63)