Skip to main content
Home/Risks/IBM2025/Improper data curation

Improper data curation

Sub-category
Risk Domain

AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.

"Improper collection and preparation of training or tuning data includes data label errors and by using data with conflicting information or misinformation."

Supporting Evidence (1)

1.
"Improper data curation can adversely affect how a model is trained, resulting in a model that does not behave in accordance with the intended values. Correcting problems after the model is trained and deployed might be insufficient for guaranteeing proper behavior."

Other risks from IBM2025 (63)