Skip to main content
BackInsufficient data representation
Home/Risks/Schnitzer2024/Insufficient data representation

Insufficient data representation

Category
Risk Domain

AI systems that fail to perform reliably or effectively under varying conditions, exposing them to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.

"The distribution of the data used for training a model should match the operational data ́s distribution while consisting of sufficiently many samples. An important aspect of matching distributions between training and operational data is that also data which is rarely confronting the AI system in operation is represented in the training data."(p. 9)

Other risks from Schnitzer2024 (24)