Skip to main content

Model bias

Towards risk-aware artificial intelligence and machine learning systems: An overview

Zhang et al. (2022)

Sub-category
Risk Domain

Unequal treatment of individuals or groups by AI, often based on race, gender, or other sensitive characteristics, resulting in unfair outcomes and unfair representation of those groups.

"While data bias is a major contributor of model bias, model bias actually manifests itself in different forms and shapes, such as presentation bias, model evaluation bias, and popularity bias. In addition, model bias arises from various sources [62], such as AI/ML model selection (e.g., support vector machine, decision trees), regularization methods, algorithm configurations, and optimization techniques."(p. 5)

Supporting Evidence (3)

1.
Model form error:"When all explanatory variables are available, but the model fails to characterize the relationship between the explanatory variables X and the quantity of interest Y. The specified functional form is inadequate to characterize the true relationship, leading to underfitting of the training data."(p. 6)
2.
Model overfitting: "When a very complex model is fit, it may show excellent performance on the training data but poor performance on data beyond the training set. The model's performance is unstable when making predictions, and it might not generalize well on the testing data."(p. 6)
3.
Variable inclusion error "There are two types of variable inclusion error: (1) Significant variables that should be included in the model are omitted, resulting in the model's inability to characterize the underlying data-generation process and leading to omitted-variable bias. (2) Irrelevant variables are included in the model, which may lead to model overfitting."(p. 6)

Other risks from Zhang et al. (2022) (6)