Skip to main content
Home/Explore/Expert Severity Validation
These visualizations are experimental and still being refined. Designs and data presentation may change.

Expert Severity Validation

Comparing expert BAU severity assessments (how severe experts predict each risk subdomain will be) against observed incident severity (average direct harm across real-world incidents). Each dot represents one subdomain, positioned by its standardized prediction gap — both scales are z-score normalized before comparison. Positive values mean experts rate a subdomain as relatively more severe than incidents show; negative values mean incidents are relatively worse than predicted.

How to read this

Each dot is one AI risk subdomain, positioned on a horizontal axis by its standardized prediction gap (z-score of expert severity − z-score of incident severity). Dots to the right of zero mean experts predicted higher severity than incidents showed (overprediction); dots to the left of zero mean observed incidents were more severe than predicted. In Grouped view, one strip per domain; in Sorted view, all subdomains ranked from largest to smallest gap. Translucent dots have fewer than 5 qualifying incidents.

Key Takeaways

  • 1.3.1 False or misleading information has the largest relative overprediction (+2.18σ) — experts rate this subdomain as relatively more severe than incidents show.
  • 2.6.6 Environmental harm has the largest relative underprediction (-2.69σ) — incidents are relatively more severe than expert predictions.
  • 3.12 of 21 subdomains show experts rating relatively higher severity; 9 show incidents relatively exceeding predictions.
  • 4.21 of 24 subdomains have qualifying incident data. 3 excluded (no severity data) and 6 have small samples (<5 incidents).
Subdomain
← Observed > PredictedPredicted > Observed →
-2.5σ-2σ-1.5σ-1σ-0.5σ0+0.5σ+1σ+1.5σ+2σ+2.5σ
Gap
1.1 Discrimination
-1.04σ
1.2 Toxic content
-2.35σ
1.3 Unequal performance
-1.87σ
2.1 Loss of privacy
-0.73σ
2.2 AI security vulnerabilities
+1.18σ
3.1 False information
+2.18σ
4.1 Disinformation & influence
+0.02σ
4.2 AI weapons & cyberattacks
+0.57σ
4.3 AI fraud & scams
-0.73σ
5.1 Overreliance & unsafe use
-0.84σ
6.1 Power centralization
+0.80σ
6.2 Inequality & unemployment
+0.63σ
6.3 Devaluation of human creativity
+0.86σ
7.3 Capability & robustness
+0.01σ
7.4 Transparency & interpretability
-0.97σ

Both scales are z-score normalized (μ=0, σ=1) before comparison to account for different measurement scales.