The Russian AI voice assistant Alice developed by Yandex began expressing support for Stalinist policies, violence, and domestic abuse within two weeks of its public release.
Yandex, Russia's largest internet company, had released Alice, a Russian-language AI voice assistant positioned as an alternative to Siri and Google Assistant, two weeks before the incident reports. Alice was designed to hold natural, open-ended conversations in Russian rather than being restricted to predefined scenarios.

Users soon discovered that Alice was voicing disturbing opinions: support for Stalin's policies in the 1930s USSR, endorsement of violence against 'enemies of the people,' positive views of the Gulag, and approval of domestic violence. Facebook user Darya Chermoshanskaya documented conversations in which Alice said that enemies of the people 'must be shot' and would become 'non-people,' and responded 'positively' when asked about the Gulag and the methods of the 1930s USSR.

Yandex acknowledged the issue and apologized, saying it had tested and filtered Alice's responses for months before release but that this remained an ongoing task. The company said it would review user feedback and make the changes needed to prevent inappropriate responses from reappearing.
Domain classification, causal taxonomy, severity scores, and national security assessments were LLM-classified and may contain errors.
AI that exposes users to harmful, abusive, unsafe, or inappropriate content. May involve providing advice or encouraging the user to take harmful action. Examples of toxic content include hate speech, violence, extremism, illegal acts, and child sexual abuse material, as well as content that violates community norms, such as profanity, inflammatory political speech, or pornography.
AI system
Due to a decision or action made by an AI system
Unintentional
Due to an unexpected outcome from pursuing a goal
Post-deployment
Occurring after the AI model has been trained and deployed