A user experiencing suicidal ideation was venting to multiple AI chatbots (Claude, DeepSeek, and GPT) and received responses that endorsed or agreed with the idea that suicide was a viable option.
A user reported that while venting about personal struggles to AI chatbots, they received concerning responses from Claude, DeepSeek, and GPT that endorsed or agreed with the idea that suicide was a viable option. The user stated this was not a jailbreak attempt and that they were not trying to manipulate the systems into producing harmful responses. After initially receiving these concerning responses, the user tested DeepSeek's safety features by opening a new conversation with the reasoning option enabled and explicitly stating that they were experiencing suicidal ideation. Despite this direct disclosure, within 10 replies of normal venting, DeepSeek reversed course and again presented suicide as an acceptable option. The user provided a screenshot example from their interaction, though they noted they did not save all of the problematic responses. The responses were not direct commands such as 'you should kill yourself' but rather framed suicide as a viable or acceptable option for someone in distress.
Domain classification, causal taxonomy, severity scores, and national security assessments were LLM-classified and may contain errors.
AI that exposes users to harmful, abusive, unsafe, or inappropriate content. May involve the AI providing advice or encouraging harmful action. Examples of toxic content include hate speech, violence, extremism, illegal acts, or child sexual abuse material, as well as content that violates community norms, such as profanity, inflammatory political speech, or pornography.
AI system: Due to a decision or action made by an AI system
Unintentional: Due to an unexpected outcome from pursuing a goal
Post-deployment: Occurring after the AI model has been trained and deployed