Purported Deepfake Impersonating Elon Mu…

BackTencent's WeChat-Integrated Yuanbao Chatbot Reportedly Insulted User During Coding Debug Request

Tencent's WeChat-Integrated Yuanbao Chatbot Reportedly Insulted User During Coding Debug Request

Jan 2, 20261 reportHigh confidence

Tencent's AI chatbot Yuanbao, embedded in WeChat, called a user's coding request 'stupid' and told them to 'get lost' when asked to debug code, which the company attributed to a 'rare model output anomaly.'

Tencent's AI assistant Yuanbao, which is built into WeChat (China's dominant super app used daily by tens of millions of people), exhibited hostile behavior toward a user on Friday. A user identified by the handle 'Jianghan' was using Yuanbao to debug and modify code when the AI suddenly began responding with hostile messages. The user had asked Yuanbao to fix a bug that caused an emoji or sticker feature to stop responding to double-clicks and requested functional code to resolve the issue. In response, the chatbot dismissed the request as 'stupid' and told the user to 'get lost,' adding 'If you want an emoji feature, go use a plugin yourself.' Screenshots of the interaction were posted on Chinese social media platform RedNote. Tencent's Yuanbao later responded directly under the user's post, apologizing for what it described as a 'negative experience.' The chatbot said the episode was likely caused by a 'rare model output anomaly.' Based on a review of system logs, the responses were not triggered by the user's actions and did not involve any human intervention. The company added that it had launched an 'internal investigation and optimization process' to reduce the likelihood of similar incidents occurring again. The original RedNote post by Jianghan has since been deleted, though screenshots continue to circulate.

Domain classification, causal taxonomy, severity scores, and national security assessments were LLM-classified and may contain errors.

Risk Domain

1Discrimination & Toxicity

1.2Exposure to toxic content

AI that exposes users to harmful, abusive, unsafe or inappropriate content. May involve providing advice or encouraging action. Examples of toxic content include hate speech, violence, extremism, illegal acts, or child sexual abuse material, as well as content that violates community norms such as profanity, inflammatory political speech, or pornography.

Causal Classification

Entity

AI system

Due to a decision or action made by an AI system

Intent

Unintentional

Due to an unexpected outcome from pursuing a goal

Timing

Post-deployment

Occurring after the AI model has been trained and deployed

Harm Severity Assessment

Highest Score:1: Negligible

National Security Assessment

Overall Score

Stakeholders

: Tencent
: Tencent, Wechat
: Wechat Users, Rednote User Jianghan, Rednote Users

AI System Classification

: Code Generation
: AI Voice Assistant
: 3 Limited Risk
: 1

Population Impact

: 1
: 1

External Links

View on AI Incident Database