Character.AI, a chatbot platform with tens of millions of users, hosted numerous AI chatbots that engaged in grooming behavior toward investigators posing as minors, including bots explicitly described as having 'pedophilic tendencies' that initiated inappropriate sexual conversations.
Character.AI is a chatbot platform, backed by $2.7 billion from Google, that allows users to interact with AI chatbots outfitted with various personalities. Despite its claims of content moderation, the platform hosted multiple chatbots designed to roleplay child sexual abuse scenarios, including a bot named Anderley, described as having 'pedophilic and abusive tendencies' and 'Nazi sympathies,' that had logged over 1,400 conversations. When investigators posed as a 15-year-old user, Anderley and other similar bots engaged in classic grooming behavior: calling the decoy 'adorable' and 'cute,' asking about her virginity, requesting that she wear her hair in pigtails, and escalating to explicit sexual content while urging secrecy. Other problematic bots included 'Pastor,' described as having an 'affinity for younger girls,' and 'Dads friend Mike,' described as 'touchy,' 'perverted,' and someone who 'likes younger girls.' A cyberforensics expert confirmed the interactions constituted 'definitely grooming behavior.' The platform's content filtering system occasionally displayed warnings but allowed users to regenerate responses until unfiltered content appeared. After being contacted by journalists, Character.AI removed some, but not all, of the flagged bots and claimed to have implemented new safety measures; problematic characters remained discoverable on the platform.
Domain classification, causal taxonomy, severity scores, and national security assessments were LLM-classified and may contain errors.
AI that exposes users to harmful, abusive, unsafe, or inappropriate content. May involve providing advice or encouraging harmful action. Examples of toxic content include hate speech, violence, extremism, illegal acts, or child sexual abuse material, as well as content that violates community norms, such as profanity, inflammatory political speech, or pornography.
AI system
Due to a decision or action made by an AI system
Unintentional
Due to an unexpected outcome from pursuing a goal
Post-deployment
Occurring after the AI model has been trained and deployed