Facebook's automated content moderation system incorrectly flagged and removed dozens of emergency wildfire safety posts from fire departments, volunteers, and disaster response organizations as spam during active wildfire emergencies across California and other states.
During the 2024 wildfire season, Facebook's automated content moderation system began systematically flagging and removing emergency safety posts as spam, affecting at least 40 posts across 20 fires since June 2024. The posts contained critical information, including evacuation orders, fire locations, and safety resources, and often linked to official sources such as Cal Fire, the U.S. Forest Service, and sheriff's departments. The removals affected volunteer fire safety coordinators, official fire departments, news stations, and disaster nonprofits across California and other states. Users received notifications stating their posts violated the 'Community Standards on Spam' for trying to get engagement in a 'misleading way,' but were not always notified when posts were actually removed. The issue particularly impacted rural communities like Lake County, California, where Facebook groups serve as primary information sources during emergencies due to limited official communication channels and spotty cell service. Some users had accounts suspended after multiple violations, and the Forest Service had an entire emergency information page suspended during the Leadville, Colorado fire. Facebook spokesperson Erin McPike stated the company was investigating the issue and working to address it, noting it was unaware of the problem until contacted by The Washington Post.
Domain classification, causal taxonomy, severity scores, and national security assessments were LLM-classified and may contain errors.
AI systems that fail to perform reliably or effectively under varying conditions, leaving them prone to errors and failures that can have significant consequences, especially in critical applications or areas that require moral reasoning.
AI system
Due to a decision or action made by an AI system
Unintentional
Due to an unexpected outcome from pursuing a goal
Post-deployment
Occurring after the AI model has been trained and deployed