Coordinated Manipulation Threatens Crowdsourced Fact-Checking Systems

Nikil Roashan Selvam, Jay Baxter, Sophie Hilgard, Brad Miller, Keith Coleman, Ellen Vitercik, Sanmi Koyejo· July 3, 2026 View original

Summary

This research investigates how coordinated users can strategically manipulate crowdsourced fact-checking systems, particularly those using matrix factorization for consensus, like X's Community Notes. It reveals that a small number of strategic ratings can push low-quality notes above consensus thresholds and that even "Not Helpful" ratings can paradoxically increase a note's helpfulness score.

Major social media platforms like X, Meta, TikTok, and Google employ crowdsourced fact-checking to combat misinformation at scale, relying on bridging mechanisms that identify misleading information when supported by diverse perspectives. The core of these systems, often based on matrix factorization (as seen in X and Meta), is designed to prevent simple majority rule from being exploited. This study examines the vulnerability of these matrix factorization-based systems to coordinated manipulation. Theoretical and empirical evaluations, using historical production data, show that strategic voting by a small group of users can fabricate synthetic consensus. Specifically, up to 10.7% of lower-quality notes could be manipulated past consensus thresholds with fewer than 10 ratings. Counterintuitively, the analysis also reveals that rating a note as "Not Helpful" can sometimes increase its helpfulness score. The research quantifies the manipulation effort and notes that mitigations have been developed and deployed within X's Community Notes algorithm.

Why it matters

Understanding these vulnerabilities is crucial for social media platforms and any organization relying on crowdsourced content moderation to protect against sophisticated manipulation tactics and maintain the integrity of information.

How to implement this in your domain

  1. 1Conduct regular audits of crowdsourced moderation systems to identify patterns indicative of coordinated manipulation.
  2. 2Implement advanced anomaly detection algorithms to flag unusual voting behaviors or rapid shifts in consensus.
  3. 3Develop and deploy dynamic algorithms that adapt to and mitigate strategic voting patterns, as X has done.
  4. 4Educate community moderators and platform users about potential manipulation tactics to foster a more vigilant environment.

Who benefits

Social MediaContent ModerationCybersecurityPublic RelationsGovernment

Key takeaways

  • Crowdsourced fact-checking systems are vulnerable to coordinated manipulation.
  • Strategic voting can fabricate synthetic consensus with minimal effort.
  • Counterintuitive effects, like "Not Helpful" ratings increasing helpfulness, can occur.
  • Platforms must continuously develop and deploy mitigations against these threats.

Original post by Nikil Roashan Selvam, Jay Baxter, Sophie Hilgard, Brad Miller, Keith Coleman, Ellen Vitercik, Sanmi Koyejo

"arXiv:2607.01824v1 Announce Type: new Abstract: Crowdsourced fact-checking systems have been adopted by major social media companies such as X, Meta, TikTok and Google with the aim of combating misleading information at scale without relying on centralized editorial control. Thes…"

View on X

Originally posted by Nikil Roashan Selvam, Jay Baxter, Sophie Hilgard, Brad Miller, Keith Coleman, Ellen Vitercik, Sanmi Koyejo on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses