Multi-Agent LLM Framework Improves Community Note Evaluation on X

Changxi Wen, Shuning Zhang, Bohao Chu, Yuwei Chuai, Hui Wang, Dai Shi, Xin Yi, Hewu Li· June 18, 2026 View original

Summary

Researchers developed MultiCom, a persona-guided multi-agent LLM framework for evaluating community notes on X, leveraging a large dataset of 2.5 million notes. MultiCom simulates diverse raters to generate structured, explainable judgments, significantly outperforming alternative methods in accuracy for identifying reliable community fact-checks.

A new research initiative addresses the challenges of slow and low-ratio human-based fact-checking on social media platforms, particularly concerning community notes. The team first compiled ComRate, an extensive dataset comprising 2.5 million community notes and over 209 million associated ratings from the platform X (formerly Twitter). This dataset serves as a foundation for developing more efficient evaluation methods. Building on this, the researchers propose MultiCom, a novel framework that uses persona-guided multi-agent large language models (LLMs) to evaluate community notes. MultiCom simulates a diverse population of raters by clustering real contributors in a matrix-factorized rater space. These persona agents are then prompted to generate structured and explainable assessments of community notes, adhering to the official rating schema and providing details like confidence levels, agreement signals, and underlying reasons. An advanced, out-of-fold calibrated aggregation algorithm processes these agent-generated judgments, combining raw votes with diagnostic reason signals to produce highly reliable predictions. Extensive evaluations demonstrate that MultiCom significantly surpasses existing methods, achieving an average accuracy of 84.7% (with a balanced accuracy of 68.3% and macro-F1 of 60.1%) on the evaluation set. This marks a substantial improvement in the automated assessment of community-based fact-checks.

Why it matters

For social media platforms and content moderation teams, this framework offers a scalable, efficient, and explainable method to evaluate community fact-checks, potentially reducing misinformation spread and improving platform integrity.

How to implement this in your domain

  1. 1Explore multi-agent LLM systems for automating content moderation or fact-checking processes on your platform.
  2. 2Develop persona-guided agents to simulate diverse user behaviors and judgments for evaluation tasks.
  3. 3Utilize large-scale datasets of user interactions to train and validate agent-based evaluation frameworks.
  4. 4Implement explainable AI techniques within agent judgments to provide transparency on moderation decisions.
  5. 5Integrate calibrated aggregation algorithms to combine agent outputs for robust and reliable predictions.

Who benefits

Social MediaContent ModerationPublic PolicyAI/ML DevelopmentJournalism

Key takeaways

  • MultiCom uses multi-agent LLMs to automate community note evaluation on social media.
  • It leverages persona-guided agents to simulate diverse rater populations.
  • The framework provides structured, explainable judgments for fact-checking.
  • MultiCom significantly outperforms traditional methods in accuracy and efficiency.

Original post by Changxi Wen, Shuning Zhang, Bohao Chu, Yuwei Chuai, Hui Wang, Dai Shi, Xin Yi, Hewu Li

"arXiv:2606.18268v1 Announce Type: cross Abstract: Community-based fact-checking that relies on cross-consensus is expanding rapidly on social media platforms. However, the delay and low-ratio of cross-consensus community fact-checks rated by human contributors remains a significa…"

View on X

Originally posted by Changxi Wen, Shuning Zhang, Bohao Chu, Yuwei Chuai, Hui Wang, Dai Shi, Xin Yi, Hewu Li on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses