New Verifier Ensures Source-Aware Factuality for LLM Agents

Ander Alvarez, Santhiya Rajan, Samuel Mugel, Román Orús · June 17, 2026

Summary

ProvenanceGuard is a source-aware verifier for LLM agents that use the Model Context Protocol (MCP). It is designed to detect "cross-source conflation," where a claim is supported by the evidence but attributed to the wrong source. It decomposes answers into atomic claims, routes each claim to source-specific evidence, and checks the stated attribution against the source that actually supports the claim.

This research introduces ProvenanceGuard, a verification system for LLM agents that use the Model Context Protocol (MCP) to synthesize information from multiple heterogeneous sources. The core problem it addresses is "cross-source conflation," where an agent makes a claim that *some* evidence supports but attributes it to the wrong source. ProvenanceGuard analyzes MCP traces, breaks the agent's answer into atomic claims, and routes each claim to its source-specific evidence. It uses Natural Language Inference (NLI) and token alignment to verify support, then compares the agent's stated attribution with the source that actually supports the claim. The system returns a verdict for each claim and an overall allow/block decision for the answer; blocked answers can be repaired and re-verified. Evaluated on medical-domain MCP-agent traces, ProvenanceGuard performed strongly at detecting and preventing attribution errors and significantly outperformed source-blind baselines, which check whether a claim is supported without checking which source supports it.
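The per-claim check described above can be sketched in a few lines. This is a minimal illustration, not ProvenanceGuard's implementation: it assumes a toy token-overlap score in place of the NLI and token-alignment models, and the names `Claim`, `token_overlap`, `verify_claim`, and `verify_answer`, the 0.8 threshold, and the source labels are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Claim:
    text: str          # one atomic claim from the agent's answer
    cited_source: str  # the source the agent attributed it to


def token_overlap(claim: str, evidence: str) -> float:
    """Fraction of claim tokens found in the evidence.

    Toy stand-in (assumption) for an NLI / token-alignment support score.
    """
    claim_tokens = set(claim.lower().split())
    evidence_tokens = set(evidence.lower().split())
    if not claim_tokens:
        return 0.0
    return len(claim_tokens & evidence_tokens) / len(claim_tokens)


def verify_claim(claim: Claim, evidence_by_source: Dict[str, str],
                 threshold: float = 0.8) -> str:
    """Score the claim against each source separately, then check attribution."""
    supporting = {
        source for source, evidence in evidence_by_source.items()
        if token_overlap(claim.text, evidence) >= threshold
    }
    if claim.cited_source in supporting:
        return "supported"
    if supporting:
        # Some source supports the claim, but not the one the agent cited.
        return "misattributed"
    return "unsupported"


def verify_answer(claims: List[Claim], evidence_by_source: Dict[str, str],
                  threshold: float = 0.8) -> Tuple[List[str], str]:
    """Per-claim verdicts plus an answer-level allow/block decision."""
    verdicts = [verify_claim(c, evidence_by_source, threshold) for c in claims]
    decision = "allow" if all(v == "supported" for v in verdicts) else "block"
    return verdicts, decision


# Hypothetical evidence keyed by the tool or source that returned it.
evidence = {
    "formulary": "metformin is covered at tier 1",
    "clinical_record": "patient has type 2 diabetes",
}
claims = [
    Claim("metformin is covered at tier 1", "formulary"),
    Claim("patient has type 2 diabetes", "formulary"),       # wrong source cited
    Claim("patient has hypertension", "clinical_record"),    # no source supports it
]
verdicts, decision = verify_answer(claims, evidence)
print(verdicts, decision)
```

A source-blind check would pool all evidence and accept the second claim, because some source supports it. Scoring each source separately is what lets the sketch flag it as misattributed.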

Why it matters

For professionals building or deploying LLM agents that synthesize information from multiple sources, ProvenanceGuard offers a critical mechanism to ensure not just factuality, but also correct source attribution, which is vital for trust, compliance, and preventing misinformation. This is particularly important in domains like healthcare where source accuracy is paramount.

How to implement this in your domain

  1. Integrate ProvenanceGuard into LLM agent pipelines that use multiple evidence sources to verify source attribution.
  2. Utilize the system's claim decomposition and source routing capabilities to pinpoint attribution errors.
  3. Implement the repair-and-reverify mechanism for blocked answers to improve overall accuracy and reliability.
  4. Apply source-aware factuality verification in high-stakes applications where correct attribution is crucial.
  5. Develop internal metrics to track cross-source conflation rates in agent outputs.
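Steps 3 and 5 can be prototyped together. The sketch below is an assumption-heavy illustration: the source does not describe how repair works, so `toy_repair` simply re-cites misattributed claims and drops unsupported ones, `toy_verify` uses a hard-coded lookup table instead of a real verifier, and every name here (`SUPPORTING_SOURCE`, `conflation_rate`, `repair_and_reverify`, `max_rounds`) is hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Claim = Tuple[str, str]  # (claim text, source the agent cited)

# Hypothetical lookup: which source actually supports each claim.
SUPPORTING_SOURCE: Dict[str, str] = {
    "metformin is covered at tier 1": "formulary",
    "patient has type 2 diabetes": "clinical_record",
}


def toy_verify(claims: List[Claim]) -> List[str]:
    """Stand-in verifier (assumption): one verdict per claim."""
    verdicts = []
    for text, cited in claims:
        actual = SUPPORTING_SOURCE.get(text)
        if actual is None:
            verdicts.append("unsupported")
        elif actual == cited:
            verdicts.append("supported")
        else:
            verdicts.append("misattributed")
    return verdicts


def toy_repair(claims: List[Claim], verdicts: List[str]) -> List[Claim]:
    """Stand-in repair (assumption): re-cite misattributed claims, drop unsupported ones."""
    repaired = []
    for (text, cited), verdict in zip(claims, verdicts):
        if verdict == "supported":
            repaired.append((text, cited))
        elif verdict == "misattributed":
            repaired.append((text, SUPPORTING_SOURCE[text]))
    return repaired


def conflation_rate(verdicts: List[str]) -> float:
    """Share of claims that are supported somewhere but cited to the wrong source."""
    if not verdicts:
        return 0.0
    return verdicts.count("misattributed") / len(verdicts)


def passes(verdicts: List[str]) -> bool:
    # An empty answer is not allowed through, even though it has no bad claims.
    return bool(verdicts) and all(v == "supported" for v in verdicts)


def repair_and_reverify(
    claims: List[Claim],
    verify: Callable[[List[Claim]], List[str]],
    repair: Callable[[List[Claim], List[str]], List[Claim]],
    max_rounds: int = 2,
) -> Tuple[List[Claim], List[str], str]:
    """Verify, and while the answer is blocked, repair and verify again."""
    verdicts = verify(claims)
    for _ in range(max_rounds):
        if passes(verdicts):
            break
        claims = repair(claims, verdicts)
        verdicts = verify(claims)
    return claims, verdicts, "allow" if passes(verdicts) else "block"


answer = [
    ("metformin is covered at tier 1", "formulary"),
    ("patient has type 2 diabetes", "formulary"),
    ("patient has hypertension", "clinical_record"),
]
initial_verdicts = toy_verify(answer)
rate = conflation_rate(initial_verdicts)
final_claims, final_verdicts, final_decision = repair_and_reverify(
    answer, toy_verify, toy_repair
)
print(initial_verdicts, rate, final_decision)
```

Capping the loop with `max_rounds` keeps a persistently failing answer blocked instead of retrying indefinitely. Logging `conflation_rate` on the first verification pass, before any repair, gives a measure of how often the agent itself misattributes claims.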

Who benefits

Healthcare, Legal, Finance, Information Services, Pharma

Key takeaways

  • LLM agents using multiple sources can suffer from "cross-source conflation," attributing claims to the wrong source.
  • ProvenanceGuard is a new verifier that checks both factuality and correct source attribution for agent outputs.
  • The system decomposes answers into atomic claims and routes them to specific evidence for verification.
  • Accurate source attribution is critical for trustworthy AI agents, especially in sensitive domains.

Original post by Ander Alvarez, Santhiya Rajan, Samuel Mugel, Román Orús

"arXiv:2606.18037v1 Announce Type: new Abstract: Tool-using LLM agents increasingly use the Model Context Protocol (MCP) to answer from heterogeneous evidence sources, including search, APIs, databases, clinical records, and formulary tools. Standard factuality metrics usually tes…"
