D2R-RAG Diagnoses and Repairs Factual Errors in RAG Systems

Soroush Hashemifar, Havva Alizadeh Noughabi, Fattane Zarrinkalam, Ali Dehghantanha· June 30, 2026 View original

Summary

Researchers introduce D2R-RAG (Diagnose-to-Repair RAG), a model-agnostic framework that improves the reliability of Retrieval-Augmented Generation (RAG) systems. D2R-RAG diagnoses factual errors using observable signals and applies adaptive repairs under latency and VRAM constraints, outperforming baselines in accuracy and efficiency.

Retrieval-Augmented Generation (RAG) systems, while improving LLM factuality, often suffer from fragility due to missing or irrelevant evidence, or generations that don't faithfully reflect the retrieved context. Existing solutions often require fine-tuning or privileged model access, limiting their practical use in black-box or resource-constrained environments. To address these limitations, D2R-RAG (Diagnose-to-Repair RAG) has been developed as a model-agnostic and resource-aware framework. It identifies failure signatures from observable signals in the query, retrieved evidence, and generated response, then selects from a small set of corrective actions while adhering to explicit latency and VRAM budgets. Experiments on benchmarks like FEVER and HotpotQA show that D2R-RAG significantly enhances reliability and achieves superior accuracy-efficiency trade-offs compared to other methods.

Why it matters

This framework offers a practical and efficient way for professionals to improve the factual accuracy and reliability of RAG systems, especially in environments with limited computational resources.

How to implement this in your domain

  1. 1Integrate D2R-RAG's diagnostic and repair mechanisms into existing RAG pipelines to enhance factuality.
  2. 2Develop custom failure signatures based on domain-specific error patterns in RAG outputs.
  3. 3Implement adaptive repair strategies that consider computational budget constraints for real-time applications.
  4. 4Utilize D2R-RAG to improve the robustness of RAG systems deployed in production, particularly for critical information retrieval.

Who benefits

AI EngineeringCustomer ServiceLegalHealthcareFinancial Services

Key takeaways

  • RAG systems can be fragile due to poor retrieval or unfaithful generation.
  • D2R-RAG diagnoses factual errors using observable signals without fine-tuning.
  • It applies adaptive repairs under explicit latency and VRAM constraints.
  • D2R-RAG improves RAG reliability and efficiency trade-offs over existing methods.

Original post by Soroush Hashemifar, Havva Alizadeh Noughabi, Fattane Zarrinkalam, Ali Dehghantanha

"arXiv:2606.29377v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) improves the factuality of large language models by grounding responses in external evidence, yet real-world deployments remain fragile. Failures often stem from missing or weakly relevant eviden…"

View on X

Originally posted by Soroush Hashemifar, Havva Alizadeh Noughabi, Fattane Zarrinkalam, Ali Dehghantanha on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses