LLMs Struggle with Self-Awareness on Clinical Data, New Method Improves Reliability

Akshat Dasula, Prasanna Desikan, Jaideep Srivastava· June 19, 2026 View original

Summary

This research reveals that Large Language Models (LLMs) often fail to recognize their own knowledge limitations when applied to structured clinical data, exhibiting "epistemic blind spots." A new method using cross-model attribution divergence significantly improves LLM reliability and provides patient-specific confidence estimates without needing model internals or retraining.

Large Language Models are increasingly being deployed for tasks involving structured clinical data, yet their ability to discern the boundaries of their own knowledge in such critical applications remains largely unexamined. This study investigates this crucial question by analyzing "epistemic blind spots" in LLMs, specifically comparing Qwen 2.5 7B with XGBoost on a clinical prediction task using cross-model attribution divergence. The findings highlight several key issues. Firstly, LLMs' verbalized confidence is shown to be unreliable, remaining consistently high even when prediction accuracy is low, indicating it tracks prompt format rather than actual prediction quality. Secondly, LLMs exhibit an inverse difficulty effect, performing worse on tasks where XGBoost is highly accurate but matching XGBoost's performance on moderately uncertain tasks. The research also demonstrates that combining few-shot examples with SHAP-derived feature evidence acts as a powerful, super-additive intervention, drastically reducing attribution disagreement and improving accuracy without requiring model training. Finally, a novel cross-model calibrator, which uses attribution divergence signals, significantly reduces expected calibration error, providing patient-specific reliability estimates that are far more informative than the LLM's inherent verbalized confidence. This work frames these challenges as a "cold start problem" for LLMs on structured data and outlines a path towards achieving genuine epistemic self-awareness.

Why it matters

For healthcare professionals and AI developers, this research is critical for safely deploying LLMs in clinical settings. It exposes a fundamental flaw in LLM self-assessment and offers a practical, non-invasive method to improve their reliability and provide trustworthy uncertainty estimates, which is vital for patient care and regulatory compliance.

How to implement this in your domain

  1. 1Implement cross-model attribution divergence techniques to assess and improve the reliability of LLMs on structured data.
  2. 2Develop patient-specific reliability estimates for LLM predictions in clinical applications, moving beyond generic confidence scores.
  3. 3Incorporate few-shot examples and feature evidence (e.g., SHAP values) to enhance LLM accuracy and reduce epistemic uncertainty without retraining.
  4. 4Prioritize external calibration methods for LLM outputs, especially in high-stakes domains like healthcare, to ensure trustworthy predictions.
  5. 5Educate stakeholders on the limitations of LLM verbalized confidence and the importance of robust uncertainty quantification.

Who benefits

HealthcarePharmaMedical DevicesAI EngineeringRegulatory Compliance

Key takeaways

  • LLMs struggle to recognize their own knowledge limits on structured clinical data, showing unreliable verbalized confidence.
  • A "cold start problem" exists for LLMs on structured data, where accuracy can drop significantly even when other models are highly confident.
  • Cross-model attribution divergence can detect epistemic blind spots and improve LLM reliability.
  • Combining few-shot examples and SHAP-derived feature evidence significantly enhances accuracy and reduces disagreement.

Original post by Akshat Dasula, Prasanna Desikan, Jaideep Srivastava

"arXiv:2606.19509v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly applied to structured clinical data, yet whether they can recognize the limits of their own knowledge on such tasks remains unexplored. We study this question through the lens of cross-m…"

View on X

Originally posted by Akshat Dasula, Prasanna Desikan, Jaideep Srivastava on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses