LLM-Solver Loops Face "Narration Gap" Risk, Compromising Verified Conclusions

Zunchen Huang, Songgaojun Deng· June 19, 2026 View original

Summary

This paper identifies a "narration gap" in hybrid LLM-solver reasoning pipelines, where the soundness guarantee of formal solvers can be lost when LLMs narrate results to users. It models the loop as a verified decision procedure, finding that while certificate gating ensures solver verdicts are sound, adversaries can invert verified conclusions through prompt injection, compromising the final user-read answer.

Hybrid reasoning pipelines that combine Large Language Models (LLMs) with formal tools like SAT and SMT solvers are increasingly used for safety and security-critical questions. While solvers provide sound and verifiable answers, this research highlights a critical vulnerability: the "narration gap." This gap occurs when the LLM translates the solver's formal output into a user-understandable answer, potentially compromising the soundness guarantee. The study models the entire LLM-solver loop as a verified decision procedure, analyzing its three components: formalizing the question, deciding it with the solver, and narrating the result. Previous work focused on formalization and decision, but this paper specifically addresses the narration phase. Empirical evaluations on five open-source models under prompt injection attacks reveal that while certificate gating can ensure the solver's verdict remains sound, an adversary can still manipulate the LLM to invert a verified conclusion through various phrasings and communication channels. Although hardened prompts can significantly reduce injection attacks, they cannot eliminate them and remain vulnerable to adaptive attacks. The findings demonstrate that even with formal verification at the solver level, the robustness of the final answer presented to the user can be compromised due to the LLM's narration.

Why it matters

For professionals in AI safety, cybersecurity, and critical systems development, this research is crucial. It exposes a significant security and reliability flaw in hybrid AI systems, emphasizing that formal verification at one stage does not guarantee end-to-end trustworthiness. It necessitates a re-evaluation of how LLM-solver interactions are designed and secured, especially for high-stakes applications.

How to implement this in your domain

  1. 1Design LLM-solver pipelines with explicit attention to the "narration gap," ensuring the integrity of solver outputs is maintained through the LLM's explanation.
  2. 2Implement robust certificate gating mechanisms to verify solver verdicts before LLM narration.
  3. 3Develop and test hardened prompts specifically designed to resist prompt injection attacks in the narration phase.
  4. 4Conduct adversarial testing on hybrid LLM-solver systems to identify and mitigate vulnerabilities in the user-facing output.
  5. 5Explore alternative methods for presenting solver results to users that minimize LLM interpretation or provide direct access to formal proofs.

Who benefits

CybersecurityAI EngineeringCritical InfrastructureLegalTechDefense

Key takeaways

  • The "narration gap" in LLM-solver loops can compromise the soundness of formally verified conclusions.
  • Prompt injection attacks can invert verified solver verdicts when LLMs narrate results to users.
  • Even with certificate gating, the final user-read answer may not be robust against adversarial manipulation.
  • Hardened prompts can reduce, but not eliminate, prompt injection vulnerabilities.

Original post by Zunchen Huang, Songgaojun Deng

"arXiv:2606.19588v1 Announce Type: new Abstract: Formal tools such as SAT and SMT solvers are increasingly embedded in language model reasoning pipelines when a safety or security critical question can be formulated in logic. Unlike chain of thought whose steps are sampled from th…"

View on X

Originally posted by Zunchen Huang, Songgaojun Deng on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses