New Framework Enhances Prompt Adaptation in Multi-LLM Agent Systems

Tan Zhu, Tong Yao, Kananart Kuwaranancharoen, Amit Singh, Yushang Lai, Deepa Mohan, Shankara Bhargava· June 15, 2026 View original

▶ The 60-second brief

Summary

Researchers introduce Graph-based Target Back-Propagation (GTBP), a novel context adaptation framework for multi-LLM agentic workflows. GTBP improves prompt engineering by accurately assigning credit and ensuring convergence, outperforming existing methods in complex agent systems.

Context adaptation is a technique that automates prompt engineering in Large Language Model systems by iteratively refining prompts based on task feedback, without altering the model's core weights. Extending this capability to multi-LLM agentic systems, where multiple LLMs collaborate on a task, presents challenges, particularly in accurately attributing performance to individual agents and guaranteeing the stability of prompt updates. This research proposes a new framework called Graph-based Target Back-Propagation (GTBP). GTBP models agentic workflows as directed acyclic graphs, allowing it to propagate local target outputs backward through the graph. This mechanism uses discrepancies between target and actual outputs to guide a stage-wise prompt update process. The theoretical analysis demonstrates that GTBP's prompt updates stabilize over iterations and that a capable LLM optimizer can reduce the overall objective. Empirically, GTBP consistently surpasses other strong baseline methods across various benchmarks, all while maintaining comparable computational efficiency, making it a robust solution for complex multi-agent AI systems.

Why it matters

For professionals building and deploying complex AI systems involving multiple interacting LLMs, efficient and reliable prompt engineering is crucial. GTBP offers a principled method to automate and optimize these prompts, leading to more robust, adaptable, and performant agentic workflows.

How to implement this in your domain

  1. 1Adopt graph-based representations for multi-LLM agentic workflows to enable structured context adaptation.
  2. 2Implement target back-propagation mechanisms to accurately assign credit and guide prompt updates in multi-agent systems.
  3. 3Integrate iterative prompt refinement loops into your agentic AI development pipeline.
  4. 4Evaluate GTBP or similar context adaptation frameworks for improving the performance and stability of your multi-LLM applications.

Who benefits

AI DevelopmentSoftware EngineeringRoboticsAutomation

Key takeaways

  • Context adaptation for multi-LLM agentic systems is challenging due to credit assignment and convergence issues.
  • GTBP uses graph-based target back-propagation to address these challenges.
  • The framework ensures stable prompt updates and improves overall objective reduction.
  • GTBP outperforms baselines while maintaining computational efficiency.

Original post by Tan Zhu, Tong Yao, Kananart Kuwaranancharoen, Amit Singh, Yushang Lai, Deepa Mohan, Shankara Bhargava

"arXiv:2606.14155v1 Announce Type: new Abstract: Context adaptation automates prompt engineering in LLM-based systems by iteratively revising tunable prompts from task feedback, without modifying model weights. Extending this paradigm to multi-LLM agentic systems is crucial: exist…"

View on X

Originally posted by Tan Zhu, Tong Yao, Kananart Kuwaranancharoen, Amit Singh, Yushang Lai, Deepa Mohan, Shankara Bhargava on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses