New Framework Interprets Black-Box Neural Combinatorial Optimization

Haocheng Duan, Yuxin Guo, Jieyi Bi, Anqi Xie, Sirui Li, Yining Ma, Cathy Wu· June 19, 2026 View original

Summary

This paper introduces Evolving Programmatic Bottlenecks (EPB), a novel framework for interpreting Neural Combinatorial Optimization (NCO) models by distilling them into human-readable program portfolios. EPB uses an LLM to evolve programs whose action distributions serve as bottlenecks, revealing how NCO policies make dynamic, state-dependent decisions.

Neural Combinatorial Optimization (NCO) models often achieve high performance but are difficult to understand due to their black-box nature, hindering their deployment and diagnostic analysis. Traditional interpretability tools are inadequate for NCO's dynamic and state-dependent decision-making processes. This research addresses this challenge by proposing Evolving Programmatic Bottlenecks (EPB), a framework designed to interpret NCO policies. EPB works by distilling complex NCO models into a collection of human-readable programs. It leverages a Large Language Model (LLM) to autonomously develop these programs, where each program's step-by-step action distribution acts as a bottleneck for interpretation. The framework operates iteratively, using a hybrid gradient descent scheme to update a student router and revise programs, while also dynamically adjusting the program bank's capacity. Experiments demonstrate EPB's effectiveness and broad applicability, showing that the distilled program portfolios can largely match the original NCO performance. The framework also reveals that NCO behavior evolves across different optimization stages and can be approximated by a combination of classic heuristic variants. This work significantly advances interpretable NCO and positions EPB as a valuable tool for understanding sequential decision-making models.

Why it matters

Interpreting black-box NCO models is crucial for building trust, enabling debugging, and facilitating wider adoption in critical applications where understanding the decision-making process is paramount.

How to implement this in your domain

  1. 1Apply EPB to existing NCO models to gain insights into their decision-making logic.
  2. 2Utilize the distilled programmatic portfolios to debug and refine NCO policies.
  3. 3Integrate interpretability frameworks like EPB into the development lifecycle of sequential decision-making AI.
  4. 4Leverage LLMs for automated program generation and revision in AI interpretability tasks.

Who benefits

LogisticsManufacturingSupply ChainFinanceAI Engineering

Key takeaways

  • EPB interprets black-box NCO models by distilling them into human-readable programs.
  • An LLM evolves programs whose action distributions serve as interpretative bottlenecks.
  • The framework reveals how NCO behavior shifts across optimization stages.
  • EPB is a promising tool for interpreting sequential decision-making models.

Original post by Haocheng Duan, Yuxin Guo, Jieyi Bi, Anqi Xie, Sirui Li, Yining Ma, Cathy Wu

"arXiv:2606.19741v1 Announce Type: new Abstract: Neural Combinatorial Optimization (NCO) achieves strong performance, yet its black-box nature remains a key roadblock to deployment and scientific diagnosis. Standard interpretability tools, such as Concept Bottleneck Models (CBMs),…"

View on X

Originally posted by Haocheng Duan, Yuxin Guo, Jieyi Bi, Anqi Xie, Sirui Li, Yining Ma, Cathy Wu on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses