EvoOptiGraph Coevolves LLMs and Data for Optimization Modeling.

Qingcan Kang, Mingyang Liu, Xiaojin Fu, Shixiong Kai, Tao Zhong, Mingxuan Yuan· June 26, 2026 View original

Summary

This paper introduces EvoOptiGraph, a framework that co-evolves data and large language models (LLMs) to improve optimization modeling from natural language. It generates structurally diverse mixed-integer linear program instances using graph-based evolutionary operators and guides model training with weakness signals, significantly outperforming existing methods.

Automating the creation of optimization models from natural language using large language models (LLMs) faces challenges due to limited structural diversity in training data and static data generation pipelines. To overcome this, researchers propose EvoOptiGraph, a novel framework where both the training data and the LLM itself co-evolve, driven by the model's identified weaknesses. EvoOptiGraph represents mixed-integer linear programs (MILPs) as attributed bipartite graphs. It then applies validity-preserving evolutionary operators to these graphs to generate a wide array of structurally diverse instances. These evolved graphs are converted into solver code and natural language descriptions through deterministic compilation and verified back-translation. The training process involves two stages: initial supervised fine-tuning (SFT) on a baseline dataset, followed by reinforcement learning with verifiable rewards (RLVR). During RLVR, signals indicating the model's failures guide the generation of new, targeted instances, creating a continuous feedback loop that updates the training distribution. This approach has shown significant improvements in accuracy, executability, and generalization compared to other models.

Why it matters

Professionals developing AI for complex problem-solving, especially in operations research and engineering, can leverage this co-evolutionary approach to build more robust and accurate optimization models from natural language.

How to implement this in your domain

  1. 1Explore graph-based representations for complex problem structures in your domain.
  2. 2Implement evolutionary algorithms to generate diverse problem instances based on these graph representations.
  3. 3Integrate a feedback loop where model weaknesses guide the creation of new training data.
  4. 4Apply reinforcement learning with verifiable rewards to continuously refine LLM performance on specific tasks.
  5. 5Evaluate the co-evolutionary approach against traditional supervised learning for optimization modeling tasks.

Who benefits

Operations ResearchManufacturingLogisticsSupply ChainEngineering

Key takeaways

  • Co-evolution of data and models, driven by weaknesses, improves LLM performance in optimization.
  • Graph-based structural generation creates diverse and challenging training instances.
  • Reinforcement learning with verifiable rewards guides targeted data generation.
  • EvoOptiGraph significantly enhances accuracy, executability, and generalization for optimization modeling.

Original post by Qingcan Kang, Mingyang Liu, Xiaojin Fu, Shixiong Kai, Tao Zhong, Mingxuan Yuan

"arXiv:2606.26578v1 Announce Type: new Abstract: Automating optimization modeling from natural language with large language models (LLMs) faces two key challenges. First, training corpora lack structural diversity. Second, data generation pipelines remain static and decoupled from…"

View on X

Originally posted by Qingcan Kang, Mingyang Liu, Xiaojin Fu, Shixiong Kai, Tao Zhong, Mingxuan Yuan on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses