Valdi: Value Diffusion World Models for MPC

Christopher Lindenberg, Kashyap Chitta · July 2, 2026

Summary

Valdi introduces Value Diffusion World Models, combining end-to-end online training for Model Predictive Control (MPC) with a latent diffusion dynamics model. Preliminary experiments show that Valdi, using a single diffusion step, matches deterministic MLP baselines in the CarRacing environment, highlighting a trade-off between predictive multimodality and control performance.

World models can enable Model Predictive Control (MPC), but only if their dynamics predictions are both fast enough for real-time online use and expressive enough to capture uncertain future states. Diffusion models are a natural fit for modeling uncertainty and multimodality, yet their iterative inference process typically makes them too slow for low-latency latent planning in MPC.

Value Diffusion World Models, or Valdi, target this gap by combining end-to-end online training for MPC with a latent diffusion dynamics model. The approach makes diffusion practical for control by cutting the number of diffusion steps, down to a single step in the reported experiments. In preliminary experiments in the CarRacing environment, Valdi with a single diffusion step during both training and inference achieved performance comparable to a deterministic MLP baseline. The study also highlights a trade-off observed in this setup: the model's ability to represent multiple possible future outcomes (predictive multimodality) has to be balanced against its direct control performance.
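To make the idea of planning with a single-step latent dynamics model concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the one-dimensional latent, the `denoise_step`, `reward`, and `value` functions, and the random-shooting planner are all illustrative assumptions, and the planner used in the paper may differ. The point is only that each imagined step costs one model call when the sampler uses a single diffusion step.

```python
import random

def denoise_step(z, a, noise):
    """Hypothetical one-step denoiser: maps (latent, action, noise) to the next latent.
    A trained network would go here; this toy uses linear dynamics plus scaled noise."""
    return 0.9 * z + a + 0.05 * noise

def reward(z, a):
    # Toy reward (assumption): stay near the origin while using small actions.
    return -(z * z) - 0.1 * (a * a)

def value(z):
    # Toy terminal value standing in for a learned value function (assumption).
    return -5.0 * z * z

def plan(z0, horizon=5, num_candidates=64, rng=None):
    """Random-shooting MPC: sample action sequences, roll each out with one
    denoising call per step, score by summed rewards plus terminal value, and
    return the first action of the best-scoring sequence."""
    rng = rng or random.Random(0)
    best_score, best_action = float("-inf"), 0.0
    for _ in range(num_candidates):
        actions = [rng.uniform(-1.0, 1.0) for _ in range(horizon)]
        z, score = z0, 0.0
        for a in actions:
            score += reward(z, a)
            # One model call per imagined step: fresh noise in, next latent out.
            z = denoise_step(z, a, rng.gauss(0.0, 1.0))
        score += value(z)
        if score > best_score:
            best_score, best_action = score, actions[0]
    return best_action

first_action = plan(z0=1.0)
print(f"first planned action: {first_action:.3f}")
```

In a receding-horizon loop, only `first_action` would be executed before replanning from the next observed latent. With a multi-step sampler, the inner call would run several times per imagined step, which is where the latency cost of diffusion comes from.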

Why it matters

Professionals in robotics, autonomous systems, and reinforcement learning can leverage Valdi to develop more robust and adaptable control systems that can handle uncertainty more effectively, potentially leading to safer and more efficient autonomous agents.

How to implement this in your domain

1. Explore integrating Value Diffusion World Models (Valdi) into existing Model Predictive Control (MPC) frameworks for robotics or autonomous systems.
2. Investigate the trade-off between predictive multimodality and control performance when designing diffusion-based world models.
3. Benchmark Valdi's single-step diffusion inference against traditional deterministic dynamics models for real-time control applications.
4. Adapt Valdi's online training methodology for specific control tasks requiring rapid model updates and uncertainty handling.
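The benchmarking step above can be prototyped before any model is trained. The sketch below is a hypothetical timing harness, not code from the paper: `deterministic_model` and `diffusion_model` are toy stand-ins, and the step counts 1, 4, and 16 are arbitrary choices. It shows how per-prediction latency might be compared as the number of diffusion steps grows.

```python
import random
import time

def deterministic_model(z, a):
    # Stand-in for an MLP dynamics model: one function evaluation per prediction.
    return 0.9 * z + a

def diffusion_model(z, a, num_steps, rng):
    """Stand-in for a diffusion dynamics model: start from noise and apply
    num_steps refinement calls, each representing one network evaluation."""
    x = rng.gauss(0.0, 1.0)
    target = 0.9 * z + a
    for _ in range(num_steps):
        x = x + 0.5 * (target - x)  # toy refinement toward the prediction
    return x

def mean_latency(fn, repeats=20000):
    # Average wall-clock seconds per call over `repeats` calls.
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    return (time.perf_counter() - start) / repeats

rng = random.Random(0)
results = {"deterministic": mean_latency(lambda: deterministic_model(0.5, 0.1))}
for steps in (1, 4, 16):
    results[f"diffusion_{steps}_steps"] = mean_latency(
        lambda: diffusion_model(0.5, 0.1, steps, rng)
    )

for name, seconds in results.items():
    print(f"{name}: {seconds * 1e6:.2f} microseconds per call")
```

Swapping the stand-ins for real models gives the latency side of the comparison; control performance would have to be measured separately in the target environment.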

Who benefits

Robotics, Autonomous Vehicles, Industrial Automation, Gaming, Logistics

Key takeaways

Valdi aims to make diffusion models viable for low-latency Model Predictive Control.
It combines end-to-end online MPC training with a latent diffusion dynamics model.
In preliminary CarRacing experiments, single-step diffusion matched a deterministic MLP baseline.
There is a trade-off between predictive multimodality and direct control performance.
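What predictive multimodality means can be shown with a toy example. This is an illustration under stated assumptions, not an experiment from the paper: the future is assumed to be bimodal (a state near -1 or near +1 with equal probability), and the deterministic model is represented by the sample mean, since a point prediction trained with squared error converges to the mean of its targets.

```python
import random
import statistics

rng = random.Random(0)

def true_next_state(rng):
    # Toy bimodal future (assumption): the state lands near -1 or near +1.
    mode = -1.0 if rng.random() < 0.5 else 1.0
    return mode + rng.gauss(0.0, 0.05)

samples = [true_next_state(rng) for _ in range(2000)]

# A deterministic model trained with squared error converges to the mean,
# which here lies between the two modes where no real outcome occurs.
deterministic_prediction = statistics.fmean(samples)

# A sampling model can instead return draws that stay close to one mode.
left_mode = statistics.fmean(s for s in samples if s < 0)
right_mode = statistics.fmean(s for s in samples if s >= 0)

print(f"deterministic prediction: {deterministic_prediction:.3f}")
print(f"sampled modes: {left_mode:.3f} and {right_mode:.3f}")
```

The averaged prediction sits near zero, far from either real outcome, while samples preserve both modes. Whether preserving them helps or hurts control is the trade-off the study reports.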

Original post by Christopher Lindenberg, Kashyap Chitta

"arXiv:2607.00917v1 Announce Type: new Abstract: World models can enable Model Predictive Control (MPC), but this requires dynamics prediction that is both fast enough for online use and expressive enough to represent uncertain futures. Diffusion models offer a natural mechanism f…"

