ReGuide Improves Robot Policies with Self-Correcting Diffusion

Tzu-Hsiang Lin, Srinivas Shakkottai, Dileep Kalathil, P. R. Kumar· June 30, 2026 View original

Summary

ReGuide is a new framework that enhances behavior-cloned diffusion policies by using guided test-time rollouts as reusable on-policy recovery data, significantly improving success rates in robotic tasks. It addresses covariate shift by generating corrective trajectories and fine-tuning the policy.

Behavior-cloned diffusion policies, while expressive, often struggle with covariate shift, where small deviations from trained states can lead to task failures in robotics. Existing solutions either expand training data, which can be costly, or use test-time guidance, which discards valuable corrective information after execution. This research introduces ReGuide, a novel self-improving framework designed to overcome these limitations. ReGuide leverages Phase-Conditioned Guidance (PCG) to generate corrective rollouts. It identifies drifted-but-recoverable states and applies guidance to steer the policy back towards successful trajectories, ensuring the corrected actions align with the dynamics model's training distribution. Crucially, these successful guided rollouts are not discarded but are absorbed back into the policy. The framework offers two methods for integrating this new data: ReGuide-FT, which fine-tunes the current policy checkpoint, and ReGuide-FS, which retrains from scratch on the augmented dataset. These methods can also be combined and iterated. Experiments on various Robomimic tasks showed ReGuide improving base-policy success rates by 1.3 to 7.7 times, demonstrating its effectiveness in creating more robust and adaptive robotic policies.

Why it matters

Robotics and automation professionals can utilize ReGuide to develop more robust and adaptive AI policies for complex tasks, reducing failures caused by unexpected environmental variations and improving overall system reliability. This could accelerate the deployment of autonomous systems in real-world scenarios.

How to implement this in your domain

  1. 1Investigate integrating self-improving policy frameworks like ReGuide into existing robotic control systems.
  2. 2Design experiments to generate diverse "recovery data" from guided test-time interactions for policy refinement.
  3. 3Implement mechanisms for continuous fine-tuning or retraining of diffusion policies using newly acquired successful recovery trajectories.
  4. 4Evaluate the robustness of robotic systems to covariate shift by testing them in varied and challenging environments.
  5. 5Collaborate with AI researchers to adapt and apply advanced diffusion policy techniques to specific industrial automation challenges.

Who benefits

RoboticsManufacturingLogisticsAutomotiveAerospace

Key takeaways

  • ReGuide improves robotic policy robustness against covariate shift.
  • It reuses guided test-time rollouts as valuable recovery data.
  • The framework significantly boosts success rates in various robotic tasks.
  • This approach enables self-improving and adaptive autonomous systems.

Original post by Tzu-Hsiang Lin, Srinivas Shakkottai, Dileep Kalathil, P. R. Kumar

"arXiv:2606.28939v1 Announce Type: new Abstract: Behavior-cloned diffusion policies are expressive but remain vulnerable to covariate shift: small deviations from demonstrated states can compound into task failure. Existing methods address this either by expanding the training dis…"

View on X

Originally posted by Tzu-Hsiang Lin, Srinivas Shakkottai, Dileep Kalathil, P. R. Kumar on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses