OnDeFog Enhances Online RL Performance in Frame-Dropping Env

OnDeFog Enhances Online RL Performance in Frame-Dropping Environments

Daiki Yotsufuji, Kenta Nishihara, Shoma Shimizu, Kento Uchida, Shinichi Shirakawa· June 19, 2026 View original

Summary

This study introduces OnDeFog, an online reinforcement learning method that combines the frame-dropping resilience of DeFog with the online learning capabilities of the Decision Transformer. OnDeFog demonstrates superior performance in environments with high frame dropping rates and when dealing with datasets containing low-reward data, overcoming the generalization limitations of existing online decision transformers.

In real-world reinforcement learning scenarios, agents often face challenges like communication delays or sensor failures, leading to "frame dropping" where critical state and reward information is lost. While the offline Decision Transformer under Random Frame Dropping (DeFog) was developed to mitigate this, its offline nature limits its ability to generalize to new, unseen states. To address this, researchers have proposed OnDeFog, an innovative online reinforcement learning approach. OnDeFog integrates the robust frame-dropping mechanisms of DeFog with the Online Decision Transformer (ODT), allowing it to learn policies through direct interaction with the environment. Extensive experimental evaluations show that OnDeFog significantly outperforms ODT in environments characterized by high frame dropping rates. Furthermore, it demonstrates superior performance compared to DeFog when trained on datasets rich in low-reward data, highlighting its enhanced adaptability and generalization capabilities in challenging, dynamic settings.

Why it matters

For professionals developing autonomous systems, robotics, or real-time control applications, OnDeFog offers a crucial advancement. It provides a more robust and adaptive reinforcement learning solution that can maintain performance even when faced with unreliable sensor data or communication, which is common in real-world deployments.

How to implement this in your domain

1Consider implementing OnDeFog in reinforcement learning applications where sensor failures or communication delays lead to dropped frames.
2Evaluate OnDeFog's performance in your specific real-world environments, especially those with high data uncertainty.
3Explore integrating online learning capabilities with existing offline decision transformer models to improve adaptability.
4Design robust data collection and training strategies that account for potential frame dropping and low-reward scenarios.

Who benefits

RoboticsAutonomous VehiclesIndustrial AutomationAerospaceTelecommunications

Key takeaways

OnDeFog is an online RL method robust to frame dropping in real-world applications.
It combines DeFog's resilience with the Online Decision Transformer's adaptability.
OnDeFog outperforms existing methods in high frame-dropping and low-reward data scenarios.
This improves generalization and performance for autonomous systems in challenging environments.

Original post by Daiki Yotsufuji, Kenta Nishihara, Shoma Shimizu, Kento Uchida, Shinichi Shirakawa

"arXiv:2606.19721v1 Announce Type: new Abstract: In challenging real-world reinforcement learning applications, communication delays or sensor failures often cause frame dropping, in which the agent cannot receive the dropped states and associated rewards. To address the performan…"

View on X

Originally posted by Daiki Yotsufuji, Kenta Nishihara, Shoma Shimizu, Kento Uchida, Shinichi Shirakawa on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

OnDeFog Enhances Online RL Performance in Frame-Dropping Environments

Why it matters

How to implement this in your domain

Who benefits

Key takeaways

Want to go deeper?

More in AI Engineering & DevTools

AI-Powered Development Workflow Integrates Multiple Models

Proposing AI Usage Transparency for Credible Commentary

MCP and A2A Protocols Standardize Agentic Internet Development