Diffusion Language Models Encode Latent Denoising Progress

Maximo Rulli (Sapienza University of Rome), Thomas Fontanari (Sapienza University of Rome), Simone Petruzzi (Sapienza University of Rome), Federico Alvetreti (Sapienza University of Rome), Giorgio Strano (Sapienza University of Rome), Donato Crisostomi (Sapienza University of Rome), Giorgos Nikolaou (EPFL), Tommaso Mencattini (EPFL), Andrea Santilli (Independent researcher), Emanuele Rodol\`a (Sapienza University of Rome), Simone Scardapane (Sapienza University of Rome), Alessio Devoto (Independent researcher)· July 3, 2026 View original

▶ The 2-minute explainer

Summary

This research shows that Diffusion Language Models (DLMs) internally represent a latent "timestep" related to denoising progress within their residual streams, even without explicit conditioning. This signal can be extracted, steered to modulate model confidence, and exhibits structured properties, shedding light on DLM internal mechanisms.

Diffusion Language Models (DLMs) are emerging as a promising alternative to traditional autoregressive models. Unlike standard diffusion models that are explicitly conditioned on a timestep to guide their denoising process, DLMs do not have this explicit conditioning. This raises a fundamental question about whether these models implicitly represent their denoising progress and how such information might be utilized internally. This research demonstrates that DLMs do indeed encode a latent representation of the diffusion timestep within their internal residual streams. This "subliminal clock" signal can be reliably extracted using probing techniques across different layers of the model, indicating that the denoising progress is decodable from the model's internal activations. Furthermore, by steering the model along a specific low-dimensional subspace associated with this inferred timestep, researchers found they could systematically modulate the model's confidence and entropy in its outputs. The study also analyzed the geometric properties of this identified representation, revealing that it exhibits structured and interpretable characteristics within the activation space. These findings provide crucial insights into the internal workings of DLMs, explaining how they manage and process information related to their generative process, even without explicit external guidance. Understanding this latent time modeling can inform future advancements in DLM design and control.

Why it matters

For AI researchers and engineers working on generative models, understanding how DLMs implicitly manage denoising progress offers critical insights into their internal mechanisms, potentially leading to more controllable and efficient model architectures.

How to implement this in your domain

  1. 1Investigate latent time modeling in your own diffusion-based generative models to understand internal dynamics.
  2. 2Develop probing techniques to extract and analyze implicit signals within model activations.
  3. 3Explore methods for steering latent representations to control model behavior, confidence, or output characteristics.
  4. 4Apply insights from latent time modeling to design more efficient or interpretable diffusion language models.
  5. 5Contribute to research on the interpretability of generative AI models to uncover hidden mechanisms.

Who benefits

AI ResearchGenerative AI DevelopmentNatural Language ProcessingComputer Vision

Key takeaways

  • Diffusion Language Models implicitly encode denoising progress as a latent "timestep."
  • This latent signal is decodable from internal model activations.
  • Steering this latent representation can modulate model confidence and entropy.
  • The identified representation exhibits structured and interpretable properties.

Original post by Maximo Rulli (Sapienza University of Rome), Thomas Fontanari (Sapienza University of Rome), Simone Petruzzi (Sapienza University of Rome), Federico Alvetreti (Sapienza University of Rome), Giorgio Strano (Sapienza University of Rome), Donato Crisostomi (Sapienza University of Rome), Giorgos Nikolaou (EPFL), Tommaso Mencattini (EPFL), Andrea Santilli (Independent researcher), Emanuele Rodol\`a (Sapienza University of Rome), Simone Scardapane (Sapienza University of Rome), Alessio Devoto (Independent researcher)

"arXiv:2607.01774v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) have recently emerged as a promising alternative to autoregressive models. Unlike standard diffusion-based approaches, DLMs are not explicitly conditioned on a timestep, raising a natural question: d…"

View on X

Originally posted by Maximo Rulli (Sapienza University of Rome), Thomas Fontanari (Sapienza University of Rome), Simone Petruzzi (Sapienza University of Rome), Federico Alvetreti (Sapienza University of Rome), Giorgio Strano (Sapienza University of Rome), Donato Crisostomi (Sapienza University of Rome), Giorgos Nikolaou (EPFL), Tommaso Mencattini (EPFL), Andrea Santilli (Independent researcher), Emanuele Rodol\`a (Sapienza University of Rome), Simone Scardapane (Sapienza University of Rome), Alessio Devoto (Independent researcher) on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

More in AI Research

AI ResearchAI Engineering & DevTools

New Methods for Log-Density-Ratio Estimation in Gaussian Models

This research compares ridge-regularized variational and spectral log-density-ratio estimation in Gaussian location models, deriving high-dimensional asymptotic equivalents to analyze their population risks. It concludes that variational estimators perform better with many observations, while spectral estimators are favored with fewer due to lower variance.

Francis Bach (SIERRA)Jul 3, 2026
AI ResearchAI Engineering & DevTools

Dynamic Support Learning Enhances Reinforcement Learning Value Estimation

This paper introduces an approach that dynamically learns the lower and upper bounds of support intervals for categorical critics in reinforcement learning, improving value function estimation. The method, which forms a tighter upper bound on the mean-squared Bellman error, enhances stability and performance on continuous-control tasks without requiring pre-defined support intervals.

Jen-Yen Chang, Takayuki Osa, Tatsuya HaradaJul 3, 2026
AI Engineering & DevToolsAI Research

Decomposer Recovers Music Programs from Symbolic MIDI Data

Decomposer is a new framework that decompiles symbolic MIDI music into executable Strudel programs, allowing for the recovery of high-level musical instructions. It addresses challenges of low-resource language data and code readability by using synthetic data for fine-tuning and reinforcement learning to optimize both reconstruction faithfulness and code clarity.

Yewon Kim, Apurva Gandhi, David Chung, Graham Neubig, Chris DonahueJul 3, 2026