New Noise Schedule Improves Diffusion Model Performance on Imbalanced Data.

Jiequan Cui, Beier Zhu, Qingshan Xu, Xiaojuan Qi, Bei Yu, Hanwang Zhang· June 29, 2026 View original

Summary

This paper introduces the Class-frequency Guided (CFRG) noise schedule for diffusion models, which improves generation quality and diversity for low-frequency classes in imbalanced datasets. By assigning larger-scale noises to less frequent classes, the method addresses issues of inaccurate score estimation and class dominance.

Researchers have identified a critical issue in score-based generative diffusion models when dealing with imbalanced datasets: low-frequency classes often suffer from poor generation quality and limited diversity. This occurs because low-density regions associated with these classes lead to inaccurate score estimations, and high-frequency classes tend to dominate the model's score space. To counteract this, the team proposes the Class-frequency Guided (CFRG) noise schedule. This novel approach strategically assigns larger-scale noises to low-frequency classes during the diffusion process. Experiments across various tasks, including image and text-to-image generation on imbalanced datasets like CIFAR-100-LT and ImageNet-LT, demonstrate substantial improvements, highlighting the importance of frequency statistics in noise schedule design.

Why it matters

Professionals working with generative AI, especially on real-world datasets that are often imbalanced, can achieve significantly better and more diverse outputs for underrepresented categories.

How to implement this in your domain

  1. 1Analyze your current diffusion model's performance on imbalanced datasets, paying attention to low-frequency class generation quality.
  2. 2Experiment with implementing a class-frequency guided noise schedule in your diffusion model training pipeline.
  3. 3Evaluate the impact of different noise scaling strategies for underrepresented classes on generation diversity and quality.
  4. 4Consider fine-tuning existing diffusion models with the CFRG schedule to improve their performance on specific long-tail data.
  5. 5Document the improvements and challenges encountered to inform future generative AI projects.

Who benefits

MarketingEntertainmentE-commerceHealthcareDesign

Key takeaways

  • Diffusion models struggle with imbalanced datasets, leading to poor generation for low-frequency classes.
  • The Class-frequency Guided (CFRG) noise schedule assigns larger noise to low-frequency classes.
  • This method significantly improves generation quality and diversity for underrepresented categories.
  • Frequency statistics are crucial for optimizing noise schedules in generative AI.

Original post by Jiequan Cui, Beier Zhu, Qingshan Xu, Xiaojuan Qi, Bei Yu, Hanwang Zhang

"arXiv:2606.27696v1 Announce Type: cross Abstract: In this paper, we are the first to examine the correlations between class frequency and the multi-scale noise schedule within diffusion models. For score-based generative models, low-density regions often lead to inaccurately esti…"

View on X

Originally posted by Jiequan Cui, Beier Zhu, Qingshan Xu, Xiaojuan Qi, Bei Yu, Hanwang Zhang on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses