EMA-FS Accelerates GBDT Training with Gain-Informed Feature Screening
Summary
EMA-based Feature Screening (EMA-FS) is an algorithm-level optimization that significantly accelerates Gradient Boosted Decision Tree (GBDT) training, like LightGBM, by using an exponential moving average of per-feature split gains to screen out low-utility features during histogram construction. This informed approach outperforms random feature subsampling, offering substantial speedups with improved or maintained accuracy.
Why it matters
Accelerating GBDT training without sacrificing accuracy is crucial for data scientists and machine learning engineers, especially in industries dealing with large, high-dimensional datasets where model training time is a bottleneck. EMA-FS offers a practical, compatible, and effective solution for faster model development and deployment.
How to implement this in your domain
- 1Integrate EMA-FS into your GBDT training workflows, particularly when using LightGBM.
- 2Experiment with different K values (number of top features retained) and retention rates to balance speed and accuracy.
- 3Consider applying the stochastic variant (S-EMA-FS) for more flexible control over feature selection.
- 4Evaluate the performance gains on your specific high-dimensional datasets, especially in fraud detection or advertising.
Who benefits
Key takeaways
- EMA-FS significantly accelerates GBDT training by screening features based on their historical gain.
- It outperforms random feature subsampling, retaining high-utility features for histogram construction.
- The method is compatible with LightGBM and offers substantial speedups, sometimes with improved accuracy.
- This optimization is highly valuable for data scientists working with large, high-dimensional datasets.
Original post by Yan Song
"arXiv:2606.26337v1 Announce Type: new Abstract: Gradient Boosted Decision Trees (GBDT), exemplified by LightGBM, spend a dominant fraction of training time -- typically 65-70% -- constructing per-feature histograms. Existing approaches such as random feature subsampling (feature_…"
View on XOriginally posted by Yan Song on X · view source
Want to go deeper?
Turn these trends into skills with Learnijoy's hands-on AI & tech courses.
Explore coursesMore in AI Engineering & DevTools
AI-Powered Development Workflow Integrates Multiple Models
A new development workflow leverages various AI models like Grok 4.3, GPT-5.5, and Opus 4.8 for distinct stages including research, planning, coding, testing, and debugging. This structured approach aims to optimize the software development lifecycle.

Proposing AI Usage Transparency for Credible Commentary
The author suggests a requirement for individuals and organizations to publish their percentage of frontier AI usage at work and personal usage. This transparency would establish credibility before commenting on AI's utility.
MCP and A2A Protocols Standardize Agentic Internet Development
The Model Context Protocol (MCP) and Agent-to-Agent (A2A) Protocol are standardizing how AI agents discover tools, call services, and coordinate across systems. Understanding these protocols is crucial for developers building agent-compatible infrastructure.