DRIFT Refines LLM Training Data for Enhanced Capability

Zefan Wang, Lincheng Li, Tianyu Yu, Yuan Yao· June 18, 2026 View original

Summary

This paper introduces DRIFT, a method for refining instruction data for Supervised Fine-Tuning (SFT) of Large Language Models (LLMs) using on-policy data attribution. It addresses limitations of standard Influence Functions (IF) by minimizing proximity gaps and debiasing influence scores, leading to improved model performance.

Optimizing the distribution of training data is crucial for enhancing the capabilities of Large Language Models (LLMs through Supervised Fine-Tuning (SFT). While existing data curation methods effectively accelerate training under budget constraints, they often fall short in elevating the model's ultimate performance ceiling. The core challenge lies in identifying and refining data instances that are most effective at improving the final model, rather than just preserving current performance. This research explores instance-level data attribution using Influence Functions (IF) to tackle this problem. The authors identify two key limitations in standard IF formulations: a "proximity gap" arising from off-policy validation targets and a significant bias towards gradient norm. To overcome these, they propose DRIFT (Data Refinement via On-Policy Influence Functions for Supervised Fine-Tuning). DRIFT uses the model's own on-policy rollouts as validation targets, which empirically reduces the parameter proximity gap and better aligns with IF's local neighborhood assumption. It also applies signed weighting based on trajectory correctness and debiases influence scores, allowing a small set of validation queries to reliably attribute the entire dataset. Experiments on 7B-parameter instruction and reasoning models demonstrate that DRIFT consistently improves performance, surpassing existing data curation baselines.

Why it matters

For AI developers and researchers, DRIFT offers a powerful technique to significantly improve the performance of LLMs by intelligently curating training data. This can lead to more capable and robust models, especially for instruction-following and reasoning tasks, making the fine-tuning process more effective.

How to implement this in your domain

  1. 1Integrate DRIFT into LLM fine-tuning pipelines to optimize instruction data selection.
  2. 2Apply on-policy rollouts as validation targets to reduce the proximity gap in data attribution.
  3. 3Utilize signed weighting and debiasing techniques to improve the reliability of influence scores for data refinement.
  4. 4Experiment with DRIFT to enhance the reasoning and instruction-following capabilities of custom LLMs.

Who benefits

AI ResearchSoftware DevelopmentContent GenerationCustomer ServiceEducation

Key takeaways

  • Optimizing SFT data distribution is key to maximizing LLM capabilities.
  • DRIFT refines instruction data using on-policy data attribution via Influence Functions.
  • It addresses IF limitations by minimizing proximity gaps and debiasing scores.
  • DRIFT consistently raises the performance ceiling for instruction and reasoning models.

Original post by Zefan Wang, Lincheng Li, Tianyu Yu, Yuan Yao

"arXiv:2606.18307v1 Announce Type: cross Abstract: Optimizing the training data distribution for Supervised Fine-Tuning (SFT) dictates the capability of Large Language Models (LLMs). While existing data curation methods excel at accelerating training under constrained budgets, the…"

View on X

Originally posted by Zefan Wang, Lincheng Li, Tianyu Yu, Yuan Yao on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses