GRASP Enables Memory-Efficient Multi-Source Transfer Learning

Mary Isabelle Wisell, Nicholas Jacobs, Aayush Manandhar, Salimeh Yasaei Sekeh· June 16, 2026 View original

Summary

GRASP (Gradient-Aligned Sequential Parameter Transfer) is a new method for multi-source transfer learning that achieves superior knowledge integration with constant memory consumption. It sequentially merges source models, selectively transfers gradient-aligned parameters, and iteratively fine-tunes, outperforming ensemble methods while scaling to many sources without memory growth.

This paper introduces GRASP (Gradient-Aligned Sequential Parameter Transfer), a novel solution to the scalability challenges in multi-source transfer learning. Existing methods typically require loading all source models simultaneously, leading to memory consumption that scales linearly with the number of sources, or deploying all models at inference time, which is impractical for production. GRASP overcomes these limitations by maintaining constant memory usage while achieving superior knowledge integration. GRASP employs three key innovations: it processes and merges one source model at a time into an evolving target model, ensuring sequential integration. It uses parameter-wise gradient alignment to selectively transfer only those parameters whose optimization directions align with the target domain, effectively preventing negative transfer. Finally, it incorporates iterative fine-tuning to adapt the transferred knowledge before integrating the next source. Extensive experiments across various continual learning benchmarks and architectures demonstrate that GRASP achieves significantly higher accuracy (93.5% mean accuracy) compared to ensemble methods (71.7%) while requiring only constant memory, making it ideal for resource-constrained environments and continuously evolving source domains.

Why it matters

For AI engineers and data scientists dealing with multiple data sources or models, GRASP offers a breakthrough in memory-efficient transfer learning. This enables the integration of knowledge from numerous sources without prohibitive memory costs, facilitating scalable and adaptable AI systems.

How to implement this in your domain

  1. 1Adopt GRASP for multi-source transfer learning scenarios where memory constraints are a concern.
  2. 2Implement sequential model merging and parameter-wise gradient alignment in your transfer learning pipelines.
  3. 3Utilize iterative fine-tuning to adapt transferred knowledge effectively before integrating new sources.
  4. 4Apply GRASP to continually evolving source domains to maintain model performance over time.

Who benefits

AI/ML DevelopmentCloud ComputingEdge ComputingHealthcareFinance

Key takeaways

  • GRASP enables memory-efficient multi-source transfer learning with constant memory consumption.
  • It sequentially integrates knowledge from multiple source models.
  • Gradient alignment prevents negative transfer by selectively transferring parameters.
  • GRASP significantly outperforms traditional ensemble methods in accuracy and scalability.

Original post by Mary Isabelle Wisell, Nicholas Jacobs, Aayush Manandhar, Salimeh Yasaei Sekeh

"arXiv:2606.14900v1 Announce Type: new Abstract: Multi-source transfer learning faces a fundamental scalability bottleneck: existing approaches require either loading all K source models into memory simultaneously during parameter fusion, requiring O(K) memory, or deploying all mo…"

View on X

Originally posted by Mary Isabelle Wisell, Nicholas Jacobs, Aayush Manandhar, Salimeh Yasaei Sekeh on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses