HASTE System Boosts ML Engineering Efficiency with Skill Transfer.

Yongbin Kim, Yashar Talebirad, Osmar R. Zaiane· July 1, 2026 View original

▶ The 2-minute explainer

Summary

HASTE is a hierarchical multi-agent system that improves ML engineering efficiency by organizing and transferring knowledge across different competitions. It uses a three-tier skill inventory (global, domain, competition-specific) and LLM-driven abstraction to reduce compute waste and achieve higher success rates.

This paper introduces HASTE (Hierarchical Accumulation of Skills for Transfer-Efficient ML Engineering), a multi-agent system designed to prevent ML engineering agents from repeatedly "rediscovering" solutions in new competitions. The core idea is to efficiently organize and transfer knowledge by structuring it into three distinct scope tiers: global, domain-specific, and competition-specific. An orchestrator agent manages these tiers, coordinating domain specialists and facilitating knowledge abstraction and promotion between levels using Large Language Models (LLMs). A controlled experiment demonstrated the significant benefits of this scoped knowledge loading. With a fixed inventory of 159 skills across 8 competitions, HASTE achieved a 100% medal rate, whereas a flat loading approach (without hierarchical organization) only reached 62.5%, performing no better than loading no skills at all, and consumed twice the computational tokens. On the comprehensive MLE-Bench Lite benchmark, which includes 22 Kaggle competitions, HASTE achieved a 77.3% medal rate using Claude Sonnet 4.6, with each competition taking approximately 12 hours. The system showed remarkable efficiency gains in "warm-start" runs, where it leveraged previously learned global and domain-level skills. Warm starts required 52% fewer refinement iterations, and the fraction of proposed changes accepted by the agent dramatically increased from 42% with low skill inventory to 85% once over 50 skills were available. These findings suggest that intelligent knowledge organization can effectively substitute for raw model strength and computational budget in ML engineering tasks.

Why it matters

For ML engineering teams, HASTE offers a paradigm shift in how AI agents can be used for development, enabling faster iteration, reduced computational costs, and higher success rates by systematically leveraging accumulated knowledge. This is critical for scaling ML operations and tackling diverse problems efficiently.

How to implement this in your domain

  1. 1Explore implementing hierarchical knowledge management systems for ML development.
  2. 2Design multi-agent workflows for automating repetitive ML engineering tasks.
  3. 3Investigate using LLMs for abstracting and transferring skills across different projects.
  4. 4Benchmark the efficiency gains of "warm-start" vs. "cold-start" approaches in ML projects.
  5. 5Develop a shared knowledge base or "skill inventory" for ML engineering best practices.

Who benefits

AI/ML DevelopmentSoftware EngineeringData ScienceResearch & DevelopmentConsulting

Key takeaways

  • HASTE is a multi-agent system for efficient ML engineering skill transfer.
  • It uses a three-tier hierarchical knowledge organization (global, domain, competition).
  • Hierarchical loading significantly improves success rates and reduces compute.
  • Warm-start runs with accumulated skills are much more efficient than cold starts.

Original post by Yongbin Kim, Yashar Talebirad, Osmar R. Zaiane

"arXiv:2606.30911v1 Announce Type: new Abstract: ML engineering agents waste compute rediscovering known techniques because every competition is a cold start. We present HASTE, a hierarchical multi-agent system that organizes cross-competition knowledge into three scope tiers (glo…"

View on X

Originally posted by Yongbin Kim, Yashar Talebirad, Osmar R. Zaiane on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

More in AI Engineering & DevTools

AI ResearchAI Engineering & DevTools

Philosophical Foundations for Explainable AI in Healthcare Explored

This paper critically reviews the intersection of philosophy of science and explainable AI (XAI) in health sciences, examining what constitutes an adequate medical explanation. It identifies causality, trust, and epistemic adequacy as central axes for designing robust XAI systems in clinical decision-making.

Martina Mattioli, Marcello PelilloJul 1, 2026
AI ResearchAI Engineering & DevTools

New Metric Improves LLM Reinforcement Learning with Verifiable Rewards.

This research introduces the Relative Surprisal Index (RSI), an information-theoretic metric for adaptive token selection in Reinforcement Learning with Verifiable Rewards (RLVR) for LLMs. RSI-S, an entropy-adaptive filtering method based on RSI, improves reasoning accuracy by 2-3 percentage points by retaining tokens within a stable surprisal interval.

Outongyi Lv, Yanzhao Zheng, Yuanwei Zhang, Zhenghao Huang, Xingjun Wang, Baohua Dong, Hangcheng Zhu, Yingda ChenJul 1, 2026
AI Engineering & DevToolsAI Research

New ACE Module Boosts LLM Agent Context Management

Researchers introduce ACE (Adaptive Context Elasticizer), a plug-and-play module that dynamically manages historical information for LLM-based agents. ACE maintains a lossless message layer and adaptively orchestrates context, significantly improving performance across various agent frameworks without architectural changes.

Ning Liao, Zihao Long, Xiaoxing Wang, Xue Yang, Yaoming Wang, Ziyuan Zhuang, Xunliang Cai, Rongxiang Weng, Junchi YanJul 1, 2026