PACT Improves Model Merging by Preserving Core Task Knowledge

Ningyuan Shi, Zhipeng Zhou, Hao Wang, Chunyan Miao, Peilin Zhao· June 18, 2026 View original

Summary

A new method called PACT enhances model merging by addressing the limitation of existing task-vector-based approaches. It identifies and preserves "Load-Bearing Wall" dimensions—task-critical knowledge embedded in pre-trained weights—before merging, leading to improved multi-task model performance.

A new technique, PACT (Preserving Anchored Cores in Task-vectors), has been developed to improve the effectiveness of model merging, a training-free alternative to multi-task learning. Current model merging methods, often based on Task Arithmetic, assume that all task-specific knowledge resides solely within "task vectors" derived from fine-tuned models. However, this research argues that this assumption is flawed, as some critical task knowledge, termed "Load-Bearing Wall (LBW) dimensions," remains embedded within the pre-trained model's weights. Ignoring these LBW dimensions can lead to task conflicts and degradation during the merging process. PACT addresses this by identifying and preserving these anchored task-specific cores within task vectors. It aligns their orthogonal complements with the subspace of pre-trained weights, then removes these aligned components from the task vectors before applying standard merging algorithms. This approach consistently enhances mainstream model merging methods and achieves new state-of-the-art performance across various benchmarks. An efficient variant using randomized SVD is also proposed for scalability.

Why it matters

AI engineers and researchers can leverage PACT to more effectively combine multiple specialized AI models into a single, high-performing multi-task model, reducing training costs and improving efficiency in complex AI systems.

How to implement this in your domain

  1. 1Integrate PACT into existing model merging pipelines to enhance the performance of multi-task models.
  2. 2Apply PACT when combining fine-tuned models to ensure critical task-specific knowledge is retained.
  3. 3Experiment with PACT in scenarios where multiple specialized models need to operate cohesively.
  4. 4Utilize the randomized SVD variant for improved scalability when dealing with large models.

Who benefits

AI DevelopmentSoftware EngineeringCloud ComputingRoboticsAutonomous Systems

Key takeaways

  • Model merging can be significantly improved by preserving "Load-Bearing Wall" dimensions.
  • PACT addresses limitations of existing task-vector-based merging approaches.
  • The method prevents degradation and resolves task conflicts in multi-task models.
  • PACT offers a scalable solution for combining specialized AI models efficiently.

Original post by Ningyuan Shi, Zhipeng Zhou, Hao Wang, Chunyan Miao, Peilin Zhao

"arXiv:2606.18627v1 Announce Type: new Abstract: Model merging has emerged as a training-free alternative to multi-task learning, aiming to combine multiple task-specific fine-tuned models into a single multi-task model. Most existing model merging approaches follow the Task Arith…"

View on X

Originally posted by Ningyuan Shi, Zhipeng Zhou, Hao Wang, Chunyan Miao, Peilin Zhao on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses