Fora Protects LLM Capabilities During Fine-Tuning

Rui Zhou, Tianci Xie· July 1, 2026 View original

Summary

Researchers introduce FORA (Function-space Orthogonal Residual Adaptation), a method that protects existing large language model capabilities during fine-tuning by directly identifying and preserving activation subspaces crucial for those capabilities. This approach outperforms weight-space projection and standard regularization, showing improved preservation with minimal new-task trade-off.

A new fine-tuning technique called FORA (Function-space Orthogonal Residual Adaptation) has been developed to prevent large language models (LLMs) from losing previously acquired capabilities when adapted to new tasks. Unlike existing methods that rely on proxies like parameter distances or weight singular directions, FORA directly identifies and protects the specific activation subspaces within the model that are essential for a given capability. This is achieved by estimating the principal directions of input-activation covariance per layer using label-free calibration inputs. FORA constructs a high-capacity branch for new task learning that is structurally barred from interfering with capability-relevant function directions, complemented by a narrow spectral channel for controlled plasticity. This construction can also be extended for parameter-efficient adaptation. Across various tasks, including preserving translation while learning COGS and GSM8K, FORA consistently demonstrated superior capability preservation compared to weight-space projection and standard regularization, with only a minor trade-off in new-task performance in some settings. The key advantage stems from projecting onto capability-derived directions rather than weight-derived ones.

Why it matters

Professionals can fine-tune LLMs for specific tasks without sacrificing their broad foundational knowledge and capabilities, leading to more versatile and cost-effective AI deployments.

How to implement this in your domain

  1. 1Evaluate current LLM fine-tuning pipelines for potential capability erosion.
  2. 2Research and experiment with function-space protection techniques like FORA for critical LLM applications.
  3. 3Develop a strategy for identifying and preserving core capabilities during model adaptation.
  4. 4Integrate capability-preserving fine-tuning methods into MLOps workflows for LLM deployment.

Who benefits

Software DevelopmentAI DevelopmentMarketingCustomer ServiceContent Creation

Key takeaways

  • FORA protects LLM capabilities during fine-tuning by preserving function-space activation directions.
  • It outperforms weight-space projection and standard regularization.
  • The method uses label-free calibration inputs to estimate capability-relevant subspaces.
  • FORA enables more versatile and robust LLM adaptation with minimal capability erosion.

Original post by Rui Zhou, Tianci Xie

"arXiv:2606.31092v1 Announce Type: new Abstract: Full fine-tuning adapts large language models to new tasks but can erode capabilities they already possess. Existing remedies protect through proxies such as parameter distances, importance penalties, output matching, or dominant si…"

View on X

Originally posted by Rui Zhou, Tianci Xie on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses