CoVoN Optimizer Improves Variational Continual Learning

Subarnaduti Paul, Yohan Jung, Mohammad Emtiyaz Khan, Siddharth Swaroop, Thomas M\"ollenhoff, Martin Mundt· June 24, 2026 View original

Summary

Researchers introduce Continual IVON (CoVON), a new optimizer for variational continual learning that incorporates fast and slow adaptation mechanisms by merging past posteriors as priors. CoVoN consistently outperforms existing methods in various continual learning tasks, including LLM fine-tuning.

Continual learning, where models adapt to new data without forgetting old knowledge, remains a significant challenge for deep networks. Existing optimizers often lack inherent mechanisms to balance stability (retaining old knowledge) and plasticity (learning new knowledge), a balance inspired by neuroscience. This research proposes a solution within the Variational Continual Learning (VCL) framework, where past posteriors serve as priors for future learning. The key innovation is to introduce "slow adaptation" by merging these past posteriors, which helps to slow down knowledge drift over time. This merged posterior then acts as the prior for "fast-weight updates" in the VCL process. These mechanisms are seamlessly integrated into the IVON optimizer, resulting in the new Continual IVON (CoVON) optimizer. CoVoN maintains a similar form and computational cost to Adam but consistently improves upon existing VCL optimizers. It demonstrates superior performance across domain-incremental learning, continual pre-training, and fine-tuning of large language models, offering a more robust approach to lifelong learning for AI systems.

Why it matters

This advancement provides a more effective and biologically inspired approach to continual learning, enabling AI models to adapt to evolving data streams and tasks without catastrophic forgetting, which is critical for real-world, dynamic applications.

How to implement this in your domain

  1. 1Evaluate CoVoN as an alternative optimizer for your continual learning tasks.
  2. 2Implement the fast and slow adaptation mechanisms in your custom learning pipelines.
  3. 3Apply CoVoN to fine-tune large language models on new, sequential datasets.
  4. 4Benchmark CoVoN's performance against other continual learning strategies in your domain.
  5. 5Consider using CoVoN for AI systems that need to continuously learn and adapt in production environments.

Who benefits

AI/ML DevelopmentRoboticsAutonomous VehiclesHealthcarePersonalized Learning

Key takeaways

  • CoVON is a new optimizer for variational continual learning.
  • It incorporates fast and slow adaptation by merging past posteriors as priors.
  • CoVON consistently outperforms existing VCL optimizers.
  • It improves continual learning performance across various tasks, including LLM fine-tuning.

Original post by Subarnaduti Paul, Yohan Jung, Mohammad Emtiyaz Khan, Siddharth Swaroop, Thomas M\"ollenhoff, Martin Mundt

"arXiv:2606.24007v1 Announce Type: new Abstract: Continual learning remains a major challenge for modern deep networks, partly because commonly used optimizers lack inherent mechanisms for continual adaptation. One such natural mechanism is fast and slow adaptation to balance stab…"

View on X

Originally posted by Subarnaduti Paul, Yohan Jung, Mohammad Emtiyaz Khan, Siddharth Swaroop, Thomas M\"ollenhoff, Martin Mundt on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses