SSM Adapters Outperform LoRA for Long-Context Fine-Tuning

Omanshu Thapliyal· June 26, 2026 View original

▶ The 2-minute explainer

Summary

This research introduces Hankel Reduced-order Model (HRM) adapters, a new State Space Model (SSM)-based parameter-efficient fine-tuning (PEFT) method that significantly outperforms LoRA variants on long-context tasks. HRM adapters leverage time-invariance for efficient parallel processing and demonstrate superior performance by effectively modulating recurrence.

Parameter-efficient fine-tuning (PEFT) methods typically focus on attention projectors, but their effectiveness for tasks requiring sequential state accumulation, especially with long contexts, remains underexplored. This study investigates whether PEFT can benefit from State Space Model (SSM) adapters and if MLP blocks are better injection sites. The researchers introduce the Hankel Reduced-order Model (HRM) adapter, an SSM-based residual module initialized using Balanced Truncation of empirical Hankel Grammians. HRM adapters leverage the time-invariance of the system matrix to enable an exact FFT-based parallel scan, achieving computational efficiency comparable to LoRA across all context lengths. In evaluations on Mistral-7B with iso-parametric settings, HRM adapters consistently outperformed LoRA variants on LongBench tasks, showing substantial relative accuracy improvements (e.g., +34.8% on QuALITY, +71.6% on QMSum). Further analysis revealed that HRM adapters effectively learn to modulate recurrence, positioning them as a robust architectural alternative to low-rank adaptation for long-context sequence modeling.

Why it matters

AI engineers and researchers working with large language models on long-context tasks can adopt HRM adapters to achieve superior performance and efficiency compared to traditional LoRA methods, enabling more capable and cost-effective fine-tuning.

How to implement this in your domain

  1. 1Experiment with HRM adapters as an alternative to LoRA for fine-tuning large language models on long-context tasks.
  2. 2Evaluate the performance of HRM adapters on specific long-sequence data processing or generation tasks.
  3. 3Investigate the optimal injection sites for SSM adapters within existing transformer architectures.
  4. 4Integrate HRM adapters into your PEFT toolkit for improved efficiency and accuracy in long-context scenarios.
  5. 5Contribute to the development of open-source implementations and benchmarks for SSM-based adapters.

Who benefits

Natural Language ProcessingAI EngineeringContent GenerationLegal TechResearch & Development

Key takeaways

  • Hankel Reduced-order Model (HRM) adapters are a new, efficient PEFT method.
  • HRM adapters significantly outperform LoRA variants on long-context tasks.
  • They achieve computational parity with LoRA through FFT-based parallel scans.
  • HRM adapters offer a robust alternative for long-context sequence modeling by modulating recurrence.

Original post by Omanshu Thapliyal

"arXiv:2606.26290v1 Announce Type: new Abstract: While parameter-efficient fine-tuning (PEFT) typically targets attention projectors, its efficacy for tasks requiring sequential state accumulation remains under-explored. We examine if PEFT for such tasks can benefit from state spa…"

View on X

Originally posted by Omanshu Thapliyal on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses