Two-Stage Fine-Tuning Generates Proteins with Targeted Amino-Acid Composition.

Violeta Basten-Romero, Rub\'en Mu\~noz-Tafalla, Anna Mar\'ia D\'iaz-Rovira, Bertran Miquel-Oliver, Isaac Filella-Merce, V\'ictor Guallar· June 29, 2026 View original

▶ The 2-minute explainer

Summary

This paper proposes a two-stage fine-tuning pipeline using protein language models to generate protein sequences that match specific amino-acid composition profiles while maintaining sequence quality and diversity. The method combines domain-adaptive fine-tuning with iterative reward-weighted reinforcement learning.

Generating protein sequences with precise amino-acid compositions is a significant challenge in biological design, particularly for applications like synthetic feed protein where nutritional value is directly linked to composition. Existing protein language models are powerful priors but lack mechanisms to steer generation towards explicit compositional targets. Researchers have developed a novel two-stage fine-tuning pipeline to address this. The first stage involves domain-adaptive fine-tuning (FT) of a protein language model on a relevant in-domain protein dataset. This initial step brings the average amino-acid composition of generated sequences closer to the desired target. The second stage then employs iterative reward-weighted fine-tuning, utilizing reinforcement learning (RL) and anchoring against the previously fine-tuned model as a stable reference. Evaluations on two distinct amino-acid compositions demonstrate that while the initial FT stage provides a good approximation, the subsequent RL stage is crucial for enforcing specific sequence constraints that FT alone cannot satisfy. The study also explores various design choices for the composition reward term, confirming that this pipeline effectively aligns amino-acid composition without compromising the overall quality or diversity of the generated protein sequences.

Why it matters

Professionals in biotechnology, pharmaceuticals, and agriculture can leverage this method to design novel proteins with precisely tailored properties, accelerating drug discovery, enzyme engineering, and the development of advanced nutritional products.

How to implement this in your domain

  1. 1Identify target amino-acid composition profiles for desired protein functions.
  2. 2Acquire or curate relevant in-domain protein datasets for initial model fine-tuning.
  3. 3Implement a two-stage fine-tuning pipeline using protein language models, incorporating RL for compositional steering.
  4. 4Define and optimize reward functions that accurately reflect the desired amino-acid composition and sequence quality.
  5. 5Validate generated protein sequences for both compositional accuracy and biological plausibility.

Who benefits

BiotechnologyPharmaceuticalsAgricultureFood ScienceMaterials Science

Key takeaways

  • A two-stage fine-tuning approach enables protein language models to generate sequences with targeted amino-acid compositions.
  • Domain-adaptive fine-tuning provides an initial compositional alignment.
  • Reinforcement learning is essential for enforcing precise sequence constraints.
  • The method maintains sequence quality and diversity while achieving compositional accuracy.

Original post by Violeta Basten-Romero, Rub\'en Mu\~noz-Tafalla, Anna Mar\'ia D\'iaz-Rovira, Bertran Miquel-Oliver, Isaac Filella-Merce, V\'ictor Guallar

"arXiv:2606.27939v1 Announce Type: new Abstract: Protein language models are standard priors for biological sequence generation, but steering them toward explicit distributional design targets remains largely unexplored. We study a constrained protein generation problem in which s…"

View on X

Originally posted by Violeta Basten-Romero, Rub\'en Mu\~noz-Tafalla, Anna Mar\'ia D\'iaz-Rovira, Bertran Miquel-Oliver, Isaac Filella-Merce, V\'ictor Guallar on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses