Reinforcement Learning Guides Cellular Navigation in Chemical Networks

Ruyi Tang (LCQB-AG), Gr\'egoire Sergeant-Perthuis (LCQB-AG), David Colliaux· June 26, 2026 View original

Summary

This research proposes a framework linking Partially Observable Markov Decision Processes (POMDPs) with biochemical reaction dynamics to model cellular navigation, reframing phototaxis as an information-driven sensorimotor process. The model, implemented via Chemical-Reaction-Network Ordinary Differential Equations (CRN-ODEs), demonstrates how intracellular networks can support adaptive information-seeking behavior.

This paper introduces a novel framework that integrates reinforcement learning principles into chemical reaction networks to model how living systems navigate complex environments. Specifically, it reinterprets the phototaxis of unicellular algae, traditionally seen as a mechanistic stimulus-response, as a subjective, information-driven sensorimotor process. The core idea links a Partially Observable Markov Decision Process (POMDP) with biochemical reaction dynamics. The model allows a cell to update its internal state based on noisy observations, balancing light orientation with exploratory reorientation. These internal dynamics are implemented using Chemical-Reaction-Network Ordinary Differential Equations (CRN-ODEs). By applying Inverse Reinforcement Learning to experimental Chlamydomonas trajectories, the researchers inferred the underlying behavioral objective, showing that the model accurately reproduces empirical alignment-to-light distributions. This work suggests that behaviors like run-tumble alternation emerge as a strategy for information acquisition, resolving sensory ambiguity through intracellular biochemical networks.

Why it matters

This interdisciplinary approach provides new insights into the computational capabilities of biological systems and could inspire novel designs for bio-inspired AI agents or autonomous micro-robots.

How to implement this in your domain

  1. 1Explore applying POMDP and CRN-ODE frameworks to model other complex biological processes beyond cellular navigation.
  2. 2Design bio-inspired AI agents that leverage information-gain strategies for exploration in uncertain environments.
  3. 3Investigate how to translate principles of curiosity-driven exploration from biological systems into robotic control algorithms.
  4. 4Develop simulation tools that integrate biochemical reaction networks with reinforcement learning for synthetic biology applications.

Who benefits

BiotechnologyRoboticsAI/MLPharmaceuticalsSynthetic Biology

Key takeaways

  • Cellular navigation can be modeled as an information-driven sensorimotor process using reinforcement learning.
  • A framework links POMDPs with biochemical reaction dynamics (CRN-ODEs) for this purpose.
  • The model explains behaviors like run-tumble alternation as information-acquisition strategies.
  • This approach offers insights into minimal cognition and bio-inspired AI.

Original post by Ruyi Tang (LCQB-AG), Gr\'egoire Sergeant-Perthuis (LCQB-AG), David Colliaux

"arXiv:2606.26168v1 Announce Type: new Abstract: Living systems navigate environments using noisy and incomplete sensory signals. In unicellular algae, phototaxis is often modeled as a mechanistic run--tumble process driven by stimulus--response rules. However, such descriptions o…"

View on X

Originally posted by Ruyi Tang (LCQB-AG), Gr\'egoire Sergeant-Perthuis (LCQB-AG), David Colliaux on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses