AI Agents Automate Chemical Reaction Rule Generation

Daniel Armstrong, Maarten Dobbelaere, Valentas Olikauskas, Helena Avila, Octavian Susanu, J\'er\^ome Waser, Philippe Schwaller· July 2, 2026 View original

Summary

A multi-agent framework of large language models (LLMs) has been developed to automatically classify chemical reactions and generate verifiable reaction rules. This system significantly expands a standard chemical taxonomy and can classify unseen reactions with high accuracy, creating a self-expanding reactivity database.

Computer-assisted synthesis planning relies heavily on extensive libraries of reaction rules to break down target molecules into simpler precursors. However, the vast and ever-evolving landscape of chemistry makes manual rule encoding impractical, and existing tools often use static rulesets that cannot adapt to new discoveries. This research introduces a fully automated pipeline that addresses these challenges.The pipeline utilizes a multi-agent framework of large language models (LLMs) to not only classify chemical reactions but also to generate the reaction rules themselves. Each rule is created under a rigorous verification loop, testing it against a large corpus of patent reactions. This system successfully expanded a standard chemical taxonomy from 68 to over 14,000 classes without human intervention, demonstrating its ability to adapt and grow. The resulting living reactivity database can classify new reactions with high accuracy, matching leading proprietary classifiers while offering finer chemical resolution and on-demand extensibility.

Why it matters

Professionals in chemical R&D, pharmaceuticals, and materials science can leverage this to accelerate drug discovery and materials design by automating the generation and verification of complex chemical reaction rules.

How to implement this in your domain

  1. 1Explore integrating agentic LLM frameworks for automated rule generation in chemical synthesis planning.
  2. 2Pilot the self-expanding reactivity database for specific R&D projects to classify novel reactions.
  3. 3Develop internal expertise in applying LLMs for symbolic system generation and verification in scientific domains.
  4. 4Assess the potential for customizing the rule generation pipeline for proprietary chemical data.

Who benefits

PharmaceuticalsChemical EngineeringMaterials ScienceBiotechnologyAcademia

Key takeaways

  • AI agents can automate the generation and verification of chemical reaction rules.
  • The system significantly expands chemical taxonomies without human curation.
  • It achieves high accuracy in classifying unseen reactions, matching proprietary tools.
  • This approach creates a self-expanding, living reactivity database for chemical synthesis.

Original post by Daniel Armstrong, Maarten Dobbelaere, Valentas Olikauskas, Helena Avila, Octavian Susanu, J\'er\^ome Waser, Philippe Schwaller

"arXiv:2607.01061v1 Announce Type: new Abstract: Computer-assisted synthesis planning breaks target molecules into accessible precursors using large libraries of reaction rules that assign each transformation a deterministic, interpretable label. But chemistry is long-tailed, maki…"

View on X

Originally posted by Daniel Armstrong, Maarten Dobbelaere, Valentas Olikauskas, Helena Avila, Octavian Susanu, J\'er\^ome Waser, Philippe Schwaller on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses