DRL Transformer Solves Open Shop Scheduling with Scalability

Faezeh Ardali, Mwembezi A. Nyelele, Gerald M. Knapp· June 15, 2026 View original

Summary

This study introduces a Transformer-based deep reinforcement learning method for the Open Shop Scheduling Problem (OSSP), a complex industrial challenge. The model, trained on small instances, demonstrates strong generalization to significantly larger problems, achieving competitive makespan values compared to classical dispatching rules.

This research presents a novel approach to tackle the computationally intensive Open Shop Scheduling Problem (OSSP), which is prevalent in various industrial and service sectors. The proposed method utilizes a Transformer-based deep reinforcement learning (DRL) policy, employing an encoder-decoder architecture with multi-head attention. The model was trained using only processing-time matrices on relatively small Taillard benchmark instances. A key finding was its ability to generalize effectively to much larger, unseen problems, ranging from 40x40 to 100x100 instances, without requiring retraining. When compared against traditional dispatching heuristics like SPT, LPT, MWKR, and EST, the Transformer achieved makespans typically within 15-30% of best-known values on small instances and maintained competitive performance on large instances. This demonstrates that a learning-based, feature-light Transformer policy can offer a viable and scalable alternative to classical scheduling rules, particularly for complex, large-scale OSSP scenarios.

Why it matters

Efficient scheduling is critical for operational optimization across many industries. This DRL-based Transformer offers a scalable and adaptable solution for complex scheduling problems, potentially leading to significant cost savings and improved resource utilization for professionals in manufacturing, logistics, and service management.

How to implement this in your domain

  1. 1Evaluate the Transformer-based scheduling policy for specific OSSP instances in manufacturing or logistics operations.
  2. 2Integrate the model into existing production planning or resource allocation systems.
  3. 3Customize the DRL training environment to incorporate domain-specific constraints and objectives.
  4. 4Benchmark the Transformer's performance against current scheduling methods to quantify potential improvements.

Who benefits

ManufacturingLogisticsSupply Chain ManagementOperations ResearchService Management

Key takeaways

  • A DRL-based Transformer can effectively solve the Open Shop Scheduling Problem.
  • The model generalizes well from small training data to much larger instances.
  • It offers competitive performance compared to classical dispatching rules.
  • This approach provides a scalable, learning-based alternative for complex scheduling.

Original post by Faezeh Ardali, Mwembezi A. Nyelele, Gerald M. Knapp

"arXiv:2606.13682v1 Announce Type: new Abstract: The open shop scheduling problem (OSSP) arises in many industrial and service settings but remains computationally challenging as the number of jobs and machines increases. While exact methods quickly become intractable, classical d…"

View on X

Originally posted by Faezeh Ardali, Mwembezi A. Nyelele, Gerald M. Knapp on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses