Automated Pipeline Explores Heterogeneous Mixture-of-Experts Architectures

Yashkumar R Lukhi, Harsh Rameshbhai Moradiya, Radu Timofte, Dmitry Ignatov· June 24, 2026 View original

▶ The 2-minute explainer

Summary

Researchers developed an automated pipeline to systematically search for optimal 4-Expert Mixture-of-Experts (MoE4) architectures within the LEMUR dataset ecosystem. The campaign identified a coverage bias in the search space and highlighted ShuffleNet and MobileNetV3 as high-performing expert families.

Designing optimal Mixture-of-Experts (MoE) architectures, especially heterogeneous ones, is a complex and time-consuming task typically done manually. This research introduces an automated, large-scale search pipeline specifically for 4-Expert MoE (MoE4) architectures. The pipeline systematically combines base neural network families from the LEMUR database into MoE4 ensembles, each managed by a convolutional gating network with advanced training techniques. Over a 28-day campaign, the pipeline generated and evaluated over a thousand candidate models. A significant finding was the discovery of a coverage bias in the explored search space: due to an alphabetical enumeration method, the entire search was anchored to a single family, AirNet, exploring only a small fraction of the theoretical combinations. This bias was precisely characterized, and a fix involving stratified random sampling was proposed. Within the AirNet-anchored scope, the campaign identified ShuffleNet and MobileNetV3 as consistently producing the highest-accuracy ensembles, reaching mean accuracies up to 0.632. Conversely, FractalNet and MNASNet were flagged as low-yield families, suggesting their exclusion from future searches. The pipeline, analysis tools, and the corrected generator are now open-source, contributing to more efficient and unbiased MoE architecture exploration.

Why it matters

For AI engineers and researchers, automating the search for optimal neural network architectures, particularly for complex MoE models, can significantly accelerate development and improve model performance. Understanding and correcting search biases ensures more effective and comprehensive exploration of design spaces.

How to implement this in your domain

  1. 1Leverage automated pipeline search tools for exploring complex neural network architectures like Mixture-of-Experts.
  2. 2Critically review search space sampling strategies to identify and mitigate potential biases in exploration.
  3. 3Prioritize expert families like ShuffleNet and MobileNetV3 when designing heterogeneous MoE models based on identified high performance.
  4. 4Contribute to or utilize open-source projects that provide tools for automated architecture search and analysis.

Who benefits

AI EngineeringMachine Learning ResearchSoftware DevelopmentCloud Computing

Key takeaways

  • Automated pipelines can systematically explore complex Mixture-of-Experts architectures.
  • A significant coverage bias was found in the search space due to enumeration methods.
  • ShuffleNet and MobileNetV3 consistently formed high-accuracy MoE ensembles.
  • The pipeline and corrected generator are open-source for future research.

Original post by Yashkumar R Lukhi, Harsh Rameshbhai Moradiya, Radu Timofte, Dmitry Ignatov

"arXiv:2606.23739v1 Announce Type: new Abstract: We present an automated large-scale search pipeline for heterogeneous 4-Expert Mixture-of-Experts (MoE4) architectures within the LEMUR neural network dataset ecosystem. Building on a hand-crafted heterogeneous MoE reference model,…"

View on X

Originally posted by Yashkumar R Lukhi, Harsh Rameshbhai Moradiya, Radu Timofte, Dmitry Ignatov on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses