Foundation Models Evaluated for Epidemic Forecasting; Mixture-of-Experts Excels

Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga· June 19, 2026 View original

Summary

A systematic evaluation of modern forecasting architectures for regional influenza prediction found that a mixture-of-experts model, combining multiple pretrained forecasters, achieved the strongest overall performance. The study also highlighted that numerical transformer-based models are reliable, pretraining offers significant gains at longer horizons, and LLM-based methods underperform in this specific time series forecasting context.

Accurate short-term forecasting of seasonal influenza is a critical public health need, informing vaccination schedules, hospital staffing, and resource allocation. Despite this, the comparative performance of modern forecasting architectures on infectious disease surveillance data has not been thoroughly characterized. This research addresses that gap by systematically evaluating various models for regional influenza forecasting, using influenza-like illness surveillance and hospitalization time series data. The study assessed classical neural networks, numerical transformer-based models, pretrained time series foundation models, and LLM-based forecasting approaches for 1-4-week-ahead predictions, considering both temporal and spatial generalization. A key finding was that a mixture-of-experts model, which fuses diverse pretrained forecasters, delivered the strongest overall performance, suggesting that heterogeneous pretrained representations offer complementary predictive information. Further results showed that numerical transformer-based models provide reliable forecasts, with pretraining yielding the most substantial improvements for longer prediction horizons, especially when the pretraining domain aligns mechanistically with influenza dynamics. Conversely, LLM-based time series methods performed less effectively than numerical forecasters in this specific application. The research also explored the utility of hospitalization information as an auxiliary covariate and pretraining source, finding that it can provide complementary improvements in certain scenarios and clarify when additional surveillance streams enhance multi-horizon forecasting robustness.

Why it matters

For professionals in public health, data science, and AI development, this research provides actionable guidance on selecting and configuring time series forecasting models for critical applications like epidemic prediction. It clarifies the strengths and weaknesses of different AI architectures, including LLMs, for real-world time series data.

How to implement this in your domain

  1. 1Consider implementing a mixture-of-experts approach for time series forecasting by combining diverse pretrained models.
  2. 2Prioritize numerical transformer-based models for robust time series predictions, especially for longer horizons with relevant pretraining.
  3. 3Evaluate the mechanistic alignment of pretraining domains when selecting or developing time series foundation models.
  4. 4Incorporate auxiliary signals like hospitalization data to enhance the robustness of multi-horizon forecasts where applicable.
  5. 5Exercise caution when applying LLM-based methods directly to time series forecasting, as they may underperform compared to specialized numerical models.

Who benefits

HealthcarePublic HealthData ScienceGovernmentLogistics

Key takeaways

  • Mixture-of-experts models excel in epidemic forecasting by combining diverse pretrained representations.
  • Numerical transformer models provide reliable forecasts, with pretraining boosting longer-horizon accuracy.
  • LLM-based time series methods generally underperform specialized numerical forecasters in this domain.
  • Auxiliary data like hospitalization signals can improve forecast robustness in specific settings.

Original post by Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga

"arXiv:2606.19560v1 Announce Type: new Abstract: Seasonal influenza infects millions of people and causes substantial morbidity and mortality in the United States each year, making accurate short-term forecasting a core public-health need. Reliable forecasts of epidemic time serie…"

View on X

Originally posted by Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses