ResearchAI Research

Machine Learning Forecasts Rice Yields in Data-Scarce Regions Using Satellite Data

Ibrahim Denis Fofanah· June 15, 2026 View original

Summary

A new study explores using machine learning to forecast rice yields in data-constrained settings like Sierra Leone. It finds that combining national crop statistics with free satellite climate data significantly improves forecast accuracy, especially using early-season rainfall.

Researchers investigated the feasibility of using machine learning to predict rice yields in Sierra Leone, a country with limited agricultural data infrastructure. They trained various models, including XGBoost, Gradient Boosting, and Random Forest, using 25 years of national crop production data. Initially, models relying solely on crop statistics did not outperform a simple persistence benchmark. However, when the models were augmented with readily available satellite climate data, specifically CHIRPS rainfall and NASA POWER temperature, the forecasting accuracy improved substantially. An XGBoost model using only climate data reduced forecast error by one-third. The study highlighted early-season rainfall as a key predictor, suggesting that potential yield risks can be identified months before harvest. The findings offer practical policy recommendations for Sierra Leone's agricultural strategy, emphasizing the value of integrating climate data for better decision support, even in resource-limited environments. The entire methodology is provided as an open-source pipeline.

Why it matters

This research offers a cost-effective and scalable approach for improving agricultural planning and food security in developing nations by leveraging accessible satellite data and machine learning. Professionals in international development, agricultural technology, and climate science can use these methods to build more resilient food systems.

How to implement this in your domain

  1. 1Integrate satellite climate data (e.g., CHIRPS, NASA POWER) with existing agricultural statistics for yield forecasting.
  2. 2Develop and deploy machine learning models like XGBoost for early-season yield prediction based on climate indicators.
  3. 3Establish data collection protocols to capture relevant agricultural and environmental parameters for model training and validation.
  4. 4Collaborate with local agricultural agencies to translate forecasting insights into actionable policy recommendations for farmers and policymakers.

Who benefits

AgricultureInternational DevelopmentClimate ScienceData Analytics

Key takeaways

  • Machine learning can effectively forecast crop yields in data-scarce regions by integrating satellite climate data.
  • Early-season rainfall is a critical predictor for rice yield, enabling proactive risk assessment.
  • Open-source pipelines facilitate the adoption and adaptation of these forecasting methods in developing countries.
  • Combining diverse data sources significantly enhances predictive model performance over single-source approaches.

Original post by Ibrahim Denis Fofanah

"arXiv:2606.13959v1 Announce Type: new Abstract: Sierra Leone's agriculture operates with almost no data-driven decision support, and no published machine learning study has examined the country's crop yields. We ask whether rice yield can be forecast from data Sierra Leone curren…"

View on X

Originally posted by Ibrahim Denis Fofanah on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses