Explainable GNNs Improve Multi-Site Air Pollution Prediction.

Abdelkader Dairi, Fouzi Harrou, Ying Sun· June 25, 2026 View original

Summary

This study introduces an auto-configured, explainable Graph Neural Network (GNN) framework for multi-site air pollution prediction, using a confusion matrix for dynamic graph construction and a hybrid loss function. It demonstrates improved accuracy over traditional models and provides interpretability for feature importance and graph structure.

This research presents an innovative approach to predicting particulate matter (PM) concentrations across multiple sites, a critical task for air pollution mitigation. The core of the method involves an auto-configured and explainable Graph Neural Network (GNN) framework. Unlike traditional GNNs that rely on predefined graphs, this system dynamically constructs its graph based on a confusion matrix derived from a supervised learning process, allowing it to better capture complex inter-class relationships in the data. To further enhance learning stability and address issues like vanishing gradients, the study incorporates a hybrid loss function combining energy distance and Huber loss. The framework was rigorously evaluated using real-world air pollution data from Salt Lake City, Utah, comparing five different GNN models. Results consistently showed that GraphSage achieved the highest prediction accuracy for PM1, PM10, and PM2.5 concentrations across various time horizons, outperforming both traditional machine learning and deep learning models. Furthermore, the application of GNNExplainer and PGExplainer provided crucial insights into feature importance and graph structure, ensuring the model's transparency and trustworthiness.

Why it matters

For environmental scientists, urban planners, and public health professionals, this research offers a powerful and transparent tool for more accurate air pollution forecasting. The explainable nature of the GNNs can build trust in predictions, aiding in timely interventions and policy-making to protect public health.

How to implement this in your domain

  1. 1Explore dynamic graph construction methods based on confusion matrices for GNN applications in environmental monitoring.
  2. 2Implement hybrid loss functions (e.g., energy distance + Huber loss) to improve GNN training stability and accuracy.
  3. 3Apply GraphSage or similar GNN architectures for spatiotemporal prediction tasks, especially in environmental or urban contexts.
  4. 4Integrate explainability tools like GNNExplainer or PGExplainer to interpret GNN predictions and build trust.

Who benefits

Environmental MonitoringUrban PlanningPublic HealthSmart CitiesClimate Science

Key takeaways

  • Dynamic graph construction improves GNN adaptability for pollution prediction.
  • Hybrid loss functions enhance learning stability and address gradient issues.
  • GraphSage demonstrates superior accuracy in multi-site air pollution forecasting.
  • Explainable GNNs provide transparency for feature importance and graph structure.

Original post by Abdelkader Dairi, Fouzi Harrou, Ying Sun

"arXiv:2606.24978v1 Announce Type: new Abstract: Accurate particulate matter (PM) prediction is crucial for mitigating air pollution. Graph Neural Networks (GNNs) effectively model spatiotemporal dependencies, but predefined graphs limit adaptability, and some datasets complicate…"

View on X

Originally posted by Abdelkader Dairi, Fouzi Harrou, Ying Sun on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses