EconPapers    
Economics at your fingertips  
 

Vision-AQ: Explainable Multi-Modal Deep Learning for Air Pollution Classification in Smart Cities

Faisal Mehmood, Sajid Ur Rehman and Ahyoung Choi ()
Additional contact information
Faisal Mehmood: Department of AI and Software, Gachon University, Seongnam-si 13120, Gyeonggi-do, Republic of Korea
Sajid Ur Rehman: Department of Creative Technologies, Air University, Islamabad 44000, Pakistan
Ahyoung Choi: Department of AI and Software, Gachon University, Seongnam-si 13120, Gyeonggi-do, Republic of Korea

Mathematics, 2025, vol. 13, issue 18, 1-18

Abstract: Accurate air quality prediction (AQP) is crucial for safeguarding public health and guiding smart city management. However, reliable assessment remains challenging due to complex emission patterns, meteorological variability, and chemical interactions, compounded by the limited coverage of ground-based monitoring networks. To address this gap, we propose Vision-AQ (Visual Integrated Operational Network for Air Quality), a novel multi-modal deep learning framework that classifies Air Quality Index (AQI) levels by integrating environmental imagery with pollutant data. Vision-AQ employs a dual-input neural architecture: (1) a pre-trained ResNet50 convolutional neural network (CNN) that extracts high-level features from city-scale environmental photographs in India and Nepal, capturing haze, smog, and visibility patterns, and (2) a multi-layer perceptron (MLP) that processes tabular sensor data, including PM 2.5 , PM 10 , and AQI values. The fused representations are passed to a classifier to predict six AQI categories. Trained on a comprehensive dataset, the model achieves strong predictive performance with high accuracy, precision, recall and F1-score of 99%, with 23.7 million parameters. To ensure interpretability, we use Grad-CAM visualization to highlights the model’s reliance on meaningful atmospheric features, confirming its explainability. The results demonstrate that Vision-AQ is a reliable, scalable, and cost-effective approach for localized AQI classification, offering the potential to augment conventional monitoring networks and enable more granular air quality management in urban South Asia.

Keywords: explainable AI; multi-modal learning; air quality prediction; smart cities; environmental monitoring (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/18/3017/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/18/3017/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:18:p:3017-:d:1752211

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-09-19
Handle: RePEc:gam:jmathe:v:13:y:2025:i:18:p:3017-:d:1752211