Assessment of Water Quality in Urban Lakes Using Multi-Source Data and Modeling Techniques
Arpan Dawn,
Gilbert Hinge,
Amandeep Kumar,
Mohammad Reza Nikoo and
Mohamed A. Hamouda ()
Additional contact information
Arpan Dawn: Department of Civil Engineering, National Institute of Technology Durgapur, Durgapur 713209, West Bengal, India
Gilbert Hinge: Department of Civil Engineering, National Institute of Technology Durgapur, Durgapur 713209, West Bengal, India
Amandeep Kumar: Department of Civil Engineering, National Institute of Technology Durgapur, Durgapur 713209, West Bengal, India
Mohammad Reza Nikoo: Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat P.O. Box 50, Oman
Mohamed A. Hamouda: Department of Civil and Environmental Engineering, United Arab Emirates University, Al Ain 15551, Abu Dhabi, United Arab Emirates
Sustainability, 2025, vol. 17, issue 16, 1-23
Abstract:
Urban and peri-urban lakes are increasingly threatened by water quality degradation due to rising anthropogenic pressures and environmental variability. This study proposes an integrated framework that combines multi-source data and machine learning to estimate and monitor three key water quality parameters: turbidity, total dissolved solids (TDS), and biological oxygen demand (BOD). Field measurements from three lakes in West Bengal, India, Rabindra Sarovar, Mirikh Lake, and Hanuman Ghat Lake, were combined with Landsat-8 satellite imagery, meteorological data, and land use information. Three modeling scenarios were developed: (i) using only remote sensing indices, (ii) combining remote sensing indices with meteorological variables, and (iii) integrating remote sensing indices, meteorological data, and land use features. Principal component analysis (PCA) was used to reduce dimensionality and redundancy. Machine learning models, namely, XGBoost, Decision Tree, and Ridge Regression, were trained and evaluated using R 2 and RMSE (Root Mean Square Error) metrics. The third scenario outperformed the others, with Ridge Regression achieving the highest accuracy for BOD prediction (R 2 = 0.99). Spatiotemporal patterns revealed persistently high BOD levels along urban lake fringes and post-monsoon spikes in turbidity and TDS, especially in agriculturally influenced zones. These patterns were closely linked to land use practices, rainfall-driven runoff, and point-source pollution. This study underscores the effectiveness of remote sensing and machine learning as scalable tools for real-time water quality monitoring, promoting sustainability through informed lake management strategies in India.
Keywords: water quality; turbidity; BOD; TDS; machine learning; remote sensing; urban lakes; Landsat-8; environmental monitoring (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/17/16/7258/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/16/7258/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:16:p:7258-:d:1722098
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().