Water quality classification model with small features and class imbalance based on fuzzy rough sets
Sara A. Shehab (),
Ashraf Darwish and
Aboul Ella Hassanien
Additional contact information
Sara A. Shehab: Faculty of Computers and AI
Ashraf Darwish: Helwan University
Aboul Ella Hassanien: Cairo University
Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, 2025, vol. 27, issue 1, No 48, 1419 pages
Abstract:
Abstract Water quality has witnessed a significant decline in recent decades due to pollution and various other challenges. Consequently, there is a pressing need for accurate models capable of predicting water quality. This paper proposed a robust fuzzy rough set model for feature selection which can effectively improve classification performance while discarding the irrelevant features. To address the issue of imbalanced data, the synthetic minority oversampling technique (SMOTE) is employed first. Then, a fuzzy rough set is applied to select the most relevant features. Several machine learning models have been performed to determine the most appropriate model applicable based on the performance of the selected feature. The classification was performed with eXtreme gradient boosting (XGBoost), adaptive boosting ensemble (AdaBoost), and random forest. For every combination between the feature selection method and the classifiers, the most appropriate feature subsets were found and used for model training within the training set of the classifiers. Experimental results demonstrate that the proposed model achieves a remarkable accuracy of 98.83%, with precision, sensitivity, specificity, and F1-Score reaching 98.30%, 99.31%, 98.39%, and 98.80%, respectively. This underscores the effectiveness of the XGBoost algorithm in accurately classifying water quality. The suggested model exhibits a high degree of accuracy while employing only a few parameters, making it a promising solution for real-time water quality monitoring systems.
Keywords: Fuzzy rough sets; Feature selection; Water index quality; Machine learning (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10668-023-03916-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:endesu:v:27:y:2025:i:1:d:10.1007_s10668-023-03916-4
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/10668
DOI: 10.1007/s10668-023-03916-4
Access Statistics for this article
Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development is currently edited by Luc Hens
More articles in Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().