A Comparative Analysis of Machine Learning Models for Simulating, Classifying, and Assessment River Inflow
Ali Najah Ahmed (),
Nguyen Van Thieu (),
Kai Lun Chong (),
Yuk Feng Huang () and
Ahmed El-Shafie ()
Additional contact information
Ali Najah Ahmed: Sunway University
Nguyen Van Thieu: PHENIKAA University
Kai Lun Chong: INTI International University (INTI-IU)
Yuk Feng Huang: Universiti Tunku Abdul Rahman
Ahmed El-Shafie: United Arab Emirates University
Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), 2025, vol. 39, issue 8, No 23, 4069 pages
Abstract:
Abstract Accurately classifying river inflow is crucial for understanding river dynamics and ecosystem health. This study evaluates the performance of seven machine learning models, including Support Vector Machines (SVM), Logistic Regression (LR), K-Nearest Neighbors (KNN), Decision Trees (DT), Random Forests (RF), Adaptive Boosting (AdaBoost), and Multi-Layer Perceptron (MLP), for streamflow classification. One of the key challenges in this task is the imbalance in class distributions, which can negatively impact model performance. To address this, we apply the Synthetic Minority Over-sampling Technique (SMOTE) to improve classification outcomes for minority classes. Furthermore, we investigate the impact of four proposed feature selection methods, including mutual information (MI-FS), linear kernel SVM (SVM-FS), random forest (RF-FS), and multi-criteria selection (MC-FS) on model performance by identifying optimal lag values. Model hyperparameters are fine-tuned using GridSearchCV technique, and evaluation step is assessed across seven performance metrics. Experimental results show that MLP and SVM consistently outperform other models, making them the most suitable choices for streamflow classification. Among the FS techniques, MC-FS demonstrates superior performance by effectively reducing dimensionality while preserving predictive power. However, our findings indicate that SMOTE enhances classification for minority classes but reduces accuracy for majority classes, highlighting a trade-off in handling imbalanced data. Additionally, we observe that the linear assumption in SVM-FS can negatively impact model performance when it fails to detect all relevant input features. These insights provide valuable guidance for future streamflow classification tasks.
Keywords: Inflow classification; Feature selection; SMOTE; Random forest; Adaptive boosting; Machine learning (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11269-025-04146-1 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:waterr:v:39:y:2025:i:8:d:10.1007_s11269-025-04146-1
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11269
DOI: 10.1007/s11269-025-04146-1
Access Statistics for this article
Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) is currently edited by G. Tsakiris
More articles in Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) from Springer, European Water Resources Association (EWRA)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().