Annual Peak Runoff Forecasting Using Two-Stage Input Variable Selection-Aided k-Nearest-Neighbors Ensemble
Wei Sun (),
Decheng Zeng,
Shu Chen,
Miaomiao Ren and
Yutong Xie
Additional contact information
Wei Sun: Sun Yat-Sen University
Decheng Zeng: GDH Feilaixia Hydropower Co Ltd
Shu Chen: Sun Yat-Sen University
Miaomiao Ren: Sun Yat-Sen University
Yutong Xie: Sun Yat-Sen University
Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), 2025, vol. 39, issue 8, No 27, 4135-4150
Abstract:
Abstract Annual river forecasting is a useful tool for practical flood management. However, since it involves a problem with a small dataset, deep learning models with complex network structures are generally not applicable. In this study, two-stage input variable selection (IVS)-aided k-nearest-neighbors (kNN) ensemble models are developed for annual peak flow (APF) forecasting at the Lechangxia Reservoir in China. In the first stage, three correlation-based metrics (Pearson correlation coefficient, Spearman correlation coefficient, and mutual information index) are used to rank tele-connected indicators according to their linear and nonlinear relationships with historical APF. The top indicators identified through this filter method are forwarded to the second stage, where a leave-one-out cross-validation-based exhaustive search systematically evaluates all possible combinations of the retained inputs. The optimal kNN member uses the two-stage IVS method based on the Spearman correlation coefficient. Multiple kNN models are subsequently developed using distinct input subsets, and these models are aggregated through a simple average method to create the final ensemble forecast. The optimal ensemble model improves the validation R and RMSE of the optimal member model, by 17.5% and 16.1%, respectively. This study highlights the effectiveness of improving long-term river forecasting performances through integrating pre-processing (two-stage input variable selection) and post-processing (multi-strategy kNN ensemble) methods.
Keywords: K-nearest neighbors; Ensemble learning; Annual peak flow; Long-term river forecasting; Flooding (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11269-025-04149-y Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:waterr:v:39:y:2025:i:8:d:10.1007_s11269-025-04149-y
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11269
DOI: 10.1007/s11269-025-04149-y
Access Statistics for this article
Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) is currently edited by G. Tsakiris
More articles in Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) from Springer, European Water Resources Association (EWRA)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().