EconPapers    
Economics at your fingertips  
 

Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms

Prabal Das (), D. A. Sachindra () and Kironmala Chanda ()
Additional contact information
Prabal Das: Indian Institute of Technology (Indian School of Mines)
D. A. Sachindra: Maria Curie-Sklodowska University
Kironmala Chanda: Indian Institute of Technology (Indian School of Mines)

Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), 2022, vol. 36, issue 15, No 15, 6043-6071

Abstract: Abstract The present research examined the potential of two important feature selection methods, Bayesian Networks (BN) and Recursive Feature Elimination (RFE), in identifying the optimum predictors for forecasting rainfall in the Godavari Basin in India. Initially, a set of ‘probable hydro-climatological variables’ is chosen based on previous studies. Following a correlation analysis between these probable predictors and monthly rainfall, the most correlated contiguous zones were chosen as ‘potential predictors' which were subsequently used as inputs to the two feature selection algorithms, BN and RFE for selecting the ‘optimum predictors’. The optimum predictions were further utilised to develop seven state-of-the-art Machine Learning (ML) models, including Support Vector Regression (SVR), Gaussian Process Regression (GPR), Multivariate Adaptive Regression Splines (MARS), Random Forest (RF), Parallel Multi-Population Genetic Programming (PMPGP (5 demes (Case 1), 9 demes (Case 2), and 17 demes (Case 3)). The result of the correlation analysis revealed that the domain for Relative Humidity, Total Precipitable Water should be considered over the study area, and for U-Wind, V-Wind and Surface Pressure, the zones around Indian Ocean, Arabian Sea and Persian Gulf should be considered respectively. BN, as a feature selection technique for choosing the optimum predictors, was found to be more effective than RFE. In terms of prediction models, GPR and PMPGP models outperformed others, both when used alone and in conjunction with feature selection methods. The R2 values for GPR models vary from 0.82–0.41, whereas the same varies from 0.81–0.31 for PMPGP models.

Keywords: Bayesian Networks (BN); Recursive Feature Elimination (RFE); Gaussian Process Regression (GPR); Multivariate adaptive regression splines (MARS); Random forest (RF); Parallel Multi-Population Genetic Programming (PMPGP); Long-term rainfall (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
http://link.springer.com/10.1007/s11269-022-03341-8 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:waterr:v:36:y:2022:i:15:d:10.1007_s11269-022-03341-8

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11269

DOI: 10.1007/s11269-022-03341-8

Access Statistics for this article

Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) is currently edited by G. Tsakiris

More articles in Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA) from Springer, European Water Resources Association (EWRA)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:waterr:v:36:y:2022:i:15:d:10.1007_s11269-022-03341-8