EconPapers    
Economics at your fingertips  
 

A novel hybrid model combined with ensemble embedded feature selection method for estimating reference evapotranspiration in the North China Plain

Hanmi Zhou, Linshuang Ma, Xiaoli Niu, Youzhen Xiang, Jiageng Chen, Yumin Su, Jichen Li, Sibo Lu, Cheng Chen and Qi Wu

Agricultural Water Management, 2024, vol. 296, issue C

Abstract: The reference evapotranspiration (ETo) is a key parameter in achieving sustainable use of agricultural water resources. To accurately acquire ETo under limited conditions, this study combined the northern goshawk optimization algorithm (NGO) with the extreme gradient boosting (XGBoost) model to propose a novel NGO-XGBoost model. The performance of this model was evaluated using meteorological data from 30 stations in the North China Plain and compared with XGBoost, random forest (RF), and k nearest neighbor (KNN) models. An ensemble embedded feature selection (EEFS) method combined with the results from RF, XGBoost, adaptive boosting (AdaBoost), and categorical boosting (CatBoost) models is used to obtain the importance of meteorological factors in estimating ETo, and thereby determine the optimal combination of inputs to the model. The results indicated that by using the top 3, 4, and 5 important factors as input combinations, all models achieved high ETo estimation accuracy. It is worth noting that there were significant spatial differences in the estimation precisions of the four models, but the NGO-XGBoost model exhibited consistently high estimation precisions, with global performance indicator (GPI) rankings of 1st, and the range of coefficient of determination (R2), nash efficiency coefficient (NSE), root mean square error (RMSE), mean absolute error (MAE) and mean bias error (MBE) were 0.920–0.998, 0.902–0.998, 0.078–0.623 mm d−1, 0.058–0.430 mm d−1, and −0.254–0.062 mm d−1, respectively. Furthermore, the accuracy of the NGO-XGBoost model in estimating ETo varied across different seasons, which was more significantly affected by humidity and wind speed in winter. When the target station data was insufficient, the NGO-XGBoost model was trained by using the historical data from neighboring stations and still maintained a high precision. Overall, this study recommends a reliable method for estimating ETo, which provides a reference for accurately calculating ETo in the North China Plain in the absence of meteorological data.

Keywords: Penman-Monteith equation; Northern goshawk optimization algorithm; Extreme gradient boosting; K nearest neighbor; Random forest (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378377424001422
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:agiwat:v:296:y:2024:i:c:s0378377424001422

DOI: 10.1016/j.agwat.2024.108807

Access Statistics for this article

Agricultural Water Management is currently edited by B.E. Clothier, W. Dierickx, J. Oster and D. Wichelns

More articles in Agricultural Water Management from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-19
Handle: RePEc:eee:agiwat:v:296:y:2024:i:c:s0378377424001422