A novel hybrid model combined with ensemble embedded feature selection method for estimating reference evapotranspiration in the North China Plain
Hanmi Zhou,
Linshuang Ma,
Xiaoli Niu,
Youzhen Xiang,
Jiageng Chen,
Yumin Su,
Jichen Li,
Sibo Lu,
Cheng Chen and
Qi Wu
Agricultural Water Management, 2024, vol. 296, issue C
Abstract:
The reference evapotranspiration (ETo) is a key parameter in achieving sustainable use of agricultural water resources. To accurately acquire ETo under limited conditions, this study combined the northern goshawk optimization algorithm (NGO) with the extreme gradient boosting (XGBoost) model to propose a novel NGO-XGBoost model. The performance of this model was evaluated using meteorological data from 30 stations in the North China Plain and compared with XGBoost, random forest (RF), and k nearest neighbor (KNN) models. An ensemble embedded feature selection (EEFS) method combined with the results from RF, XGBoost, adaptive boosting (AdaBoost), and categorical boosting (CatBoost) models is used to obtain the importance of meteorological factors in estimating ETo, and thereby determine the optimal combination of inputs to the model. The results indicated that by using the top 3, 4, and 5 important factors as input combinations, all models achieved high ETo estimation accuracy. It is worth noting that there were significant spatial differences in the estimation precisions of the four models, but the NGO-XGBoost model exhibited consistently high estimation precisions, with global performance indicator (GPI) rankings of 1st, and the range of coefficient of determination (R2), nash efficiency coefficient (NSE), root mean square error (RMSE), mean absolute error (MAE) and mean bias error (MBE) were 0.920–0.998, 0.902–0.998, 0.078–0.623 mm d−1, 0.058–0.430 mm d−1, and −0.254–0.062 mm d−1, respectively. Furthermore, the accuracy of the NGO-XGBoost model in estimating ETo varied across different seasons, which was more significantly affected by humidity and wind speed in winter. When the target station data was insufficient, the NGO-XGBoost model was trained by using the historical data from neighboring stations and still maintained a high precision. Overall, this study recommends a reliable method for estimating ETo, which provides a reference for accurately calculating ETo in the North China Plain in the absence of meteorological data.
Keywords: Penman-Monteith equation; Northern goshawk optimization algorithm; Extreme gradient boosting; K nearest neighbor; Random forest (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378377424001422
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:agiwat:v:296:y:2024:i:c:s0378377424001422
DOI: 10.1016/j.agwat.2024.108807
Access Statistics for this article
Agricultural Water Management is currently edited by B.E. Clothier, W. Dierickx, J. Oster and D. Wichelns
More articles in Agricultural Water Management from Elsevier
Bibliographic data for series maintained by Catherine Liu ().