Ensemble Learning Algorithms for Solar Radiation Prediction in Santo Domingo: Measurements and Evaluation
Francisco A. Ramírez-Rivera () and
Néstor F. Guerrero-Rodríguez
Additional contact information
Francisco A. Ramírez-Rivera: Engineering Sciences, Pontificia Universidad Católica Madre y Maestra (PUCMM), Av. Abraham Lincoln Esq. Romulo Betancourt, Santo Domingo 2748, Dominican Republic
Néstor F. Guerrero-Rodríguez: Engineering Sciences, Pontificia Universidad Católica Madre y Maestra (PUCMM), Av. Abraham Lincoln Esq. Romulo Betancourt, Santo Domingo 2748, Dominican Republic
Sustainability, 2024, vol. 16, issue 18, 1-27
Abstract:
Solar radiation is a fundamental parameter for solar photovoltaic (PV) technology. Reliable solar radiation prediction has become valuable for designing solar PV systems, guaranteeing their performance, operational efficiency, safety in operations, grid dispatchment, and financial planning. However, high quality ground-based solar radiation measurements are scarce, especially for very short-term time horizons. Most existing studies trained machine learning (ML) models using datasets with time horizons of 1 h or 1 day, whereas very few studies reported using a dataset with a 1 min time horizon. In this study, a comprehensive evaluation of nine ensemble learning algorithms (ELAs) was performed to estimate solar radiation in Santo Domingo with a 1 min time horizon dataset, collected from a local weather station. The ensemble learning models evaluated included seven homogeneous ensembles: Random Forest (RF), Extra Tree (ET), adaptive gradient boosting (AGB), gradient boosting (GB), extreme gradient boosting (XGB), light gradient boosting (LGBM), histogram-based gradient boosting (HGB); and two heterogeneous ensembles: voting and stacking. RF, ET, GB, and HGB were combined to develop voting and stacking ensembles, with linear regression (LR) being adopted in the second layer of the stacking ensemble. Six technical metrics, including mean squared error (MSE), root mean squared error (RMSE), relative root mean squared error (rRMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and coefficient of determination (R2), were used as criteria to determine the prediction quality of the developed ensemble algorithms. A comparison of the results indicates that the HGB algorithm offers superior prediction performance among the homogeneous ensemble learning models, while overall, the stacking ensemble provides the best accuracy, with metric values of MSE = 3218.27, RMSE = 56.73, rRMSE = 12.700, MAE = 29.87, MAPE = 10.60, and R2 = 0.964.
Keywords: ensemble learning; evaluation metrics; heterogeneous ensemble learning; homogeneous ensemble learning; hyperparameter; time horizon; solar radiation (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/16/18/8015/pdf (application/pdf)
https://www.mdpi.com/2071-1050/16/18/8015/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:16:y:2024:i:18:p:8015-:d:1477599
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().