EconPapers    
Economics at your fingertips  
 

Addressing Data Scarcity in Solar Energy Prediction with Machine Learning and Augmentation Techniques

Aleksandr Gevorgian, Giovanni Pernigotto () and Andrea Gasparella
Additional contact information
Aleksandr Gevorgian: Faculty of Engineering, Free University of Bozen-Bolzano, 39100 Bolzano, Italy
Giovanni Pernigotto: Faculty of Engineering, Free University of Bozen-Bolzano, 39100 Bolzano, Italy
Andrea Gasparella: Faculty of Engineering, Free University of Bozen-Bolzano, 39100 Bolzano, Italy

Energies, 2024, vol. 17, issue 14, 1-19

Abstract: The accurate prediction of global horizontal irradiance (GHI) is crucial for optimizing solar power generation systems, particularly in mountainous areas with complex topography and unique microclimates. These regions face significant challenges due to limited reliable data and the dynamic nature of local weather conditions, which complicate accurate GHI measurement. The scarcity of precise data impedes the development of reliable solar energy prediction models, impacting both economic and environmental outcomes. To address these data scarcity challenges in solar energy prediction, this paper focuses on various locations in Europe and Asia Minor, predominantly in mountainous regions. Advanced machine learning techniques, including random forest (RF) and extreme gradient boosting (XGBoost) regressors, are employed to effectively predict GHI. Additionally, optimizing training data distribution based on cloud opacity values and integrating synthetic data significantly enhance predictive accuracy, with R 2 scores ranging from 0.91 to 0.97 across multiple locations. Furthermore, substantial reductions in root mean square error (RMSE), mean absolute error (MAE), and mean bias error (MBE) underscore the improved reliability of the predictions. Future research should refine synthetic data generation, optimize additional meteorological and environmental parameter integration, extend methodology to new regions, and test for predicting global tilted irradiance (GTI). The studies should expand training data considerations beyond cloud opacity, incorporating sky cover and sunshine duration to enhance prediction accuracy and reliability.

Keywords: global horizontal irradiance (GHI); mountainous regions; data scarcity; solar energy prediction; machine learning (search for similar items in EconPapers)
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/1996-1073/17/14/3365/pdf (application/pdf)
https://www.mdpi.com/1996-1073/17/14/3365/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:17:y:2024:i:14:p:3365-:d:1431659

Access Statistics for this article

Energies is currently edited by Ms. Agatha Cao

More articles in Energies from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jeners:v:17:y:2024:i:14:p:3365-:d:1431659