Ensemble Machine Learning for Predicting the Power Output from Different Solar Photovoltaic Systems

Raj, Veena; Dotse, Sam-Quarcoo; Sathyajith, Mathew; Petra, M. I.; Yassin, Hayati

Veena Raj, Sam-Quarcoo Dotse, Mathew Sathyajith, M. I. Petra and Hayati Yassin
Additional contact information
Veena Raj: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Sam-Quarcoo Dotse: School of Sustainable Development, University of Environment and Sustainable Development, Private Mail Bag, Somanya, Ghana
Mathew Sathyajith: Faculty of Engineering and Science, University of Agder, Jon Lilletunsvei 9, 4879 Grimstad, Norway
M. I. Petra: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
Hayati Yassin: Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei

Energies, 2023, vol. 16, issue 2, 1-15

Abstract: In this paper, ensemble-based machine learning models using gradient boosting machines and random forests are proposed for predicting the power production from six different solar PV systems. The models are based on three years' performance of a 1.2 MW grid-integrated solar photovoltaic (PV) power plant. After the data were cleaned of errors and outliers, the model features were chosen on the basis of principal component analysis. The accuracies of the developed models were tested and compared with those of models based on other supervised learning algorithms, such as k-nearest neighbour and support vector machines. Although the accuracies varied with the type of PV system, the models developed in the study generally performed well in predicting the power output from different solar PV technologies under varying working environments. For example, the average root mean square errors of the models based on gradient boosting machines, random forest, k-nearest neighbour, and support vector machines are 17.59 kW, 17.14 kW, 18.74 kW, and 16.91 kW, respectively. The corresponding average mean absolute errors are 8.28 kW, 7.88 kW, 14.45 kW, and 6.89 kW. Comparing the different modelling methods, the decision-tree-based ensemble algorithms and the support vector machine models outperformed the approach based on the k-nearest neighbour method. With these high accuracies and lower computational costs than deep learning approaches, the proposed ensemble models could be good options for PV performance predictions in real-time and near-real-time applications.
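The abstract compares models by root mean square error (RMSE) and mean absolute error (MAE), both in kW. As a minimal sketch of how these two metrics are conventionally defined, the following uses only the Python standard library. The function names and the sample values are hypothetical illustrations, not data or code from the paper, and the paper's own evaluation procedure may differ in detail.

```python
import math


def rmse(actual, predicted):
    """Root mean square error, in the same unit as the inputs (kW here)."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(actual, predicted):
    """Mean absolute error, in the same unit as the inputs (kW here)."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    absolute = [abs(a - p) for a, p in zip(actual, predicted)]
    return sum(absolute) / len(absolute)


# Hypothetical measured and predicted power values in kW (not from the paper).
actual_kw = [100.0, 250.0, 400.0, 320.0]
predicted_kw = [98.0, 252.0, 398.0, 280.0]

print(f"MAE:  {mae(actual_kw, predicted_kw):.2f} kW")   # MAE:  11.50 kW
print(f"RMSE: {rmse(actual_kw, predicted_kw):.2f} kW")  # RMSE: 20.07 kW
```

Because RMSE squares each error before averaging, it is never smaller than MAE on the same data and grows faster when a few predictions miss by a wide margin. In the sample above, a single 40 kW miss pushes RMSE to nearly twice the MAE. This is one general reason the two metrics can rank models differently, which is why reporting both is useful.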

Keywords: solar PV power prediction; machine learning; random forest; support vector machines; k-nearest neighbour; gradient boosting machine
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49
Date: 2023
Downloads:
https://www.mdpi.com/1996-1073/16/2/671/pdf (application/pdf)
https://www.mdpi.com/1996-1073/16/2/671/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:16:y:2023:i:2:p:671-:d:1026904

Energies is currently edited by Ms. Cassie Shen