Integration of the Machine Learning Algorithms and I-MR Statistical Process Control for Solar Energy
Yasemin Ayaz Atalan and
Abdulkadir Atalan
Additional contact information
Yasemin Ayaz Atalan: Department of Mechanical Engineering, Yozgat Bozok University, Yozgat 66200, Turkey
Abdulkadir Atalan: Department of Industrial Engineering, Çanakkale Onsekiz Mart University, Çanakkale 17100, Turkey
Sustainability, 2023, vol. 15, issue 18, 1-20
Abstract:
The importance of solar power generation facilities, one type of renewable energy, is increasing daily. This study proposes a two-way validation approach that verifies forecast data by integrating machine learning (ML) predictions of solar energy production with I-MR statistical process control (SPC) charts. Forecasts of the amount of solar energy production were obtained with four ML algorithms: random forest (RF), linear regression (LR), gradient boosting (GB), and adaptive boosting, or AdaBoost (AB). Eight independent variables consisting of environmental and geographical factors were used. The data set covers solar energy production on 636 days over approximately two years. The study consisted of three stages. First, descriptive statistics and analysis of variance tests were performed on the dependent and independent variables. Second, forecasts of the amount of solar energy production, the dependent variable, were obtained from the AB, RF, GB, and LR algorithms. The AB algorithm performed best among the ML models, with the lowest RMSE, MSE, and MAE values and the highest R² value for the forecast data. In the estimation phase, the RMSE, MSE, MAE, and R² values of the AB algorithm were 0.328, 0.107, 0.134, and 0.909, respectively. The RF algorithm performed worst, with RMSE, MSE, MAE, and R² values of 0.685, 0.469, 0.503, and 0.623, respectively. In the last stage, the forecast data were tested with I-MR control charts, a statistical process control tool. By integrating the two techniques, the study aimed to validate the results obtained across all stages. It therefore offers a two-way verification approach for assessing whether a system's forecast data are under control for the future.
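The two checks named in the abstract, error metrics for the forecasts and I-MR control limits for the forecast series, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function names are hypothetical, the sample numbers are made up and are not data from the study, and the chart constants 2.66 and 3.267 are the textbook values for moving ranges of span 2.

```python
import math


def regression_metrics(actual, predicted):
    """Return RMSE, MSE, MAE, and R^2 for paired actual/predicted values."""
    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]
    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n
    mean_actual = sum(actual) / n
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - (mse * n) / ss_tot  # 1 - SS_res / SS_tot
    return {"RMSE": math.sqrt(mse), "MSE": mse, "MAE": mae, "R2": r2}


def imr_limits(values):
    """Return (LCL, center line, UCL) for the Individuals and Moving Range charts."""
    # Moving range of span 2: absolute difference between consecutive points.
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    mr_bar = sum(moving_ranges) / len(moving_ranges)
    center = sum(values) / len(values)
    return {
        # 2.66 = 3 / d2 with d2 = 1.128 for subgroups of size 2
        "I": (center - 2.66 * mr_bar, center, center + 2.66 * mr_bar),
        # D3 = 0 and D4 = 3.267 for subgroups of size 2
        "MR": (0.0, mr_bar, 3.267 * mr_bar),
    }


def out_of_control(values, limits):
    """Return indices of points outside the Individuals chart limits."""
    lcl, _, ucl = limits["I"]
    return [i for i, v in enumerate(values) if v < lcl or v > ucl]


if __name__ == "__main__":
    # Made-up illustrative forecasts, not data from the study.
    forecasts = [10.0, 12.0, 11.0, 13.0, 12.0]
    limits = imr_limits(forecasts)
    print(limits)
    print(out_of_control(forecasts, limits))
```

A forecast series with no points outside these limits would be treated as "under control" in the sense the abstract describes. Additional run rules, such as consecutive points on one side of the center line, are left out of this sketch.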
Keywords: solar energy; machine learning; random forest; AdaBoost; gradient boosting; linear regression; statistical process control; I-MR control chart
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2023
Downloads: (external link)
https://www.mdpi.com/2071-1050/15/18/13782/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/18/13782/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:18:p:13782-:d:1240958
Sustainability is currently edited by Ms. Alexandra Wu
Bibliographic data for series maintained by MDPI Indexing Manager.