Predicting Solar Radiation: A Fusion Approach with CatBoost and Random Forest Ensemble Enhance by Explainable AI
Abu Bakar Siddique Mahi (),
Farhana Sultana Eshita (),
Tasnim Jahin Mowla (),
Aloke Kumar Saha () and
Shah Murtaza Rashid Al Masud ()
Additional contact information
Abu Bakar Siddique Mahi: University of Asia Pacific
Farhana Sultana Eshita: University of Asia Pacific
Tasnim Jahin Mowla: University of Asia Pacific
Aloke Kumar Saha: University of Asia Pacific
Shah Murtaza Rashid Al Masud: University of Asia Pacific
A chapter in Machine Learning Technologies on Energy Economics and Finance, 2025, pp 155-180 from Springer
Abstract:
Abstract Accurate solar radiation estimation is crucial for effectively designing and optimizing solar energy systems. It serves a crucial function in determining the amount of energy that the sun contributes to a specific area. This radiation is crucial for generating energy in photovoltaic plants or solar thermal power plants. However, the occurrence of radiation fluctuates and is greatly influenced by climatic data, leading to an irregular pattern. The fluctuations in solar radiation result in variations in energy generation from these solar power systems. Due to the task’s complexity and the need to predict fluctuations in the performance of photovoltaic generation systems, it is critical to invest in the development of dependable tools that can accurately forecast and assess incident solar radiation. For this study, we have utilized a thorough approach to create a reliable and easily understandable predictive model. The study started by employing the SelectKBest feature selection method to identify the most influential variables from the provided dataset. Following the implementation of the feature selection technique, we proceeded to train a total of eight distinct machine learning models. We selected CatBoost and Random Forest as the top performers among the considered models for further development. In order to improve the system’s predictive capabilities, we developed an ensemble model by merging the two most successful individual models. We utilize a voting technique to combine the results of the CatBoost and Random Forest models, resulting in the ultimate ensemble prediction. The ensemble model gained an impressive accuracy rate of 98.52%, showcasing its exceptional predictive capabilities. We also utilized the Shapash library to enhance the interpretability of the ensemble model we developed. Through the implementation of a comprehensive approach, a robust, highly accurate, and interpretable ensemble model is successfully developed, capitalizing on the strengths of multiple machine learning algorithms.
Keywords: Solar radiation; Feature selection; Machine learning; Ensemble model; Explainable AI (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:isochp:978-3-031-95099-5_7
Ordering information: This item can be ordered from
http://www.springer.com/9783031950995
DOI: 10.1007/978-3-031-95099-5_7
Access Statistics for this chapter
More chapters in International Series in Operations Research & Management Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().