EconPapers    
Economics at your fingertips  
 

CCPP Power Prediction Using CatBoost with Domain Knowledge and Recursive Feature Elimination

Baicun Guo (), Bowen Yang, Weizhan Shi, Fengye Yang, Dong Wang and Shuhong Wang
Additional contact information
Baicun Guo: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Bowen Yang: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Weizhan Shi: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Fengye Yang: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Dong Wang: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Shuhong Wang: School of Electrical Engineering, Xi’an Jiaotong University, Xi’an 710049, China

Energies, 2025, vol. 18, issue 16, 1-23

Abstract: Combined cycle power plants are modern power generation systems that provide an efficient and environmentally friendly way of generating electricity. With the development of smart grids, higher requirements have been put forward for their power prediction. Using a dataset comprising 9568 observations from a combined cycle power plant operating at full load for 6 years, a high-precision power prediction model integrating CatBoost and domain knowledge is proposed. Twenty new features were designed based on domain expertise, and Recursive Feature Elimination was applied to select the most informative features, optimizing model performance. Experimental results demonstrate that CatBoost outperformed six commonly used machine learning algorithms, both with and without domain knowledge integration. And the incorporation of domain knowledge improved the predictive performance of all evaluated models, underscoring the effectiveness and general applicability of the proposed features. Moreover, Recursive Feature Elimination was applied to select 11 features. The optimized CatBoost model achieved the best predictive accuracy with a root mean square error of 2.8545, a mean absolute error of 1.9645, and an R-squared of 0.9702. A comparative analysis with existing literature methods further validated the superior performance of the proposed approach. These findings highlight the effectiveness of integrating domain knowledge with machine learning and its potential for improving power output prediction in combined cycle power plants.

Keywords: CatBoost; combined cycle power plant; power prediction; domain knowledge; Recursive feature elimination (search for similar items in EconPapers)
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1996-1073/18/16/4272/pdf (application/pdf)
https://www.mdpi.com/1996-1073/18/16/4272/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134

Access Statistics for this article

Energies is currently edited by Ms. Agatha Cao

More articles in Energies from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-08-12
Handle: RePEc:gam:jeners:v:18:y:2025:i:16:p:4272-:d:1722134