Problem of data imbalance in building energy load prediction: Concept, influence, and solution

Chaobo Zhang, Junyang Li, Yang Zhao, Tingting Li, Qi Chen, Xuejun Zhang and Weikang Qiu

Applied Energy, 2021, vol. 297, issue C, No S0306261921005791

Abstract: Building energy systems operate under a wide range of conditions. The data available for some conditions may be far scarcer than the data available for others. This is the so-called data imbalance problem: the volumes of data differ across operation conditions. The problem is generally ignored in the field of building energy load prediction. Three questions remain unclear: how to identify the various building operation conditions, how the problem affects prediction accuracy, and how to overcome it. To address these three questions, this study first proposes a clustering decision tree algorithm to identify the building operation conditions. The effects of data imbalance are then investigated by changing the proportions of model training samples drawn from the various operation conditions. Finally, a clustering decision tree-based multi-model prediction method is proposed to solve the data imbalance problem. One year of historical operational data from a public building is used to validate the multi-model method. The results show that the proposed method has better prediction performance than the conventional single model-based method. It decreases the mean absolute errors of energy load prediction using artificial neural networks, gradient boosting trees, random forests, and support vector regression by 9.83%, 6.71%, 1.32%, and 12.22% on average, respectively. In addition, it increases the coefficients of determination of energy load prediction using the four algorithms by 8.47%, 4.59%, 0.26%, and 13.99% on average, respectively.
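
The multi-model idea described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the abstract does not give its details, so the sketch swaps in a 1-D two-means clustering of the load, a depth-1 decision tree (a single threshold) on a hypothetical occupancy feature, and one least-squares line per identified condition. All data, feature names, and coefficients below are invented for illustration.

```python
from statistics import mean

def fit_line(xs, ys):
    """Closed-form least squares for y = a + b * x."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b

def two_means(values, iters=20):
    """1-D k-means with k=2, seeded at min and max so it is deterministic.
    Label 1 is therefore always the high-value cluster."""
    c0, c1 = min(values), max(values)
    labels = []
    for _ in range(iters):
        labels = [0 if abs(v - c0) <= abs(v - c1) else 1 for v in values]
        c0 = mean([v for v, l in zip(values, labels) if l == 0])
        c1 = mean([v for v, l in zip(values, labels) if l == 1])
    return labels

def fit_stump(feature, labels):
    """Depth-1 decision tree: threshold t such that (feature > t) best
    reproduces the labels. Assumes label 1 goes with larger feature values."""
    vals = sorted(set(feature))
    candidates = [(a + b) / 2 for a, b in zip(vals, vals[1:])]
    def errors(t):
        return sum(int(f > t) != l for f, l in zip(feature, labels))
    return min(candidates, key=errors)

def true_load(occ, temp):
    """Hypothetical noise-free ground truth with two operation conditions."""
    return 60 + 3 * temp if occ > 0.5 else 10 + 0.5 * temp

# Imbalanced training set: 40 high-occupancy samples, only 5 low-occupancy.
train = [(0.6 + 0.01 * i, 20 + (i % 16)) for i in range(40)]
train += [(0.1 + 0.05 * i, 20 + 3 * i) for i in range(5)]
occs = [o for o, _ in train]
temps = [t for _, t in train]
loads = [true_load(o, t) for o, t in train]

# Step 1: identify operation conditions (cluster the load, then learn a
# rule that maps the occupancy feature to the cluster label).
labels = two_means(loads)
threshold = fit_stump(occs, labels)

# Step 2: conventional single model trained on all samples.
single = fit_line(temps, loads)

# Step 3: one model per identified condition.
models = {}
for k in (0, 1):
    idx = [i for i, l in enumerate(labels) if l == k]
    models[k] = fit_line([temps[i] for i in idx], [loads[i] for i in idx])

def predict_single(occ, temp):
    return single[0] + single[1] * temp

def predict_multi(occ, temp):
    a, b = models[int(occ > threshold)]
    return a + b * temp

# Balanced test set, so the minority condition counts as much as the majority.
test = [(o, t) for o in (0.2, 0.8) for t in range(20, 36)]

def mae(predict):
    return mean([abs(predict(o, t) - true_load(o, t)) for o, t in test])

mae_single = mae(predict_single)
mae_multi = mae(predict_multi)
print(f"MAE single model: {mae_single:.2f}")
print(f"MAE multi-model:  {mae_multi:.2f}")
```

On this synthetic data the single model is dominated by the majority condition, while the per-condition models fit each regime separately. The gap is exaggerated by the noise-free setup and should not be read as indicative of the improvements reported in the abstract.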

Keywords: Building energy load prediction; Identification of operation conditions; Data mining; Data imbalance; Clustering decision tree; Model interpretation
Date: 2021
Citations: 12 (in EconPapers)

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0306261921005791
Full text for ScienceDirect subscribers only

Persistent link: https://EconPapers.repec.org/RePEc:eee:appene:v:297:y:2021:i:c:s0306261921005791

Ordering information: This journal article can be ordered from
http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/bibliographic

DOI: 10.1016/j.apenergy.2021.117139

Applied Energy is currently edited by J. Yan

Bibliographic data for series maintained by Catherine Liu.

 
Page updated 2025-03-19
Handle: RePEc:eee:appene:v:297:y:2021:i:c:s0306261921005791