LightGBM-, SHAP-, and Correlation-Matrix-Heatmap-Based Approaches for Analyzing Household Energy Data: Towards Electricity Self-Sufficient Houses

Nitin Kumar Singh and Masaaki Nagahara
Additional contact information
Nitin Kumar Singh: Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, 2-4 Hibikino, Wakamatsu Campus, Kitakyushu 808-0196, Japan
Masaaki Nagahara: Graduate School of Advanced Science and Engineering, Hiroshima University, Higashi Hiroshima City 739-8527, Japan

Energies, 2024, vol. 17, issue 17, 1-32

Abstract: The rapidly growing global energy demand, environmental concerns, and the urgent need to reduce carbon footprints have made sustainable household energy consumption a critical priority. This study analyzes household energy data to predict the electricity self-sufficiency rate (ESSR) of households and to extract insights that can improve it. To this end, we apply LightGBM (Light Gradient Boosting Machine)-, SHAP (SHapley Additive exPlanations)-, and correlation-heatmap-based approaches to 12 months of energy and questionnaire survey data collected from over 200 smart houses in Kitakyushu, Japan. First, we use LightGBM to predict the ESSR of households and identify the key features that affect the prediction model. LightGBM shows that the key features are the housing type, average monthly electricity bill, presence of a floor heating system, average monthly gas bill, electricity tariff plan, electrical capacity, number of TVs, cooking equipment used, number of washing and drying machines, and frequency of viewing the home energy management system (HEMS). Furthermore, we adopt a LightGBM classifier with ℓ1 regularization to extract the most significant features and establish a statistical correlation between these features and the ESSR. This LightGBM-based model can also predict the ESSR of households that did not participate in the questionnaire survey. The LightGBM-based model offers a global view of feature importance but lacks detailed explanations for individual predictions. We therefore use SHAP analysis to rank the key features by their impact on the ESSR and to evaluate the contribution of each feature to the model’s predictions. A heatmap is also used to analyze the correlation among household variables and the ESSR. To evaluate the performance of the classification model, we use a confusion matrix, which yields a weighted-average F1 score of 0.90. The findings discussed in this article offer valuable insights for energy policymakers working toward energy-self-sufficient houses.
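
The weighted-average F1 score mentioned in the abstract can be derived directly from a confusion matrix. The following is a minimal sketch in Python, not the authors' code: it assumes rows are true classes and columns are predicted classes, and both the function name `weighted_f1` and the 3x3 matrix `example_cm` are hypothetical, with made-up numbers that are not taken from the paper.

```python
def weighted_f1(cm):
    """Weighted-average F1 from a square confusion matrix.

    Assumed layout: cm[i][j] is the number of samples whose true class
    is i and whose predicted class is j.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    score = 0.0
    for k in range(n):
        tp = cm[k][k]
        support = sum(cm[k])                          # true samples of class k
        predicted = sum(cm[i][k] for i in range(n))   # samples predicted as k
        precision = tp / predicted if predicted else 0.0
        recall = tp / support if support else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        # Each class contributes its F1 in proportion to its support.
        score += f1 * support / total
    return score


# Hypothetical confusion matrix for three ESSR classes (illustrative only).
example_cm = [
    [18, 2, 0],
    [1, 27, 2],
    [0, 3, 17],
]
print(round(weighted_f1(example_cm), 3))
```

Weighting by support means that classes with more households count more toward the final score, which is why the weighted average can differ from the plain (macro) mean of per-class F1 values when classes are imbalanced.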

Keywords: SHAP; LightGBM; correlation heatmap; time-series data; zero-carbon housing; energy policy; questionnaire survey
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49
Date: 2024

Downloads: (external link)
https://www.mdpi.com/1996-1073/17/17/4518/pdf (application/pdf)
https://www.mdpi.com/1996-1073/17/17/4518/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:17:y:2024:i:17:p:4518-:d:1474242

Energies is currently edited by Ms. Agatha Cao

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-19
Handle: RePEc:gam:jeners:v:17:y:2024:i:17:p:4518-:d:1474242