A Hybrid Machine Learning Approach for High-Accuracy Energy Consumption Prediction Using Indoor Environmental Quality Sensors
Bibars Amangeldy,
Nurdaulet Tasmurzayev (),
Timur Imankulov (),
Baglan Imanbek,
Waldemar Wójcik and
Yedil Nurakhov
Additional contact information
Bibars Amangeldy: LLP “DigitAlem”, Almaty 050042, Kazakhstan
Nurdaulet Tasmurzayev: LLP “DigitAlem”, Almaty 050042, Kazakhstan
Timur Imankulov: LLP “DigitAlem”, Almaty 050042, Kazakhstan
Baglan Imanbek: Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan
Waldemar Wójcik: Institute of Electronics and Information Technology, Politechnika Lubelska, 20-618 Lublin, Poland
Yedil Nurakhov: LLP “DigitAlem”, Almaty 050042, Kazakhstan
Energies, 2025, vol. 18, issue 15, 1-31
Abstract:
Accurate forecasting of energy consumption in buildings is essential for achieving energy efficiency and reducing carbon emissions. However, many existing models rely on limited input variables and overlook the complex influence of indoor environmental quality (IEQ). In this study, we assess the performance of hybrid machine learning ensembles for predicting hourly energy demand in a smart office environment using high-frequency IEQ sensor data. Environmental variables including carbon dioxide concentration (CO 2 ), particulate matter (PM 2.5 ), total volatile organic compounds (TVOCs), noise levels, humidity, and temperature were recorded over a four-month period. We evaluated two ensemble configurations combining support vector regression (SVR) with either Random Forest or LightGBM as base learners and Ridge regression as a meta-learner, alongside single-model baselines such as SVR and artificial neural networks (ANN). The SVR combined with Random Forest and Ridge regression demonstrated the highest predictive performance, achieving a mean absolute error (MAE) of 1.20, a mean absolute percentage error (MAPE) of 8.92%, and a coefficient of determination (R 2 ) of 0.82. Feature importance analysis using SHAP values, together with non-parametric statistical testing, identified TVOCs, humidity, and PM 2.5 as the most influential predictors of energy use. These findings highlight the value of integrating high-resolution IEQ data into predictive frameworks and demonstrate that such data can significantly improve forecasting accuracy. This effect is attributed to the direct link between these IEQ variables and the activation of energy-intensive systems; fluctuations in humidity drive HVAC energy use for dehumidification, while elevated pollutant levels (TVOCs, PM 2.5 ) trigger increased ventilation to maintain indoor air quality, thus raising the total energy load.
Keywords: energy consumption; building energy management; machine learning; predictive modeling; indoor environmental quality (IEQ); support vector regression (SVR); hybrid models; smart buildings (search for similar items in EconPapers)
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1996-1073/18/15/4164/pdf (application/pdf)
https://www.mdpi.com/1996-1073/18/15/4164/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:18:y:2025:i:15:p:4164-:d:1718416
Access Statistics for this article
Energies is currently edited by Ms. Agatha Cao
More articles in Energies from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().