A Multimodal Interaction-Driven Feature Discovery Framework for Power Demand Forecasting
Zifan Ning,
Min Jin () and
Pan Zeng
Additional contact information
Zifan Ning: College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
Min Jin: College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
Pan Zeng: College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
Energies, 2025, vol. 18, issue 11, 1-25
Abstract:
Power demand forecasting is a critical and challenging task for modern power systems and integrated energy systems. Due to the absence of well-established theoretical frameworks and publicly available feature databases on power demand changes, the known interpretable features of power demand fluctuations are primarily derived from expert experience and remain significantly limited. This substantially hinders advancements in power demand forecasting accuracy. Emerging multimodal learning approaches have demonstrated great promise in machine learning and AI-generated content (AIGC). In this paper, we propose, for the first time, a textual-knowledge-guided numerical feature discovery (TKNFD) framework for short-term power demand forecasting by interacting text modal data—a potentially valuable yet long-overlooked resource in the field of power demand forecasting—with numerical modal data. TKNFD systematically and automatically aggregates qualitative textual knowledge, expands it into a candidate feature-type set, collects corresponding numerical data for these features, and ultimately constructs four-dimensional multivariate source-tracking databases (4DM-STDs). Subsequently, TKNFD introduces a two-stage quantitative feature identification strategy that operates independently of forecasting models. The essence of TKNFD lies in achieving reliable and comprehensive feature discovery by fully exploiting the dual relationships of synonymy and complementarity between text modal data and numerical modal data in terms of granularity, scope, and temporality. In this study, TKNFD identifies 38–50 features while further interpreting their contributions and dependency correlations. Benchmark experiments conducted in Maine, Texas, and New South Wales demonstrate that the forecasting accuracy using TKNFD-identified features consistently surpasses that of state-of-the-art feature schemes by up to 36.37% MAPE. Notably, driven by multimodal interaction, TKNFD can discover previously unknown interpretable features without relying on prior empirical knowledge. This study reveals 10–16 previously unknown interpretable features, particularly several dominant features in integrated energy and astronomical dimensions. These discoveries enhance our understanding of the origins of strong randomness and non-linearity in power demand fluctuations. Additionally, the 4DM-STDs developed for these three regions can serve as public baseline databases for future research.
Keywords: multimodal learning; qualitative knowledge; quantitative identification; prior empirical knowledge; interpretability; integrated energy dimension; methane price (search for similar items in EconPapers)
JEL-codes: Q Q0 Q4 Q40 Q41 Q42 Q43 Q47 Q48 Q49 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1996-1073/18/11/2907/pdf (application/pdf)
https://www.mdpi.com/1996-1073/18/11/2907/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jeners:v:18:y:2025:i:11:p:2907-:d:1670082
Access Statistics for this article
Energies is currently edited by Ms. Agatha Cao
More articles in Energies from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().