GeMSyD: Generic Framework for Synthetic Data Generation
Ramona Tolas (),
Raluca Portase and
Rodica Potolea
Additional contact information
Ramona Tolas: Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania
Raluca Portase: Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania
Rodica Potolea: Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania
Data, 2024, vol. 9, issue 1, 1-28
Abstract:
In the era of data-driven technologies, the need for diverse and high-quality datasets for training and testing machine learning models has become increasingly critical. In this article, we present a versatile methodology, the Generic Methodology for Constructing Synthetic Data Generation (GeMSyD), which addresses the challenge of synthetic data creation in the context of smart devices. GeMSyD provides a framework that enables the generation of synthetic datasets, aligning them closely with real-world data. To demonstrate the utility of GeMSyD, we instantiate the methodology by constructing a synthetic data generation framework tailored to the domain of event-based data modeling, specifically focusing on user interactions with smart devices. Our framework leverages GeMSyD to create synthetic datasets that faithfully emulate the dynamics of human–device interactions, including the temporal dependencies. Furthermore, we showcase how the synthetic data generated using our framework can serve as a valuable resource for machine learning practitioners. By employing these synthetic datasets, we perform a series of experiments to evaluate the performance of a neural-network-based prediction model in the domain of smart device interaction. Our results underscore the potential of synthetic data in facilitating model development and benchmarking.
Keywords: synthetic data generation framework; smart home-appliance dataset; user interaction data; event-based data processing; open-source framework; usage forecasting (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2306-5729/9/1/14/pdf (application/pdf)
https://www.mdpi.com/2306-5729/9/1/14/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:9:y:2024:i:1:p:14-:d:1317190
Access Statistics for this article
Data is currently edited by Ms. Cecilia Yang
More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().