Adaptive Regression Analysis of Heterogeneous Data Streams via Models with Dynamic Effects
Jianfeng Wei,
Jian Yang,
Xuewen Cheng,
Jie Ding () and
Shengquan Li ()
Additional contact information
Jianfeng Wei: Peng Cheng Laboratory, Shenzhen 518066, China
Jian Yang: Peng Cheng Laboratory, Shenzhen 518066, China
Xuewen Cheng: Peng Cheng Laboratory, Shenzhen 518066, China
Jie Ding: School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China
Shengquan Li: Peng Cheng Laboratory, Shenzhen 518066, China
Mathematics, 2023, vol. 11, issue 24, 1-18
Abstract:
Streaming data sequences arise from various areas in the era of big data, and it is challenging to explore efficient online models that adapt to them. To address the potential heterogeneity, we introduce a new online estimation procedure to analyze the constantly incoming streaming datasets. The underlying model structures are assumed to be the generalized linear models with dynamic regression coefficients. Our key idea lies in introducing a vector of unknown parameters to measure the differences between batch-specific regression coefficients from adjacent data blocks. This is followed by the usage of the adaptive lasso penalization methodology to accurately select nonzero components, which indicates the existence of dynamic coefficients. We provide detailed derivations to demonstrate how our proposed method not only fits within the online updating framework in which the old estimator is recursively replaced with a new one based solely on the current individual-level samples and historical summary statistics but also adaptively avoids undesirable estimation biases coming from the potential changes in model parameters of interest. Computational issues are also discussed in detail to facilitate implementation. Its practical performance is demonstrated through both extensive simulations and a real case study. In summary, we contribute to a novel online method that efficiently adapts to streaming data environment, addresses potential heterogeneity, and mitigates estimation biases from changes in coefficients.
Keywords: adaptive lasso; data streams; dynamic coefficients; online estimation; regression analysis (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www.mdpi.com/2227-7390/11/24/4899/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/24/4899/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:24:p:4899-:d:1296009
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().