Optimal subsampling for modal regression in massive data
Yue Chao (),
Lei Huang (),
Xuejun Ma () and
Jiajun Sun ()
Additional contact information
Yue Chao: Soochow University
Lei Huang: Southwest Jiaotong University
Xuejun Ma: Soochow University
Jiajun Sun: Soochow University
Metrika: International Journal for Theoretical and Applied Statistics, 2024, vol. 87, issue 4, No 2, 379-409
Abstract:
Abstract Many modern statistical analysis research efforts are focused on solving the limited computational resources problem that arises when dealing with large datasets. One popular and effective method to address this challenge is to obtain informative subdata from the full dataset based on optimal subsampling probabilities. In this article, we present an optimal subsampling approach for big data modal regression from the perspective of minimizing asymptotic mean squared error. The estimation procedure is achieved by running a two-step algorithm based on the modal expectation-maximization algorithm when the bandwidth for the modal regression is not related to the subsample size. Under certain regularity conditions, we investigate the consistency and asymptotic normality of the subsample-based estimator given the full data. Furthermore, an optimal bandwidth selection approach within this framework is also investigated. Simulation studies demonstrate that our proposed subsampling method performs well in the context of big data modal regression. Empirical evaluation is also conducted using real data.
Keywords: Modal regression; Massive data; Optimal subsampling strategies; MEM algorithm (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s00184-023-00916-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:metrik:v:87:y:2024:i:4:d:10.1007_s00184-023-00916-2
Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/184/PS2
DOI: 10.1007/s00184-023-00916-2
Access Statistics for this article
Metrika: International Journal for Theoretical and Applied Statistics is currently edited by U. Kamps and Norbert Henze
More articles in Metrika: International Journal for Theoretical and Applied Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().