EconPapers    
Economics at your fingertips  
 

Optimal subsampling for modal regression in massive data

Yue Chao (), Lei Huang (), Xuejun Ma () and Jiajun Sun ()
Additional contact information
Yue Chao: Soochow University
Lei Huang: Southwest Jiaotong University
Xuejun Ma: Soochow University
Jiajun Sun: Soochow University

Metrika: International Journal for Theoretical and Applied Statistics, 2024, vol. 87, issue 4, No 2, 379-409

Abstract: Abstract Many modern statistical analysis research efforts are focused on solving the limited computational resources problem that arises when dealing with large datasets. One popular and effective method to address this challenge is to obtain informative subdata from the full dataset based on optimal subsampling probabilities. In this article, we present an optimal subsampling approach for big data modal regression from the perspective of minimizing asymptotic mean squared error. The estimation procedure is achieved by running a two-step algorithm based on the modal expectation-maximization algorithm when the bandwidth for the modal regression is not related to the subsample size. Under certain regularity conditions, we investigate the consistency and asymptotic normality of the subsample-based estimator given the full data. Furthermore, an optimal bandwidth selection approach within this framework is also investigated. Simulation studies demonstrate that our proposed subsampling method performs well in the context of big data modal regression. Empirical evaluation is also conducted using real data.

Keywords: Modal regression; Massive data; Optimal subsampling strategies; MEM algorithm (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00184-023-00916-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:metrik:v:87:y:2024:i:4:d:10.1007_s00184-023-00916-2

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/184/PS2

DOI: 10.1007/s00184-023-00916-2

Access Statistics for this article

Metrika: International Journal for Theoretical and Applied Statistics is currently edited by U. Kamps and Norbert Henze

More articles in Metrika: International Journal for Theoretical and Applied Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-06
Handle: RePEc:spr:metrik:v:87:y:2024:i:4:d:10.1007_s00184-023-00916-2