Robust optimal subsampling based on weighted asymmetric least squares
Min Ren (),
Shengli Zhao (),
Mingqiu Wang () and
Xinbei Zhu ()
Additional contact information
Min Ren: Qufu Normal University
Shengli Zhao: Qufu Normal University
Mingqiu Wang: Qufu Normal University
Xinbei Zhu: Virginia Tech University
Statistical Papers, 2024, vol. 65, issue 4, No 13, 2251 pages
Abstract:
Abstract With the development of contemporary science, a large amount of generated data includes heterogeneity and outliers in the response and/or covariates. Furthermore, subsampling is an effective method to overcome the limitation of computational resources. However, when data include heterogeneity and outliers, incorrect subsampling probabilities may select inferior subdata, and statistic inference on this subdata may have a far inferior performance. Combining the asymmetric least squares and $$L_2$$ L 2 estimation, this paper proposes a double-robustness framework (DRF), which can simultaneously tackle the heterogeneity and outliers in the response and/or covariates. The Poisson subsampling is implemented based on the DRF for massive data, and a more robust probability will be derived to select the subdata. Under some regularity conditions, we establish the asymptotic properties of the subsampling estimator based on the DRF. Numerical studies and actual data demonstrate the effectiveness of the proposed method.
Keywords: Asymmetric least squares; Massive data; Poisson subsampling; Robustness; Primary 62F12; Secondary 62D99 (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s00362-023-01480-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:stpapr:v:65:y:2024:i:4:d:10.1007_s00362-023-01480-7
Ordering information: This journal article can be ordered from
http://www.springer. ... business/journal/362
DOI: 10.1007/s00362-023-01480-7
Access Statistics for this article
Statistical Papers is currently edited by C. Müller, W. Krämer and W.G. Müller
More articles in Statistical Papers from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().