EconPapers    
Economics at your fingertips  
 

Information-based optimal subdata selection for non-linear models

Jun Yu (), Jiaqi Liu () and HaiYing Wang ()
Additional contact information
Jun Yu: Beijing Institute of Technology
Jiaqi Liu: University of Connecticut
HaiYing Wang: University of Connecticut

Statistical Papers, 2023, vol. 64, issue 4, No 5, 1069-1093

Abstract: Abstract Subdata selection methods provide flexible tradeoffs between computational complexity and statistical efficiency in analyzing big data. In this work, we investigate a new algorithm for selecting informative subdata from massive data for a broad class of models, including generalized linear models as special cases. A connection between the proposed method and many widely used optimal design criteria such as A-, D-, and E-optimality criteria is established to provide a comprehensive understanding of the selected subdata. Theoretical justifications are provided for the proposed method, and numerical simulations are conducted to illustrate the superior performance of the selected subdata.

Keywords: Generalized linear models; Information matrix; Massive data; Optimality criteria (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00362-023-01430-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:stpapr:v:64:y:2023:i:4:d:10.1007_s00362-023-01430-3

Ordering information: This journal article can be ordered from
http://www.springer. ... business/journal/362

DOI: 10.1007/s00362-023-01430-3

Access Statistics for this article

Statistical Papers is currently edited by C. Müller, W. Krämer and W.G. Müller

More articles in Statistical Papers from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:stpapr:v:64:y:2023:i:4:d:10.1007_s00362-023-01430-3