EconPapers    
Economics at your fingertips  
 

Statistical inference in massive data sets

Runze Li, Dennis K.J. Lin and Bing Li

Applied Stochastic Models in Business and Industry, 2013, vol. 29, issue 5, 399-409

Abstract: Analysis of massive data sets is challenging owing to limitations of computer primary memory. In this paper, we propose an approach to estimate population parameters from a massive data set. The proposed approach significantly reduces the required amount of primary memory, and the resulting estimate will be as efficient if the entire data set was analyzed simultaneously. Asymptotic properties of the resulting estimate are studied, and the asymptotic normality of the resulting estimator is established. The standard error formula for the resulting estimate is proposed and empirically tested; thus, statistical inference for parameters of interest can be performed. The effectiveness of the proposed approach is illustrated using simulation studies and an Internet traffic data example. Copyright © 2012 John Wiley & Sons, Ltd.

Date: 2013
References: Add references at CitEc
Citations: View citations in EconPapers (14)

Downloads: (external link)
https://doi.org/10.1002/asmb.1927

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wly:apsmbi:v:29:y:2013:i:5:p:399-409

Access Statistics for this article

More articles in Applied Stochastic Models in Business and Industry from John Wiley & Sons
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-20
Handle: RePEc:wly:apsmbi:v:29:y:2013:i:5:p:399-409