Data science, big data and statistics
Pedro Galeano and
Daniel Peña
Additional contact information
Pedro Galeano: Universidad Carlos III de Madrid
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2019, vol. 28, issue 2, No 1, 289-329
Abstract:
This article analyzes how Big Data is changing the way we learn from observations. We describe the changes in statistical methods in seven areas that have been shaped by the Big Data-rich environment: the emergence of new sources of information; visualization in high dimensions; multiple testing problems; analysis of heterogeneity; automatic model selection; estimation methods for sparse models; and merging network information with statistical models. Next, we compare the statistical approach with those in computer science and machine learning and argue that the convergence of different methodologies for data analysis will be the core of the new field of data science. Then, we present two examples of Big Data analysis in which several of the new tools discussed previously are applied, such as using network information or combining different sources of data. The article concludes with some final remarks.
Keywords: Machine learning; Sparse model selection; Statistical learning; Network analysis; Multivariate data; Time series; 62A01; 62H99
Date: 2019
Downloads: (external link)
http://link.springer.com/10.1007/s11749-019-00651-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:28:y:2019:i:2:d:10.1007_s11749-019-00651-9
DOI: 10.1007/s11749-019-00651-9
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino
More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.