Stata/SQL/Python integration to emulate prospective cohort studies from big register data
Matteo Marrazzo and
Nicola Orsini
Additional contact information
Matteo Marrazzo: Karolinska Institutet
Nordic and Baltic Stata Users' Group Meeting 2019 from Stata Users Group
Abstract:
The possibilities of using Stata to interrogate and analyze big data are not widely known among health researchers. However, the ability to meld different programming tools is becoming gradually more important with the increasing mainstream availability of big data sources. The aim of this presentation is to illustrate, using existing commands such as odbc and python, how to emulate and analyze large prospective cohorts from a collection of big national registers, harvesting the power of the different engines available (for example, SQL to handle relational databases and the preprocess phase, Stata to easily perform advanced statistical analyses and Python to implement well-known modules and packages for data manipulation and plots). I use a case study in pharmaco-epidemiology to illustrate the potential of using Stata to both design and analyze such complex and large datasets.
Date: 2020-08-20
New Economics Papers: this item is included in nep-big
References: Add references at CitEc
Citations:
Downloads: (external link)
http://fmwww.bc.edu/repec/ncon19/nordic19_marrazzo.pdf
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:ncon19:12
Access Statistics for this paper
More papers in Nordic and Baltic Stata Users' Group Meeting 2019 from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().