Using remote access to big datasets efficiently with Stata
Volker Lang ()
Additional contact information
Volker Lang: University of Tübingen
German Stata Users' Group Meetings 2009 from Stata Users Group
Abstract:
In this talk, I discuss problems experienced and solutions developed with Stata, using remote access to a big dataset (around 10GB) of the Institute for Employment Research (IAB). I focus on two topics. The first problem is that of not directly controlling the data. The solution here is to implement good pre-documentation into the do-files to structure and improve the communication with the people hosting the remote access. Second, there are memory and running-time problems with using such a large dataset; I discuss this problem in relation to the first one. The solution here is the extensive use of sampling techniques. I present routines for entering such sampling procedures into remote-access do-files.
Date: 2009-07-16
References: Add references at CitEc
Citations:
Downloads: (external link)
http://fmwww.bc.edu/repec/dsug2009/lang.pdf (application/zip)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:dsug09:10
Access Statistics for this paper
More papers in German Stata Users' Group Meetings 2009 from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().