Stata export for metadata documentation
Anne Balz,
Klaus Pforr and
Florian Thirolf
Additional contact information
Anne Balz: GESIS - Leibniz-Institut für Sozialwissenschaften
Florian Thirolf: GESIS - Leibniz-Institut für Sozialwissenschaften
German Stata Users' Group Meetings 2019 from Stata Users Group
Abstract:
Precise and detailed data documentation is essential for the secondary analysis of scientific data, whether they are survey or official microdata. Among the most important metadata in this perspective are variable and category labels and frequency distributions and descriptive statistics. To generate and publish these metadata from Stata datafiles, an efficient export interface is essential. It must be able to handle large and complex datasets, account for the specifics of different studies, and generate flexible output formats (depending on the requirements of the documentation system). As a solution to the problem described above, we present the process developed in the GML (German Microdata Lab) at GESIS. In the first step, we show how an aggregated file with all required metadata can be generated from the microdata. In the second step, this file is transformed into a standardized DDI format. Additionally, we will present the implementation for MISSY (the metadata information system for official microdata at GESIS), which includes some practical additions (for example, communication with the MISSY database to retrieve existing element identifiers, writing an output tailored to the MISSY data model).
Date: 2019-07-10
References: Add references at CitEc
Citations:
Downloads: (external link)
http://repec.org/dsug2019/germany19_balz.pdf presentation materials (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:dsug19:03
Access Statistics for this paper
More papers in German Stata Users' Group Meetings 2019 from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().