A workflow for data documentation using Stata
Luiza Cardoso de Andrade,
Kristoffer Bjarkefur,
Benjamin Daniels and
Avnish Singh
Additional contact information
Luiza Cardoso de Andrade: The World Bank, Development Impact Evaluation
Kristoffer Bjarkefur: World Bank
Avnish Singh: World Bank
2022 Stata Conference from Stata Users Group
Abstract:
This presentation introduces three commands that provide new functionality for high-quality, transparent data handling. First, iecorrect uses human-readable spreadsheets to document all changes (corrections) to data points and implements them in one line of Stata code. Second, iecodebook export creates data dictionaries and includes new features for validating the structure or contents of datasets and for creating replication datasets. Third, iesave extends Stata's save command by tracking changes to datasets over time in a Git-friendly way. Together, these commands allow users to access data descriptions and changelogs without reviewing Stata code, and they allow team members to contribute to data quality control without using Stata. In addition to the commands, the presentation will discuss general challenges of documenting datasets that the authors solved while developing them.
Date: 2022-08-11
Downloads: http://repec.org/usug2022/US22_Andrade.zip
Persistent link: https://EconPapers.repec.org/RePEc:boc:usug22:11
Bibliographic data for series maintained by Christopher F Baum.