Dealing with the cryptic survey: Processing labels and value labels with Mata
Alfonso Miranda
Mexican Stata Users' Group Meetings 2009 from Stata Users Group
Abstract:
Survey data often come as a plain table of cryptic variable names, numbers, and letters. To make sense of the data, the researcher is given a questionnaire or a code book that lists the variable names, their descriptions, and an interpretation of the values (either numbers or strings) that each variable can take. Code books are commonly provided as plain text or in PDF format. Hence, the researcher is left “free” to type variable labels and value labels one by one. This is boring and time consuming, and it often leads to bad research habits, such as “cutting” and “processing” only the piece of the survey that the researcher needs in the short run and leaving the rest for future processing. Eventually, these habits lead to the creation of various versions of the same survey, an inability to track important changes, and an inability to reproduce research results, because the researcher cannot recreate the analyzed dataset step by step from the original source. In this talk, I will discuss how to recover the information that is contained in questionnaires or code books and how to process this information in a clean, fast, and efficient way with Mata.
Date: 2009-06-05
Downloads: http://repec.org/msug2009/mex09sug_am.pdf (presentation slides, application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:boc:msug09:05
Bibliographic data for series maintained by Christopher F Baum (baum@bc.edu).