Dealing with identifier variables in data management and analysis
P. Wilner Jeanty ()
Stata Journal, 2013, vol. 13, issue 4, 699-718
Abstract:
Identifier variables are prominent in most data files and, more often than not, are essential to fully use the information in a Stata dataset. However, rendering them in the proper format and relevant number of digits appropriate for data management and statistical analysis might pose unnerving challenges to inexperienced or even veteran Stata users. To lessen these challenges, I provide some useful tips and guard against some pitfalls by featuring two official Stata routines: the string() function and its elaborated wrapper, the tostring command. I illustrate how to use these two routines to address the difficulties caused by identifier variables in managing and analyzing data from private institutions and U.S. government agencies. Copyright 2013 by StataCorp LP.
Keywords: identifier variables; leading zeros; FIPS codes; U.S. Census Bureau; Bureau of Economic Analysis; USDA; cross-sectional data; panel data (search for similar items in EconPapers)
Date: 2013
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj13-4/dm0071/
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.stata-journal.com/article.html?article=dm0071 link to article purchase
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:13:y:2013:i:4:p:699-718
Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html
Access Statistics for this article
Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins
More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().