Translation from narrative text to standard codes variables with Stata
Federico Belotti and
Domenico Depalo
Stata Journal, 2010, vol. 10, issue 3, 458-481
Abstract:
In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.
Keywords: screening; keyword matching; narrative-text variables; standard coding schemes (search for similar items in EconPapers)
Date: 2010
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj10-3/dm0050/
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://www.stata-journal.com/article.html?article=dm0050 link to article download
Related works:
Working Paper: Translation from narrative text to standard codes variables with Stata (2009) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:10:y:2010:i:3:p:458-481
Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html
Access Statistics for this article
Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins
More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().