PERCENTMATCH: Stata module to calculate the highest percentage match (near duplicates) between observations
Noble L. Kuriakose
Additional contact information
Noble L. Kuriakose: SurveyMonkey
Statistical Software Components from Boston College Department of Economics
Abstract:
percentmatch calculates the highest percent match between observations across the variables in varlist (or across all variables if varlist is not specified). Like duplicates, percentmatch compares observations to identify identical values, but it also captures near duplicates. For a pair of observations, the match percentage is the number of variables with identical values divided by the number of variables compared. percentmatch returns, for each observation, the highest match percentage found against the other observations.
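The calculation described in the abstract can be illustrated with a short sketch. This is not the module's code: it is written in Python rather than Stata, the function name highest_percent_match and the sample data are hypothetical, and it assumes that values are compared by simple equality and that the result is reported as a proportion between 0 and 1. How percentmatch itself treats missing values or scales its output is not stated here.

```python
from typing import List, Sequence


def highest_percent_match(rows: Sequence[Sequence[object]]) -> List[float]:
    """For each row, return the highest share of identical values
    it has in common with any other row (hypothetical helper)."""
    n = len(rows)
    if n == 0:
        return []
    n_vars = len(rows[0])
    best = [0.0] * n
    if n_vars == 0:
        return best  # nothing to compare
    # Compare every pair of rows once and update both rows' maxima.
    for i in range(n):
        for j in range(i + 1, n):
            same = sum(a == b for a, b in zip(rows[i], rows[j]))
            pct = same / n_vars  # identical values / number of variables
            best[i] = max(best[i], pct)
            best[j] = max(best[j], pct)
    return best


# Illustrative data: three observations, four variables each.
survey = [
    (1, 2, 3, 4),
    (1, 2, 3, 5),
    (9, 2, 8, 7),
]
result = highest_percent_match(survey)
print(result)
```

In this example the first two observations agree on three of four variables, so each has a highest match of 0.75, while the third agrees with either of the others on only one variable. The pairwise loop is quadratic in the number of observations, which is acceptable for a sketch but may be slow on large datasets.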
Language: Stata
Requires: Stata version 12
Keywords: data; management
Date: 2015-03-21
Note: This module should be installed from within Stata by typing "ssc install percentmatch". The module is made available under terms of the GPL v3 (https://www.gnu.org/licenses/gpl-3.0.txt). Windows users should not attempt to download these files with a web browser.
Downloads:
http://fmwww.bc.edu/repec/bocode/p/percentmatch.ado program code (text/plain)
http://fmwww.bc.edu/repec/bocode/p/percentmatch.sthlp help file (text/plain)
Persistent link: https://EconPapers.repec.org/RePEc:boc:bocode:s457984
Ordering information: This software item can be ordered from
http://repec.org/docs/ssc.php
More software in Statistical Software Components from Boston College Department of Economics Boston College, 140 Commonwealth Avenue, Chestnut Hill MA 02467 USA. Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum.