STRDIST: Stata module to calculate the Levenshtein distance, or edit distance, between strings
Michael Barker and
Felix Pöge
Additional contact information
Michael Barker: Georgetown University
Felix Pöge: Max Planck Institute for Innovation and Competition
Statistical Software Components from Boston College Department of Economics
Abstract:
strdist calculates the Levenshtein distance, or edit distance, between strings. It is implemented in Mata, and does not require a C plugin. ustrdist handles Unicode strings.
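For readers unfamiliar with the measure, the Levenshtein distance between two strings is the minimum number of single-character insertions, deletions, or substitutions needed to turn one into the other. The following is a minimal Python sketch of the standard dynamic-programming computation. It is an illustration only and is not taken from the module's Mata source; the function name levenshtein is hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Illustrative edit distance; NOT the strdist/ustrdist implementation."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b; only one previous row is kept.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca from a
                curr[j - 1] + 1,     # insert cb into a
                prev[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        prev = curr
    return prev[-1]
```

Under this sketch, levenshtein("kitten", "sitting") evaluates to 3: two substitutions and one insertion. Python compares strings by Unicode code point, so this illustration does not say how strdist or ustrdist treat bytes versus characters.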
Language: Stata
Requires: Stata version 10 (version 14 for ustrdist)
Keywords: edit distance; Levenshtein distance; string comparison; data management
Date: 2012-11-11, Revised 2017-12-13
Note: This module should be installed from within Stata by typing "ssc install strdist". The module is made available under terms of the GPL v3 (https://www.gnu.org/licenses/gpl-3.0.txt). Windows users should not attempt to download these files with a web browser.
Downloads:
http://fmwww.bc.edu/repec/bocode/s/strdist.ado program code (text/plain)
http://fmwww.bc.edu/repec/bocode/s/strdist.sthlp help file (text/plain)
http://fmwww.bc.edu/repec/bocode/u/ustrdist.ado program code (text/plain)
http://fmwww.bc.edu/repec/bocode/u/ustrdist.sthlp help file (text/plain)
http://fmwww.bc.edu/repec/bocode/t/test_strdist.do sample do-file (text/plain)
http://fmwww.bc.edu/repec/bocode/t/test_ustrdist.do sample do-file (text/plain)
Persistent link: https://EconPapers.repec.org/RePEc:boc:bocode:s457547
Ordering information: This software item can be ordered from
http://repec.org/docs/ssc.php
Bibliographic data for series maintained by Christopher F Baum.