Record linkage for character-based surnames: Evidence from chinese exclusion
Hannah M. Postel
Explorations in Economic History, 2023, vol. 87, issue C
Abstract:
This paper proposes a novel pre-processing technique to improve record linkage for historical Chinese populations. Current matching approaches are relatively ineffective due to Chinese-specific naming conventions and enumeration errors. This paper develops a three-step process that both triples the match rate over baseline and improves match accuracy. The procedures developed in this paper can be applied in part or in full to other sources of historical data, and/or modified for use with other character-based languages such as Japanese. More broadly, this approach suggests the promise of language-specific linkage procedures to boost match rates for ethnic minority groups.
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0014498322000717
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:exehis:v:87:y:2023:i:c:s0014498322000717
DOI: 10.1016/j.eeh.2022.101493
Access Statistics for this article
Explorations in Economic History is currently edited by R.H. Steckel
More articles in Explorations in Economic History from Elsevier
Bibliographic data for series maintained by Catherine Liu ().