Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets
Matthew Cefalu,
John Sullivan,
Narayan Sastry,
Elizabeth Fussell and
Todd Gardner
Working Papers from U.S. Census Bureau, Center for Economic Studies
Abstract:
This article introduces the twangRDC package, which contains functions to address non-linkage in US Census Bureau datasets. The Census Bureau’s Person Identification Validation System facilitates data linkage by assigning unique person identifiers to federal, third party, decennial census, and survey data. Not all records in these datasets can be linked to the reference file and as such not all records will be assigned an identifier. This article is a tutorial for using the twangRDC to generate nonresponse weights to account for non-linkage of person records across US Census Bureau datasets.
Keywords: non-response; gradient boosting; weighting; R; Census data (search for similar items in EconPapers)
Pages: 10 pages
Date: 2024-06
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www2.census.gov/library/working-papers/2024/adrm/ces/CES-WP-24-27.pdf First version, 2024 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cen:wpaper:24-27
Access Statistics for this paper
More papers in Working Papers from U.S. Census Bureau, Center for Economic Studies Contact information at EDIRC.
Bibliographic data for series maintained by Dawn Anderson (dawn.m.anderson@census.gov).