EconPapers    
Economics at your fingertips  
 

Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets

Matthew Cefalu, John Sullivan, Narayan Sastry, Elizabeth Fussell and Todd Gardner

Working Papers from U.S. Census Bureau, Center for Economic Studies

Abstract: This article introduces the twangRDC package, which contains functions to address non-linkage in US Census Bureau datasets. The Census Bureau’s Person Identification Validation System facilitates data linkage by assigning unique person identifiers to federal, third party, decennial census, and survey data. Not all records in these datasets can be linked to the reference file and as such not all records will be assigned an identifier. This article is a tutorial for using the twangRDC to generate nonresponse weights to account for non-linkage of person records across US Census Bureau datasets.

Keywords: non-response; gradient boosting; weighting; R; Census data (search for similar items in EconPapers)
Pages: 10 pages
Date: 2024-06
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www2.census.gov/library/working-papers/2024/adrm/ces/CES-WP-24-27.pdf First version, 2024 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:cen:wpaper:24-27

Access Statistics for this paper

More papers in Working Papers from U.S. Census Bureau, Center for Economic Studies Contact information at EDIRC.
Bibliographic data for series maintained by Dawn Anderson (dawn.m.anderson@census.gov).

 
Page updated 2025-04-13
Handle: RePEc:cen:wpaper:24-27