Simulated Probabilistic Population Estimation
Sherman Dorn
No dpc7z, SocArXiv from Center for Open Science
Abstract:
One can estimate population sizes from random nonunique identifier variables such as birthdates or first names. Banks and Pandiani (2001) developed an efficient method of estimating population from the number of birthdates, and extended that to estimating overlaps of two populations. Banks and Pandiani took advantage of the (almost) uniform distribution of birthdates using the coupon-collector model in probability. This paper develops an alternative method from simulated data and extends the method to nonuniform distributions, such as names. The appendix provides applicable R code.
Date: 2017-10-14
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://osf.io/download/59e28a45b83f6902b30757b5/
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:osf:socarx:dpc7z
DOI: 10.31219/osf.io/dpc7z
Access Statistics for this paper
More papers in SocArXiv from Center for Open Science
Bibliographic data for series maintained by OSF ().