EconPapers    
Economics at your fingertips  
 

When are there too many collisions? Variants of the birthday problem

John E. Connett

Communications in Statistics - Theory and Methods, 2024, vol. 53, issue 12, 4487-4497

Abstract: Due to restrictions on the use of unique identifiers of individuals in data sets, there may be instances in which two or more data sets have some of the individuals in common, with no direct way to detect such occurrences. More generally, a collision occurs when two or more observations are in agreement with respect to variables associated with the observations. This article discusses several possible statistical/probabilistic approaches to determining when the number of collisions (or near-collisions) exceeds what would be expected by chance if in fact the observations are all distinct. The methods and results are related to the Birthday Problem and to Occupancy Problems.

Date: 2024
References: Add references at CitEc
Citations:

Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2023.2184186 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:53:y:2024:i:12:p:4487-4497

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20

DOI: 10.1080/03610926.2023.2184186

Access Statistics for this article

Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe

More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().

 
Page updated 2025-03-20
Handle: RePEc:taf:lstaxx:v:53:y:2024:i:12:p:4487-4497