Measuring overlap in logistic regression
Andreas Christmann and Peter Rousseeuw
No 1999,25, Technical Reports from Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen
Abstract:
In this paper we show that the recent notion of regression depth can be used as a data-analytic tool to measure the amount of separation between successes and failures in the binary response framework. Extending the algorithm for computing regression depth allows us to compute the overlap in data sets which are commonly fitted by logistic regression models. The overlap is the smallest number of observations that would need to be removed to obtain complete or quasicomplete separation, i.e. the situation where the logistic regression parameters are no longer identifiable and the maximum likelihood estimate does not exist. It turns out that the overlap is often quite small.
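To make the definition of overlap concrete, the following is a minimal brute-force sketch for the special case of a single predictor with an intercept. It is not the regression-depth algorithm of the paper; the function name `overlap_1d` and the example data are assumptions made purely for illustration. With one predictor, separation reduces to a threshold on x, and a point lying exactly on the threshold is allowed (the quasicomplete case).

```python
from itertools import product


def overlap_1d(x, y):
    """Smallest number of points to drop so that a threshold on x separates
    successes (y == 1) from failures (y == 0); ties on the threshold are allowed.

    Illustrative brute force for one predictor only (hypothetical helper).
    """
    # The optimum is attained at a data value or at +/- infinity, because a
    # point lying exactly on the threshold never counts as a violation.
    thresholds = [float("-inf"), float("inf"), *set(x)]
    best = len(x)
    for t, s in product(thresholds, (1, -1)):
        # s = +1: successes should lie at or above t, failures at or below t.
        # s = -1: the reverse orientation.
        wrong = sum(
            (yi == 1 and s * (xi - t) < 0) or (yi == 0 and s * (xi - t) > 0)
            for xi, yi in zip(x, y)
        )
        best = min(best, wrong)
    return best


# One failure (x = 4) sits among the successes, so removing it separates the data.
print(overlap_1d([1, 2, 3, 4, 5, 6], [0, 0, 1, 0, 1, 1]))  # prints 1
```

An overlap of 0 under this sketch means the data are already completely or quasicompletely separated, which is the situation in which the maximum likelihood estimate does not exist.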
Keywords: Linear discriminant analysis; Logistic regression; Outliers; Overlap; Probit regression; Regression depth; Separation
Date: 1999
Download: https://www.econstor.eu/bitstream/10419/77347/2/1999-25.pdf (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:zbw:sfb475:199925