Efficient and robust estimation of regression and scale parameters, with outlier detection
Alain Desgagné
Computational Statistics & Data Analysis, 2021, vol. 155, issue C
Abstract:
Linear regression with normally distributed errors – including particular cases such as ANOVA, Student’s t-test or location–scale inference – is a widely used statistical procedure. In this case the ordinary least squares estimator possesses remarkable properties but is very sensitive to outliers. Several robust alternatives have been proposed, but there is still significant room for improvement. An original method of estimation is thus proposed, which offers high efficiency simultaneously in the absence and the presence of outliers, both for the estimation of the regression coefficients and the scale parameter. The approach first consists in broadening the normal assumption of the errors to a mixture of the normal and the filtered-log-Pareto (FLP), an original distribution designed to represent the outliers. The expectation–maximization (EM) algorithm is then adapted, which yields the N–FLP estimators of the regression coefficients, the scale parameter and the proportion of outliers, along with probabilities of each observation being an outlier. The performance of the N–FLP estimators is compared with the best alternatives in an extensive Monte Carlo simulation. It is shown that this method of estimation can also be used for a complete robust inference, including confidence intervals, hypothesis testing and model selection.
Keywords: location–scale family; ANOVA; Student’s t-test; M-estimators; Conflicting information (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S016794732030205X
Full text for ScienceDirect subscribers only.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:155:y:2021:i:c:s016794732030205x
DOI: 10.1016/j.csda.2020.107114
Access Statistics for this article
Computational Statistics & Data Analysis is currently edited by S.P. Azen
More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().