EconPapers    
Economics at your fingertips  
 

Efficient and robust estimation of regression and scale parameters, with outlier detection

Alain Desgagné

Computational Statistics & Data Analysis, 2021, vol. 155, issue C

Abstract: Linear regression with normally distributed errors – including particular cases such as ANOVA, Student’s t-test or location–scale inference – is a widely used statistical procedure. In this case the ordinary least squares estimator possesses remarkable properties but is very sensitive to outliers. Several robust alternatives have been proposed, but there is still significant room for improvement. An original method of estimation is thus proposed, which offers high efficiency simultaneously in the absence and the presence of outliers, both for the estimation of the regression coefficients and the scale parameter. The approach first consists in broadening the normal assumption of the errors to a mixture of the normal and the filtered-log-Pareto (FLP), an original distribution designed to represent the outliers. The expectation–maximization (EM) algorithm is then adapted, which yields the N–FLP estimators of the regression coefficients, the scale parameter and the proportion of outliers, along with probabilities of each observation being an outlier. The performance of the N–FLP estimators is compared with the best alternatives in an extensive Monte Carlo simulation. It is shown that this method of estimation can also be used for a complete robust inference, including confidence intervals, hypothesis testing and model selection.

Keywords: location–scale family; ANOVA; Student’s t-test; M-estimators; Conflicting information (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S016794732030205X
Full text for ScienceDirect subscribers only.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:155:y:2021:i:c:s016794732030205x

DOI: 10.1016/j.csda.2020.107114

Access Statistics for this article

Computational Statistics & Data Analysis is currently edited by S.P. Azen

More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-19
Handle: RePEc:eee:csdana:v:155:y:2021:i:c:s016794732030205x