Robust estimation of the population mean using quantile regression under systematic sampling
Usman Shahzad,
Ishfaq Ahmad,
Nadia H. Al-Noor,
Muhammad Hanif and
Ibrahim Mufrah Almanjahie
Mathematical Population Studies, 2023, vol. 30, issue 3, 195-207
Abstract:
Regression ratio mean estimators of a study variable $$Y$$Y are defined as the coefficients provided by the ordinary least-squares regression of $$Y$$Y on a given auxiliary variable $$X$$X. They can be improved by using the coefficient of variation and the coefficient of kurtosis of $$X$$X. The influence of outliers on the estimates of the population mean of $$Y$$Y is neutralized by calculating robust regression coefficients, obtained by the method of either least absolute deviations, Huber-M, Huber-MM, Hampel-M, Tukey-M, or adjusted least squares. These robust coefficients are used to estimate the population mean of $$Y$$Y under simple random sampling. Extension to systematic sampling—which is a probability sampling in which every element of the population has equal probability of inclusion to be drawn—using the coefficients provided by quantile regression—whose coefficients result from the minimization of the sum of absolute deviations rather than from the square deviations from the regression line—requires ratio estimators of the population mean of $$Y$$Y. The mean square errors of these estimators are expressed analytically. If the quantile regression coefficient is greater than the ratio of the covariance between the study and the auxiliary variables to the variance of the auxiliary variable minus a function of the mean or the coefficient of variation, skewness, or kurtosis of $$X$$X and $$Y$$Y, then the proposed robust quantile regression mean estimator of $$Y$$Y is more efficient than the ratio estimators in the presence of outliers under systematic sampling. The reason is that these estimators only use regression coefficients and not the ratio between the population mean and sample means of the auxiliary variable $$X$$X. The aforementioned condition occurs with the values of the case study. For empirical data of 176 forest strips, the proposed estimate of the volume of timber is over 30% more efficient than the ratio estimates based on quantile regression coefficients.
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/08898480.2022.2139072 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:mpopst:v:30:y:2023:i:3:p:195-207
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/GMPS20
DOI: 10.1080/08898480.2022.2139072
Access Statistics for this article
Mathematical Population Studies is currently edited by Prof. Noel Bonneuil, Annick Lesne, Tomasz Zadlo, Malay Ghosh and Ezio Venturino
More articles in Mathematical Population Studies from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().