Robust estimation of the population mean using quantile regression under systematic sampling

Shahzad, Usman; Ahmad, Ishfaq; Al-Noor, Nadia H.; Hanif, Muhammad; Almanjahie, Ibrahim Mufrah

Mathematical Population Studies, 2023, vol. 30, issue 3, 195-207

Abstract: Regression ratio mean estimators of a study variable $Y$ are defined as the coefficients provided by the ordinary least-squares regression of $Y$ on a given auxiliary variable $X$. They can be improved by using the coefficient of variation and the coefficient of kurtosis of $X$. The influence of outliers on the estimates of the population mean of $Y$ is neutralized by calculating robust regression coefficients, obtained by the method of either least absolute deviations, Huber-M, Huber-MM, Hampel-M, Tukey-M, or adjusted least squares. These robust coefficients are used to estimate the population mean of $Y$ under simple random sampling. Extension to systematic sampling (a probability sampling design in which every element of the population has an equal probability of inclusion) using the coefficients provided by quantile regression (which result from minimizing the sum of absolute deviations rather than the sum of squared deviations from the regression line) requires ratio estimators of the population mean of $Y$. The mean square errors of these estimators are expressed analytically. If the quantile regression coefficient is greater than the ratio of the covariance between the study and the auxiliary variables to the variance of the auxiliary variable minus a function of the mean or the coefficient of variation, skewness, or kurtosis of $X$ and $Y$, then the proposed robust quantile regression mean estimator of $Y$ is more efficient than the ratio estimators in the presence of outliers under systematic sampling. The reason is that these estimators use only regression coefficients and not the ratio between the population and sample means of the auxiliary variable $X$. This condition holds for the values of the case study. For empirical data on 176 forest strips, the proposed estimate of the volume of timber is over 30% more efficient than the ratio estimates based on quantile regression coefficients.
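The two ingredients named in the abstract, systematic sampling and a quantile regression slope, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the toy data, and in particular the regression-type form $\bar{y} + b_q(\bar{X} - \bar{x})$, which is the classical regression estimator with the least-squares slope swapped for a quantile regression slope $b_q$. The paper's own estimators and their mean square errors are not reproduced here. The slope is found by a brute-force scan over pairs of points, which relies on the known fact that a two-parameter quantile regression has an optimal line through at least two observations; it is only practical for small samples.

```python
import random
from itertools import combinations


def systematic_sample(population, k, start):
    """Linear systematic sample: every k-th unit, beginning at index `start`."""
    if not 0 <= start < k:
        raise ValueError("start must satisfy 0 <= start < k")
    return population[start::k]


def check_loss(residual, q):
    """Quantile (pinball) loss; q = 0.5 gives half the absolute deviation."""
    return residual * (q - (1.0 if residual < 0 else 0.0))


def quantile_line(xs, ys, q=0.5):
    """Brute-force simple quantile regression, returning (intercept, slope).

    Scans every line through two observations with distinct x values and
    keeps the one with the smallest total check loss.
    """
    best = None
    for i, j in combinations(range(len(xs)), 2):
        if xs[i] == xs[j]:
            continue  # a vertical line is not a regression line
        slope = (ys[j] - ys[i]) / (xs[j] - xs[i])
        intercept = ys[i] - slope * xs[i]
        loss = sum(check_loss(y - intercept - slope * x, q)
                   for x, y in zip(xs, ys))
        if best is None or loss < best[0]:
            best = (loss, intercept, slope)
    if best is None:
        raise ValueError("need at least two distinct x values")
    return best[1], best[2]


def regression_type_mean(sample, pop_mean_x, q=0.5):
    """Assumed illustrative form: y_bar + b_q * (X_bar - x_bar)."""
    xs = [x for x, _ in sample]
    ys = [y for _, y in sample]
    _, slope = quantile_line(xs, ys, q)
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    return y_bar + slope * (pop_mean_x - x_bar)


if __name__ == "__main__":
    # Synthetic toy population (hypothetical, not the forest-strip data).
    rng = random.Random(0)
    population = [(float(x), 2.0 * x + 1.0 + rng.gauss(0.0, 1.0))
                  for x in range(1, 41)]
    k = 4                                    # sampling interval
    sample = systematic_sample(population, k, rng.randrange(k))
    pop_mean_x = sum(x for x, _ in population) / len(population)
    print("estimate of the mean of Y:", regression_type_mean(sample, pop_mean_x))
```

Only the population mean of $X$ enters the estimate as outside information, which mirrors the abstract's remark that the proposed estimator uses regression coefficients rather than the ratio of population to sample means. For realistic sample sizes a linear-programming solver for quantile regression would replace the pairwise scan.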

Date: 2023

Downloads:
http://hdl.handle.net/10.1080/08898480.2022.2139072 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:taf:mpopst:v:30:y:2023:i:3:p:195-207

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/GMPS20

DOI: 10.1080/08898480.2022.2139072

Mathematical Population Studies is currently edited by Prof. Noel Bonneuil, Annick Lesne, Tomasz Zadlo, Malay Ghosh and Ezio Venturino
