Regression estimator in ranked set sampling

Research output: Contribution to journalArticlespeer-review

81 Citations (Scopus)


Ranked set sampling (RSS) utilizes inexpensive auxiliary information about the ranking of the units in a sample to provide a more precise estimator of the population mean of the variable of interest Y, which is either difficult or expensive to measure. However, the ranking may not be perfect in most situations. In this paper, we assume that the ranking is done on the basis of a concomitant variable X. Regression-type RSS estimators of the population mean of Y will be proposed by utilizing this concomitant variable X in both the ranking process of the units and the estimation process when the population mean of X is known. When X has unknown mean, double sampling will be used to obtain an estimate for the population mean of X. It is found that when X and Y jointly follow a bivariate normal distribution, our proposed RSS regression estimator is more efficient than RSS and simple random sampling (SRS) naive estimators unless the correlation between X and Y is low (|p| < 0.4). Moreover, it is always superior to the regression estimator under SRS for all ρ. When normality does not hold, this approach could still perform reasonably well as long as the shape of the distribution of the concomitant variable X is only slightly departed from symmetry. For heavily skewed distributions, a remedial measure will be suggested. An example of estimating the mean plutonium concentration in surface soil on the Nevada Test Site, Nevada, U.S.A., will be considered. Copyright © 1997 International Biometric Society.
Original languageEnglish
Pages (from-to)1070-1080
Issue number3
Publication statusPublished - Sept 1997


Yu, P. L. H., & Lam, K. (1997). Regression estimator in ranked set sampling. Biometrics, 53(3), 1070-1080. doi: 10.2307/2533564


Dive into the research topics of 'Regression estimator in ranked set sampling'. Together they form a unique fingerprint.