1. Introduction
The ranked set sampling (RSS) procedure has been used advantageously in agriculture, forestry, environmental studies, ecological studies and recently in human studies where the exact measurement of units is either difficult or expensive. For example, in forestry, the measurement of the stem volume of standing trees is difficult but the ranking of the trees using their height and diameter at breast height is rather easy. For such situations, McIntyre (1952) [1] introduced RSS to estimate the population mean. RSS is a cost-efficient alternative to simple random sampling (SRS) if observations can be ranked according to the characteristic under investigation by means of visual inspection or other methods not requiring actual measurements. McIntyre (1952) [1] indicated that the RSS procedure is superior to the SRS procedure in estimating the population mean. Further, Dell and Clutter (1972) [2] and Takahasi and Wakimoto (1968) [3] provided a mathematical foundation for RSS. Dell and Clutter (1972) [2] also showed that the estimator for the population mean based on RSS is at least as efficient as the estimator based on SRS with the same number of measurements even when there are ranking errors. Bhoj (2001) [4] introduced RSS with unequal samples. Recently, some novel versions of RSS have been suggested, for example, except extreme ranked set sampling (Aldrabseh and Ismail, 2023 [5]); partial stratified ranked set sampling (Almanjahie et al., 2023 [6]); and dual ranked set sampling (Taconeli, 2023 [7]). Some other versions are found in Latpate et al. (2021) [8]. Some classes of estimators using RSS are presented by Bhushan et al. (2022) [9] and Yusuf et al. (2023) [10].
Substantial work is found in the area of unequal allocations of RSS (Kaur et al., 1997, 2000 [11,12]; Tiwari and Chandra, 2011 [13]; Tiwari et al., 2023 [14]). Bhoj and Kushary (2016) [15] proposed RSS with unequal samples for positively skewed distributions with heavy right tails. In the present paper, we instead assign unequal weights, rather than unequal allocations, to improve the precision of the estimators. Further, weight assignment is rather easy and cost-effective compared with the repeated allocation of order statistics.
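The selection of a ranked set sample of size k involves drawing k random samples with k units in each sample. The units in each sample are ranked using judgment or other methods not requiring actual measurements. The unit with the lowest rank is measured from the first sample, the unit with the second-lowest rank is measured from the second sample, and the procedure is continued until the unit with the highest rank is measured from the last sample. The $k^2$ ordered observations in the k samples can be displayed in matrix form as
$$
\begin{matrix}
y_{(1:k)1} & y_{(2:k)1} & \cdots & y_{(k:k)1} \\
y_{(1:k)2} & y_{(2:k)2} & \cdots & y_{(k:k)2} \\
\vdots & \vdots & \ddots & \vdots \\
y_{(1:k)k} & y_{(2:k)k} & \cdots & y_{(k:k)k}
\end{matrix}
$$
where $y_{(i:k)j}$ denotes the unit ranked $i$th in the $j$th sample.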
We measure only the k diagonal observations, and they constitute the ranked set sample. We note that these k observations are independently but not identically distributed. In RSS, k is usually small to reduce the ranking errors; therefore, to increase the sample size, the above procedure is repeated $m$ times to obtain a sample of size $n = mk$. In this paper, we assume $m = 1$; i.e., the sample size is equal to the set size.
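The selection procedure can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: ranking is performed on the true values (perfect ranking), whereas in practice it would rely on judgment or an inexpensive auxiliary variable, and the function name `ranked_set_sample` and the LN(0,1) generator are choices made here for the example.

```python
import random

def ranked_set_sample(k, draw, rng):
    """Return one ranked set sample of size k (a single cycle).

    draw(rng) yields the value of one unit. Ranking uses the true values,
    i.e. perfect ranking is ASSUMED for this illustration.
    """
    measured = []
    for i in range(k):
        units = sorted(draw(rng) for _ in range(k))  # rank the k units of sample i + 1
        measured.append(units[i])                    # measure only the unit ranked i + 1
    return measured

rng = random.Random(2024)
sample = ranked_set_sample(4, lambda r: r.lognormvariate(0.0, 1.0), rng)
print(sample)
```

Only k of the $k^2$ drawn units are measured, which is where the cost saving of RSS comes from.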
In the present paper, our main interest is to estimate the population mean for positively skewed distributions with a longer right tail. We propose estimators based on weighted ranked set sampling (WRSS) and compare their performance with that of the estimators based on the usual RSS procedure and Neyman’s optimal allocation model. In Section 2, we summarize the estimators of the population mean based on the RSS procedure and Neyman’s optimal allocation. In Section 3, we propose our WRSS procedure to estimate the population mean of skewed distributions. First, we introduce the WRSS procedure, where we assign one low weight to the highest order statistic and calculate the relative precisions of the estimators based on WRSS, RSS and Neyman’s optimal procedure with respect to the estimator based on SRS. The relative precisions are obtained for four positively skewed distributions. We also compute one set of weights for all four distributions for each k. In Section 4, we derive optimal weights for the lowest and highest order statistics for the chosen distributions for each k. We then obtain one set of weights for the lowest and highest order statistics for each k, which maximizes the sum of the relative precisions of the four distributions. In Section 5, we generalize the procedure to optimal weights for all order statistics for k = 4 and k = 5 for each distribution. We also obtain one set of weights for each k for the four chosen distributions. In Section 6, to see the effect of increasing skewness, the relative precisions of the estimators for the lognormal family of distributions are compared. In Section 7, we summarize the results and give recommendations.
  2. Estimation of Mean
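We first consider the usual RSS estimator of the population mean. Let $y_{(i:k)}$ denote the value of the characteristic under study for the $i$th order statistic. The mean and variance of the $i$th order statistic for set size k are denoted by $\mu_{(i:k)}$ and $\sigma^2_{(i:k)}$, respectively. We denote the population mean and variance by $\mu$ and $\sigma^2$, respectively. Then, the unbiased estimator of $\mu$ based on RSS is given by
$$\hat{\mu}_{RSS} = \frac{1}{k}\sum_{i=1}^{k} y_{(i:k)},$$
with the variance
$$\mathrm{Var}(\hat{\mu}_{RSS}) = \frac{1}{k^2}\sum_{i=1}^{k} \sigma^2_{(i:k)}.$$
The relative precision of $\hat{\mu}_{RSS}$ compared to the estimator based on SRS with the same number of observations k (Bhoj and Chandra, 2019 [16]) is
$$RP_{RSS} = \frac{\sigma^2/k}{\mathrm{Var}(\hat{\mu}_{RSS})} = \frac{\sigma^2}{\overline{\sigma^2}},$$
where $\overline{\sigma^2} = \frac{1}{k}\sum_{i=1}^{k}\sigma^2_{(i:k)}$ is the average within-rank variance.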
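As a numerical illustration, the relative precision of the RSS estimator (population variance divided by the average within-rank variance) can be evaluated whenever the moments of the order statistics are known. The sketch below assumes a Pareto population with unit scale and shape `alpha`, for which the order-statistic moments have a closed form through the Beta distribution; taking P(3.5) to mean this parametrization is an assumption, and the function names are introduced here only for the example.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for a Pareto(alpha) population with scale 1 (needs alpha > r).

    Uses X = U**(-1/alpha): the i-th smallest X corresponds to the
    (k-i+1)-th smallest uniform, which is Beta(k-i+1, i) distributed.
    """
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def os_means_vars(k, alpha):
    """Means and variances of the k order statistics."""
    means = [pareto_os_moment(i, k, alpha, 1) for i in range(1, k + 1)]
    second = [pareto_os_moment(i, k, alpha, 2) for i in range(1, k + 1)]
    return means, [s - m * m for s, m in zip(second, means)]

def rp_rss(k, alpha):
    """Population variance divided by the average within-rank variance."""
    _, variances = os_means_vars(k, alpha)
    pop_var = alpha / ((alpha - 1) ** 2 * (alpha - 2))
    return pop_var / (sum(variances) / k)

for k in range(2, 6):
    print(k, round(rp_rss(k, 3.5), 4))
```

The printed values are for this assumed parametrization and are not meant to reproduce the tables of the paper.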
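For skewed distributions, Neyman’s allocation
$$n_i = \frac{n\,\sigma_{(i:k)}}{\sum_{j=1}^{k}\sigma_{(j:k)}}, \quad i = 1, \ldots, k,$$
where $\sigma_{(i:k)}$ is the standard deviation of the $i$th order statistic and $n_i$ is the number of measurements allocated to the $i$th rank, provides the optimal allocation. The relative precision of the unbiased estimator of the population mean based on this model with respect to SRS with the same number of observations n is given by the following equation (Bhoj and Chandra, 2019 [16]):
$$RP_{Ney} = \frac{\sigma^2}{(\bar{\sigma})^2},$$
where $\sigma^2$ is the population variance and $\bar{\sigma} = \frac{1}{k}\sum_{i=1}^{k}\sigma_{(i:k)}$ is the average within-rank standard deviation.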
There are some unequal allocation models for skewed distributions in the literature (see the “t” and “s, t” models (Kaur et al., 1997 [11]), the systematic model (Tiwari and Chandra, 2011 [13]) and the simple model (Chandra et al., 2018 [17]; Bhoj and Chandra, 2019 [16])). Neyman’s allocation does not, in general, provide integer values of the allocations $n_i$, which are necessary for any application. The procedure for making them integers is shown in Bhoj and Chandra (2019) [16] and is used in this paper. It is noted that the inequality $RP_{Ney} \ge RP_{RSS}$, i.e., the relative precision under Neyman’s allocation is at least that under RSS, always holds for the skewed distributions.
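The allocation and the inequality can be checked numerically. The sketch below again assumes a unit-scale Pareto population with shape `alpha` (an assumption about what P(3.5) denotes) and returns the real-valued Neyman allocation; the integer-rounding procedure of Bhoj and Chandra (2019) [16] is not reproduced here, and the total `n = 20` is an arbitrary illustrative value.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for Pareto(alpha), scale 1 (needs alpha > r)."""
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def os_variances(k, alpha):
    """Variances of the k order statistics."""
    return [pareto_os_moment(i, k, alpha, 2) - pareto_os_moment(i, k, alpha, 1) ** 2
            for i in range(1, k + 1)]

def pareto_variance(alpha):
    return alpha / ((alpha - 1) ** 2 * (alpha - 2))

def neyman_allocation(k, alpha, n):
    """Real-valued allocation proportional to the within-rank standard deviations."""
    sds = [math.sqrt(v) for v in os_variances(k, alpha)]
    total = sum(sds)
    return [n * s / total for s in sds]

def rp_rss(k, alpha):
    """Population variance over the average within-rank variance."""
    return pareto_variance(alpha) / (sum(os_variances(k, alpha)) / k)

def rp_neyman(k, alpha):
    """Population variance over the squared average within-rank standard deviation."""
    mean_sd = sum(math.sqrt(v) for v in os_variances(k, alpha)) / k
    return pareto_variance(alpha) / mean_sd ** 2

alloc = neyman_allocation(4, 3.5, 20)
print([round(a, 2) for a in alloc])
print(round(rp_rss(4, 3.5), 4), round(rp_neyman(4, 3.5), 4))
```

The inequality follows because the mean of the squared standard deviations is never smaller than the square of their mean.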
  3. WRSS with One Optimal Weight
In this section, we propose weighted ranked set sampling (WRSS) with an optimal weight for the largest order statistic because, for positively skewed distributions, the largest order statistic has the highest variance and contributes a higher bias to the estimator of the mean. We define the weights 
  as
      
The exact values of the weights are proposed as follows: 
Our weighted estimator for the population mean 
 is
      
The relative precision of our biased estimator 
 with respect to the estimator based on SRS is
      
The weight is chosen so that the relative precision of the weighted estimator is maximized. To find the optimum weight for each k, a program was developed with the “Solver Tool” of MS Excel version 2016: the weight was varied iteratively and the relative precision was evaluated until it reached its maximum. For values of the weight above or below this optimum, the relative precision decreases.
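The search for an optimum weight can be sketched in Python with a plain grid search in place of the Excel Solver. The weighting scheme used here is hypothetical and may differ from the weights defined above: the largest order statistic receives weight `w` and the remaining k - 1 order statistics share `1 - w` equally. The population is again assumed to be a unit-scale Pareto with shape `alpha`, and the relative precision is taken as the SRS variance divided by the mean square error (variance plus squared bias) of the weighted estimator.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for Pareto(alpha), scale 1 (needs alpha > r)."""
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def os_means_vars(k, alpha):
    means = [pareto_os_moment(i, k, alpha, 1) for i in range(1, k + 1)]
    second = [pareto_os_moment(i, k, alpha, 2) for i in range(1, k + 1)]
    return means, [s - m * m for s, m in zip(second, means)]

def rp_one_weight(w, k, alpha):
    """RP versus SRS of sum_i w_i * y_(i:k) under the ASSUMED weights:
    w for the largest order statistic, (1 - w) / (k - 1) for the others."""
    means, variances = os_means_vars(k, alpha)
    weights = [(1.0 - w) / (k - 1)] * (k - 1) + [w]
    pop_mean = alpha / (alpha - 1)
    pop_var = alpha / ((alpha - 1) ** 2 * (alpha - 2))
    variance = sum(wi * wi * v for wi, v in zip(weights, variances))
    bias = sum(wi * m for wi, m in zip(weights, means)) - pop_mean
    return (pop_var / k) / (variance + bias * bias)

def best_weight(k, alpha, steps=1000):
    """Grid search over w in [0, 1]; a stand-in for the Solver iterations."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda w: rp_one_weight(w, k, alpha))

for k in range(2, 6):
    w = best_weight(k, 3.5)
    print(k, w, round(rp_one_weight(w, k, 3.5), 4))
```

Setting `w = 1 / k` recovers the ordinary RSS mean, so the maximized relative precision can never fall below that of RSS under this scheme.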
We computed the relative precision of the WRSS estimator for all four chosen distributions, lognormal (LN(0,1)), Pareto (P(3.5) and P(4.5)) and Weibull (W(0.5)), and k = 2(1)5. The optimal weight and the relative precisions of the estimators based on RSS, Neyman’s allocation and WRSS for these distributions and k = 2(1)5 are presented in Table 1. The relative precisions of the WRSS estimator are much higher than those of the estimator based on the RSS procedure. Furthermore, they are higher than those based on Neyman’s optimal allocation model for all four distributions when $k \le 4$. All relative precisions increase as k increases for LN(0,1), P(3.5) and P(4.5). However, for W(0.5), the relative precision of the WRSS estimator decreases as k increases. This may be because the distribution W(0.5) has extremely large skewness and kurtosis.
Next, we attempt to compute one set of weights for the four sample sizes that works well for all four chosen distributions. In these computations, the weight was determined so that the sum of the relative precisions of the WRSS estimator for the four distributions is close to its maximum. This optimum weight was found using the same iteration procedure in the developed Excel program.
The optimum weights and the corresponding relative precisions for the four chosen distributions and four sample sizes are presented in Table 2. The relative precisions of the WRSS estimator in Table 2 are slightly lower than the ones in Table 1, as expected. This is because the weights in Table 2 differ slightly from the distribution-specific optimum weights given in Table 1. However, the pattern of relative precisions remains the same in both tables (Table 1 and Table 2).
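The idea of one common weight can be sketched by maximizing the sum of relative precisions over several distributions. The example below is restricted to P(3.5) and P(4.5), because their order-statistic moments have a simple closed form, and it reuses the same hypothetical scheme as before (weight `w` on the largest order statistic, the rest shared equally); both restrictions are assumptions of this sketch rather than the setup behind Table 2.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for Pareto(alpha), scale 1 (needs alpha > r)."""
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def rp_one_weight(w, k, alpha):
    """RP versus SRS under the ASSUMED one-weight scheme."""
    means = [pareto_os_moment(i, k, alpha, 1) for i in range(1, k + 1)]
    variances = [pareto_os_moment(i, k, alpha, 2) - m * m
                 for i, m in zip(range(1, k + 1), means)]
    weights = [(1.0 - w) / (k - 1)] * (k - 1) + [w]
    pop_mean = alpha / (alpha - 1)
    pop_var = alpha / ((alpha - 1) ** 2 * (alpha - 2))
    variance = sum(wi * wi * v for wi, v in zip(weights, variances))
    bias = sum(wi * m for wi, m in zip(weights, means)) - pop_mean
    return (pop_var / k) / (variance + bias * bias)

def best_weight(k, alpha, steps=1000):
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda w: rp_one_weight(w, k, alpha))

def common_weight(k, alphas, steps=1000):
    """Weight maximizing the SUM of relative precisions over the distributions."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda w: sum(rp_one_weight(w, k, a) for a in alphas))

alphas = (3.5, 4.5)
for k in range(2, 6):
    w = common_weight(k, alphas)
    print(k, w, [round(rp_one_weight(w, k, a), 4) for a in alphas])
```

Because each relative precision is unimodal in `w`, the common weight lies between the distribution-specific optima, which is why the loss relative to the individually optimized weights stays small.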
  4. WRSS with Two Optimal Weights
In this section, we propose a WRSS with two optimal weights for the two extreme order statistics. Here, the weights 
  for 
 are defined as
      
The proposed exact weights are as follows: 
where 
.
Our estimator of population mean is
      
The relative precision of 
 with respect to the estimator based on SRS is
      
We calculate the optimal values of the two weights using the iteration method. Based on these values, we computed the relative precision of the two-weight WRSS estimator along with those of the estimators based on RSS and Neyman’s allocation for the four chosen distributions and sample sizes k = 3, 4 and 5; the results are presented in Table 3. The gains in precision of the two-weight WRSS estimator over the one-weight WRSS estimator are marginal. The relative precisions of the two-weight WRSS estimator are substantially higher than those of the estimator based on RSS. The two-weight WRSS estimator is superior to the estimator based on Neyman’s optimal allocation model for all values of k for the LN(0,1) and P(3.5) distributions. Its relative precisions are also higher than those of Neyman’s allocation for the other two distributions for k = 3 and 4. For k = 5, the difference between the two methods for these two distributions is marginal.
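A two-dimensional version of the search can be sketched in the same way. The scheme below is hypothetical and may differ from the weights defined in this section: the smallest order statistic receives weight `a`, the largest receives weight `b`, and the k - 2 middle order statistics share `1 - a - b` equally. A unit-scale Pareto population with shape `alpha` is assumed, and a grid search stands in for the iteration method.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for Pareto(alpha), scale 1 (needs alpha > r)."""
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def moments(k, alpha):
    """Order-statistic means and variances plus population mean and variance."""
    means = [pareto_os_moment(i, k, alpha, 1) for i in range(1, k + 1)]
    variances = [pareto_os_moment(i, k, alpha, 2) - m * m
                 for i, m in zip(range(1, k + 1), means)]
    pop_mean = alpha / (alpha - 1)
    pop_var = alpha / ((alpha - 1) ** 2 * (alpha - 2))
    return means, variances, pop_mean, pop_var

def weighted_rp(weights, means, variances, pop_mean, pop_var):
    """SRS variance divided by the MSE of sum_i w_i * y_(i:k)."""
    k = len(weights)
    variance = sum(w * w * v for w, v in zip(weights, variances))
    bias = sum(w * m for w, m in zip(weights, means)) - pop_mean
    return (pop_var / k) / (variance + bias * bias)

def best_two_weights(k, alpha, steps=200):
    """Grid search over (a, b) with a + b <= 1 under the ASSUMED scheme (k >= 3)."""
    means, variances, pop_mean, pop_var = moments(k, alpha)
    best = (0.0, None, None)
    for i in range(steps + 1):
        for j in range(steps + 1 - i):
            a, b = i / steps, j / steps
            mid = (1.0 - a - b) / (k - 2)
            rp = weighted_rp([a] + [mid] * (k - 2) + [b],
                             means, variances, pop_mean, pop_var)
            if rp > best[0]:
                best = (rp, a, b)
    return best

for k in (3, 4, 5):
    rp, a, b = best_two_weights(k, 3.5)
    print(k, a, b, round(rp, 4))
```

The one-weight scheme sketched earlier is a special case of this family, so adding the second weight can only increase the attainable relative precision, up to the resolution of the grid.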
As we did for the one-weight estimator, we attempted to compute one set of values of the two weights for the three sample sizes which works well for all four chosen distributions. In these computations, the two weights were determined so that the sum of the relative precisions of the estimator for the four distributions was close to its maximum. The values of the two weights and the relative precisions of the estimators based on RSS, Neyman’s allocation and WRSS for the three sample sizes and four chosen distributions are presented in Table 4. The relative precisions of the two-weight estimator in Table 4 are higher than those of the one-weight estimator for each k in Table 2. The pattern of relative precisions is the same as seen in Table 3.
  5. WRSS with All Optimal Weights
Now, we extend WRSS with optimal weights for all order statistics for k = 4 and k = 5. We take , and determine the optimal values of C and  by minimizing the mean square error (MSE) of the estimator by using , .
The values of  are chosen so that the value of  is maximized. Then we repeat the procedure of computing the optimal values of C and  with these new ’s. The procedure is repeated until the value of  reaches the maximum. We performed this by using the developed computer program in the “Solver Tool” of MS Excel version 2016.
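For intuition about weighting every order statistic, the following sketch swaps in a different and simpler technique than the iterative procedure described above: the closed-form minimizer of the mean square error over unrestricted linear weights. For independent order statistics with means $m_i$ and variances $v_i$, setting the gradient of $\sum_i w_i^2 v_i + (\sum_i w_i m_i - \mu)^2$ to zero gives $w_i = \mu\,(m_i / v_i) / (1 + \sum_j m_j^2 / v_j)$. A unit-scale Pareto population is assumed, and the weights are not constrained to sum to one.

```python
import math

def pareto_os_moment(i, k, alpha, r):
    """E[X_(i:k)^r] for Pareto(alpha), scale 1 (needs alpha > r)."""
    j = k - i + 1
    return (math.gamma(k + 1) * math.gamma(j - r / alpha)
            / (math.gamma(j) * math.gamma(k + 1 - r / alpha)))

def moments(k, alpha):
    """Order-statistic means and variances plus population mean and variance."""
    means = [pareto_os_moment(i, k, alpha, 1) for i in range(1, k + 1)]
    variances = [pareto_os_moment(i, k, alpha, 2) - m * m
                 for i, m in zip(range(1, k + 1), means)]
    pop_mean = alpha / (alpha - 1)
    pop_var = alpha / ((alpha - 1) ** 2 * (alpha - 2))
    return means, variances, pop_mean, pop_var

def mse(weights, means, variances, pop_mean):
    """Variance plus squared bias of sum_i w_i * y_(i:k)."""
    variance = sum(w * w * v for w, v in zip(weights, variances))
    bias = sum(w * m for w, m in zip(weights, means)) - pop_mean
    return variance + bias * bias

def optimal_weights(means, variances, pop_mean):
    """Closed-form minimizer of the MSE over all real weight vectors."""
    ratios = [m / v for m, v in zip(means, variances)]
    q = sum(m * r for m, r in zip(means, ratios))
    return [pop_mean * r / (1.0 + q) for r in ratios]

for k in (4, 5):
    m, v, mu, s2 = moments(k, 3.5)
    w = optimal_weights(m, v, mu)
    print(k, [round(x, 4) for x in w], round((s2 / k) / mse(w, m, v, mu), 4))
```

These weights depend on the moments of the assumed distribution, which mirrors the reliance on tabulated order-statistic moments in the text.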
The resulting relative precisions are presented in Table 5. We observe that the relative precisions based on all optimal weights presented in Table 5 are higher than those based on one or two optimal weights that are given in Table 1 and Table 3.
As we did in 
Section 3 and 
Section 4, we computed one set of values of 
C, 
 and different fractions of 
 for 
k = 4 and 
k = 5 which work well for all four chosen distributions. In these computations, these values were determined so that the sum of the relative precisions for the four distributions is close to its maximum. These values, along with the relative precisions of the estimators based on RSS, Neyman’s allocation and WRSS for k = 4 and k = 5 and the four chosen distributions, are presented in Table 6. As expected, the relative precisions of the WRSS estimator in Table 6 are smaller than those in Table 5. However, the pattern of relative precisions remains the same.
  6. WRSS with Increasing Skewness
In this section, we wish to study the performance of the three methods, RSS, WRSS and Neyman’s optimum allocation model, with increasing values of skewness of a family of distributions. For this purpose, the lognormal distribution, 
, is considered. The probability density function (
pdf) of 
 is given by
      
Then, skewness (
Sk) and shape parameter (
p) are given by
      
The performance of these three methods relative to SRS with k = 4 is presented in Table 7 for the lognormal family of distributions for a range of values of the population’s standard deviation. The variances of the order statistics of the family of distributions were computed from the variances of order statistics for different values of the shape parameter (p), which are readily available in Balakrishnan and Chen (1999) [18]. From Table 7, we observe that as skewness increases, the performance of (i) the RSS method decreases and (ii) Neyman’s and the WRSS methods increases. The relative precisions of the WRSS estimators based on all and two optimal weights are higher than those of Neyman’s method for all values of the shape parameter. However, the relative precision of the WRSS estimator based on one optimal weight is higher than that of Neyman’s method for all values of p > 1.9. The rate of increase in the relative precisions of the proposed estimators based on WRSS is higher than that of the estimator based on Neyman’s method (see Figure 1).
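How quickly the skewness of a lognormal distribution grows can be seen from the standard expression $(e^{\sigma^2} + 2)\sqrt{e^{\sigma^2} - 1}$, where $\sigma$ is the standard deviation of the logarithm of the variable. The sketch below evaluates it for a few values of $\sigma$; the grid of values is chosen here for illustration and is not the set of shape parameters used in Table 7.

```python
import math

def lognormal_skewness(sigma):
    """Skewness of a lognormal variable whose logarithm has standard deviation sigma."""
    w = math.exp(sigma ** 2)
    return (w + 2.0) * math.sqrt(w - 1.0)

for sigma in (0.25, 0.5, 0.75, 1.0, 1.25):
    print(sigma, round(lognormal_skewness(sigma), 4))
```

The skewness increases monotonically and rapidly with $\sigma$, so moving along the family quickly produces the heavy right tails for which the weighted estimators are designed.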
  7. Conclusions and Discussion
In this paper, we proposed a weighted ranked set sampling procedure to estimate the population mean of distributions which are positively skewed with a heavy right tail. We chose four distributions: lognormal (LN(0,1)), Pareto (P(3.5) and P(4.5)) and Weibull (W(0.5)). The means and variances of order statistics for these distributions are readily available in Harter and Balakrishnan (1996) [19]. We proposed three weighted ranked set sampling procedures. The first procedure is based on one optimal weight for the largest order statistic, the second uses two optimal weights for the two extreme order statistics and the third is based on k optimal weights. We calculated the relative precisions for each of these four distributions by using the WRSS procedure for each sample size. These relative precisions are much higher than the relative precisions of the RSS estimator of the mean. Furthermore, the relative precisions of our estimators are higher than those based on Neyman’s optimal procedure for $k \le 4$. The relative precisions of our estimators are higher than those of Neyman’s procedure even for k = 5 for some distributions. Furthermore, we attempted to compute one set of weight(s) for each k for all the distributions and compared the relative precisions of our estimator with those of the RSS and Neyman’s estimators. Although there is a slight loss in the values of relative precisions, they are still higher than those of Neyman’s model for $k \le 4$ for all four distributions and either higher than or very close to those of Neyman’s model for k = 5. In general, as expected, the relative precisions of our estimator based on all optimal weights are higher than those of our estimators based on one or two optimal weights. The gain in relative precision is, however, marginal.
We studied the performance of our proposed estimators for increasing skewness of a family of lognormal distributions. The relative precision of our estimator based on one optimal weight is higher than that of Neyman’s estimator when the shape parameter exceeds 1.9. The relative precisions of our estimators based on two and k optimal weights are uniformly higher than those of Neyman’s estimator for all values of the shape parameter considered in Table 7. From Figure 1, we see that with increasing skewness, the rate of increase of the relative precisions of our proposed estimators based on WRSS is higher than that of the estimator based on Neyman’s method.
Based on the numerical computations of relative precisions, we recommend our estimator based on WRSS procedures for estimating the population mean of skewed distributions with a heavy right tail for small values of set sizes.
Although there are studies on allocation models of RSS for improving the precision of estimators, implementing them in practice is difficult and costly. The main objectives of RSS are to avoid actual measurements while achieving higher precision, and increasing the allocations requires more actual measurements. Assigning appropriate weights to each order statistic is easier than measuring order statistics multiple times. Further, in most cases, weighted RSS has higher precision than the optimal allocation model, i.e., Neyman’s method. The weighted RSS method nevertheless has some limitations. It does not work well if the distribution is symmetric: for symmetric distributions, its precision is not higher than that of the optimal allocation models. For skewed distributions, if the form of the distribution is difficult to identify, finding the optimal weights is difficult. In such cases, approximating the distribution function, and hence the weights, is one possible solution.