Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data

Daraz, Umer; Alomair, Mohammed Ahmed; Albalawi, Olayan

doi:10.3390/sym16080957

Open AccessArticle

Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data

by

Umer Daraz

¹

,

Mohammed Ahmed Alomair

^2,*

and

Olayan Albalawi

³

¹

School of Mathematics and Statistics, Central South University, Changsha 410017, China

²

Department of Quantitative Methods, School of Business, King Faisal University, Al-Ahsa 31982, Saudi Arabia

³

Department of Statistics, Faculty of Science, University of Tabuk, Tabuk 71491, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Symmetry 2024, 16(8), 957; https://doi.org/10.3390/sym16080957

Submission received: 30 May 2024 / Revised: 15 July 2024 / Accepted: 19 July 2024 / Published: 26 July 2024

(This article belongs to the Section Mathematics)

Download Versions Notes

Abstract

This article suggests an improved class of efficient estimators that use various transformations to estimate the finite population variance of the study variable. These estimators are particularly helpful in situations where we know about the minimum and maximum values of the auxiliary variable, and the ranks of the auxiliary variable are associated with the study variable. Consequently, these rankings can be applied as an effective tool to improve the accuracy of the estimator. A first-order approximation is used to investigate the properties of the proposed class of estimators, such as bias and mean squared error (

M S E

) under simple random sampling. A simulation study carried out in order to measure the performance and verify the theoretical results. The suggested class of estimators has a greater percent relative efficiency (

P R E

) than the other existing estimators in all of the simulated situations, according to the results. Three symmetric and asymmetric datasets are examined in the application section in order to show the superior performance of the proposed class of estimators over the existing estimators.

Keywords:

simple random sampling; auxiliary information; study variable; minimum and maximum values; ranks; PRE

1. Introduction

Survey sampling collects accurate data on population characteristics to enhance the estimation performance while reducing the costs, time, and human resources. In many populations, a few extreme values exist, and estimating the unknown population characteristics without considering such information can be quite sensitive. Consequently, outcomes may be overstated or underestimated in certain cases. Therefore, the accuracy of classical estimators usually decreases in terms of mean square error (

M S E

) when extreme values in the dataset are present. Such information might be tempted to be eliminated from the sample. In order to adequately address this problem, it is important to include this information in the process of estimating population characteristics. Given the known smallest and largest observations of the auxiliary variable, ref. [1] offered two estimators by transforming them linearly. Such works were not studied further after that, until the works of ref. [2]. They applied the concept of using extreme values to a variety of finite population mean estimators. Using a stratified random sampling method, ref. [3] improved the estimate of the finite population mean under extreme values. For more details, see refs. [4,5,6,7] and the references therein.

The estimation problem of finite population variance is an important issue, and controlling variability in applications is challenging. This problem arises in biological and agricultural research, giving a signal that the intended results are unexpected. By carefully using supplementary information, the accuracy of the estimators can be increased. Ref. [8] was the first to discuss the utilization of auxiliary information in the calculation of population variance. Ref. [9] proposed some ratio and product type exponential estimators to estimate the population variance. To estimate the population variance, ref. [10] suggested different efficient classes of the estimator through extreme values transformations. Recently, ref. [11] used the concept of extreme values to introduce new classes of estimators for estimating the population variance with minimum mean squared errors. Ref. [12] provided some new classes of difference-cum-ratio-type exponential estimators for a finite population variance in stratified random sampling by utilizing the known information about extreme values. A variety of researchers have suggested many different kinds of estimators for calculating the population variance, including refs. [13,14,15,16,17,18,19,20,21,22,23].

The rankings of the auxiliary variable are associated with the study variable when there is a relationship between the two variables. As a result, these rankings can be utilized as a valuable tool to enhance the accuracy of the estimator. This article retains the extreme values of the auxiliary variable in the data and utilizes them as auxiliary information. As discussed by refs. [10,11], this article aims to suggest an effective class of estimators for estimating the variance of a finite population. These estimators utilize the available information on the extreme values of an auxiliary variable, as well as the ranks of the auxiliary variable under simple random sampling, in order to enhance the accuracy.

This article is divided into the following sections. Section 2 presents the concepts and notations. This section also includes information on certain existing estimators. In Section 3, we explain our proposed class of estimators. Section 4 provides the mathematical comparison. In Section 5, we simulate six different artificial populations using various probability distributions to assess the theoretical findings described in Section 4. This section also includes numerical examples to support our theoretical results. Finally, Section 6 discusses the results, as well as suggestions for future studies.

2. Concepts and Notations

Consider a finite population with size N units, denoted by

U = (U_{1}, U_{2}, U_{3}, \dots, U_{N})

. Let

y_{i}

,

x_{i},

and

r_{i}

represent the ith unit values of the study variable Y, the auxiliary variable X, and the ranks of the auxiliary variable R, respectively. For these variables, we define the population variances

S_{y}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(Y_{i} - \bar{Y})}^{2}

(1)

S_{x}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(X_{i} - \bar{X})}^{2}

(2)

and

S_{r}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(R_{i} - \bar{R})}^{2}

(3)

where

\bar{Y} = \frac{1}{N} \sum_{i = 1}^{N} Y_{i}

(4)

\bar{X} = \frac{1}{N} \sum_{i = 1}^{N} X_{i}

(5)

and

\bar{R} = \frac{1}{N} \sum_{i = 1}^{N} R_{i}

(6)

are the population means of

Y, X,

and R, respectively.

The population coefficients of variation for

Y, X,

and R are defined as

C_{y} = \frac{S_{y}}{\bar{Y}}

(7)

C_{x} = \frac{S_{x}}{\bar{X}}

(8)

and

C_{r} = \frac{S_{r}}{\bar{R}}

(9)

respectively. Furthermore, we know that the population correlation coefficients between Y and

X,

Y and

R,

and X and R are

ρ_{y x} = \frac{S_{y x}}{S_{y} S_{x}}

(10)

ρ_{y r} = \frac{S_{y r}}{S_{y} S_{r}}

(11)

and

ρ_{x r} = \frac{S_{x r}}{S_{x} S_{r}}

(12)

where

S_{y x} = \frac{1}{N - 1} \sum_{i = 1}^{N} (Y_{i} - \bar{Y}) (X_{i} - \bar{X})

(13)

S_{y r} = \frac{1}{N - 1} \sum_{i = 1}^{N} (Y_{i} - \bar{Y}) (R_{i} - \bar{R})

(14)

and

S_{x r} = \frac{1}{N - 1} \sum_{i = 1}^{N} (x_{i} - \bar{X}) (R_{i} - \bar{R})

(15)

are the population co-variances, respectively.

In order to calculate the unknown population parameter

\bar{Y}

, we adopt simple random sampling without replacement to pick a random sample of n units from the population. Let us define the sample variances

{\hat{S}}_{y}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(Y_{i} - \bar{y})}^{2}

(16)

{\hat{S}}_{x}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(X_{i} - \bar{x})}^{2}

(17)

and

{\hat{S}}_{r}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(R_{i} - \bar{r})}^{2}

(18)

where

\bar{y} = \frac{1}{n} \sum_{i = 1}^{n} Y_{i}

(19)

\bar{x} = \frac{1}{n} \sum_{i = 1}^{n} X_{i}

(20)

and

\bar{r} = \frac{1}{n} \sum_{i = 1}^{n} R_{i}

(21)

are the sample means of

Y, X,

and R, respectively. Additionally, the sample coefficients of variation are defined as

c_{y} = \frac{s_{y}}{\bar{y}}

(22)

c_{x} = \frac{s_{x}}{\bar{x}}

(23)

and

c_{r} = \frac{s_{r}}{\bar{r}}

(24)

where

s_{y}, s_{x},

and

s_{r}

denote the sample standard deviations, respectively.

For each estimator, we define the following terms in order to obtain the biases and mean square errors:

e_{0} = (\frac{s_{y}^{2} - S_{y}^{2}}{S_{y}^{2}})

,

e_{1} = (\frac{s_{x}^{2} - S_{x}^{2}}{S_{x}^{2}}),

and

e_{2} = (\frac{s_{r}^{2} - S_{r}^{2}}{S_{r}^{2}})

such that

E (e_{i h}) = 0

for i = 0, 1, 2.

E (e_{0}^{2}) = ϕ δ_{400}^{*}, E (e_{1}^{2}) = ϕ δ_{040}^{*}, E (e_{2}^{2}) = ϕ δ_{004}^{*}

E (e_{0} e_{1}) = ϕ δ_{220}^{*}, E (e_{0} e_{2}) = ϕ δ_{202}^{*}, E (e_{1} e_{2}) = ϕ δ_{022}^{*},

where

δ_{400}^{*} = (δ_{400} - 1)

,

δ_{040}^{*} = (δ_{040} - 1)

,

δ_{004}^{*} = (δ_{004} - 1), δ_{220}^{*} = (δ_{220} - 1), δ_{202}^{*} = (δ_{202} - 1),

δ_{022}^{*} = (δ_{022} - 1),

and

ϕ = (\frac{1}{n} - \frac{1}{N})

.

Also

δ_{l q s} = \frac{φ_{l q s}}{φ_{200}^{l / 2} φ_{020}^{q / 2} φ_{002}^{s / 2}},

(25)

φ_{l q s} = \frac{\sum_{i = 1}^{N} {(Y_{i} - \bar{Y})}^{l} {(X_{i} - \bar{X})}^{q} {(R_{i} - \bar{R})}^{s}}{N - 1},

(26)

where

φ_{l q s}

represents the population central moment with orders

(l, q, s)

, and

(φ_{200}, φ_{020}, φ_{002})

denotes the standard deviation of

(Y, X, R) .

The population coefficients of kurtosis are defined as

δ_{400} = \frac{φ_{400}}{φ_{200}^{2}},

(27)

δ_{040} = \frac{φ_{040}}{φ_{020}^{2}},

(28)

δ_{004} = \frac{φ_{004}}{φ_{002}^{2}},

(29)

where

φ_{400} = = \frac{\sum_{i = 1}^{N} {(Y_{i} - \bar{Y})}^{4}}{N - 1},

(30)

φ_{040} = = \frac{\sum_{i = 1}^{N} {(X_{i} - \bar{X})}^{4}}{N - 1},

(31)

φ_{004} = = \frac{\sum_{i = 1}^{N} {(R_{i} - \bar{R})}^{4}}{N - 1},

(32)

φ_{200} = = \frac{\sum_{i = 1}^{N} {(Y_{i} - \bar{Y})}^{2}}{N - 1},

(33)

φ_{020} = = \frac{\sum_{i = 1}^{N} {(X_{i} - \bar{X})}^{2}}{N - 1},

(34)

φ_{002} = = \frac{\sum_{i = 1}^{N} {(R_{i} - \bar{R})}^{2}}{N - 1},

(35)

respectively. Here

δ_{400} = β_{2 (y)}, δ_{040} = β_{2 (x)}

, and

δ_{004} = β_{2 (r)}

.

Next, we go over different existing estimators of finite population variances and compare them with the proposed class of estimators.

For population variance, the usual variance estimator of

{\hat{S}}_{y}^{2} = s_{y}^{2},

is provided by

V a r ({\hat{S}}_{y}^{2}) = ϕ S_{y}^{4} δ_{400}^{*} .

(36)

Ref. [8] suggested a ratio estimator for population variance

{\hat{S}}_{t}^{2}

, which is given by

{\hat{S}}_{t}^{2} = s_{y}^{2} (\frac{S_{x}^{2}}{s_{x}^{2}}) .

(37)

The following are the formulas for the bias and

M S E

of

{\hat{S}}_{t}^{2},

which can be found in ref. [8]

B i a s ({\hat{S}}_{t}^{2}) ≅ ϕ S_{y}^{2} (δ_{040}^{*} - δ_{220}^{*})

(38)

and

M S E ({\hat{S}}_{t}^{2}) ≅ ϕ S_{y}^{4} (δ_{400}^{*} + δ_{040}^{*} - 2 δ_{220}^{*}) .

(39)

The linear regression estimator

{\hat{S}}_{l r}^{2}

proposed by ref. [24], is defined as

{\hat{S}}_{l r}^{2} = s_{y}^{2} + b_{(s_{y}^{2}, s_{x}^{2})} (S_{x}^{2} - s_{x}^{2}),

(40)

where

b_{(s_{y}^{2}, s_{x}^{2})} = \frac{s_{y}^{2} {\hat{δ}}_{220}^{*}}{s_{x}^{2} {\hat{δ}}_{040}^{*}}

is the sample regression coefficient.

The following is the formula for the

M S E

of

{\hat{S}}_{l r}^{2},

which can be found in ref. [24]

M S E ({\hat{S}}_{l r}^{2}) ≅ ϕ S_{y}^{4} δ_{400}^{*} (1 - ρ^{* 2}),

(41)

where

ρ^{*} = \frac{δ_{220}^{*}}{\sqrt{δ_{400}^{*}} \sqrt{δ_{040}^{*}}}

.

Ref. [9] suggested an exponential ratio-type estimator

{\hat{S}}_{b t}^{2}

, which is expressed as

{\hat{S}}_{b t}^{2} = s_{y}^{2} exp (\frac{S_{x}^{2} - s_{x}^{2}}{S_{x}^{2} + s_{x}^{2}}) .

(42)

The following are the formulas for the bias and

M S E

of

{\hat{S}}_{b t}^{2},

which can be found in ref. [9]

B i a s ({\hat{S}}_{b t}^{2}) ≅ \frac{1}{2} ϕ S_{y}^{2} (\frac{3 δ_{040}^{*}}{4} - δ_{220}^{*})

(43)

and

M S E ({\hat{S}}_{b t}^{2}) ≅ ϕ S_{y}^{4} (δ_{400}^{*} + \frac{δ_{040}^{*}}{4} - δ_{220}^{*}) .

(44)

In simple random sampling, ref. [20] proposed a ratio-type estimator

{\hat{S}}_{u s}^{2}

by utilizing the kurtosis of an auxiliary variable.

{\hat{S}}_{u s}^{2} = s_{y}^{2} (\frac{S_{x}^{2} + δ_{040}}{s_{x}^{2} + δ_{040}}) .

(45)

The following are the formulas for the bias and

M S E

of

{\hat{S}}_{u s}^{2},

which can be found in ref. [20]

B i a s ({\hat{S}}_{u s}^{2}) ≅ ϕ S_{y}^{2} v_{1} (v_{1} δ_{040}^{*} - δ_{220}^{*})

(46)

and

M S E ({\hat{S}}_{u s}^{2}) ≅ ϕ S_{y}^{4} (δ_{400}^{*} + v_{1}^{2} δ_{040}^{*} - 2 v_{1} δ_{220}^{*}),

(47)

where

v_{1} = \frac{S_{x}^{2}}{S_{x}^{2} + δ_{040}}

.

Ref. [15] proposed certain ratio estimators as follows

{\hat{S}}_{a}^{2} = s_{y}^{2} (\frac{S_{x}^{2} + C_{x}}{s_{x}^{2} + C_{x}}),

(48)

{\hat{S}}_{b}^{2} = s_{y}^{2} (\frac{δ_{040} S_{x}^{2} + C_{x}}{δ_{040} s_{x}^{2} + C_{x}})

(49)

and

{\hat{S}}_{c}^{2} = s_{y}^{2} (\frac{C_{x} S_{x}^{2} + δ_{040}}{C_{x} s_{x}^{2} + δ_{040}}) .

(50)

The following are the formulas for the bias and

M S E

of

{\hat{S}}_{j}^{2} (j = a, b, c),

which can be found in ref. [15]

B i a s ({\hat{S}}_{j}^{2}) ≅ ϕ S_{y}^{2} v_{i} (v_{i} δ_{040}^{*} - δ_{220}^{*}), i = 2, 3, 4

(51)

and

M S E ({\hat{S}}_{j}^{2}) ≅ ϕ S_{y}^{4} (δ_{400}^{*} + v_{i}^{2} δ_{040}^{*} - 2 v_{i} δ_{220}^{*}),

(52)

where

v_{2} = \frac{S_{x}^{2}}{S_{x}^{2} + C_{x}}, v_{3} = \frac{δ_{040} S_{x}^{2}}{δ_{040} S_{x}^{2} + C_{x}}, v_{4} = \frac{C_{x} S_{x}^{2}}{C_{x} S_{x}^{2} + δ_{040}}

.

3. Proposed Estimator

This section, which is inspired by refs. [10,11], presents a new class of effective estimators that estimate the finite population variance by using the largest and smallest values and rankings of the auxiliary variable under simple random sampling.

{\hat{S}}_{A}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{ξ_{1} (S_{x}^{2} - s_{x}^{2})}{ξ_{1} (S_{x}^{2} + s_{x}^{2}) + 2 ξ_{2}}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}],

(53)

where

(δ_{i}, i = 1, 2)

represent known constant values, whereas

(ξ_{i}, i = 1, 2, 3, 4)

represent auxiliary variable parameters. The largest and smallest values of the auxiliary variable are denoted by

(x_{M}, x_{m})

, while the largest and smallest values of the ranks of the auxiliary variable are denoted by

(R_{M}, R_{m})

. Table 1 shows the known values for

ξ_{1}, ξ_{2}

, while

ξ_{3} = 1

and

ξ_{4} = R_{M} - R_{m}

. Table 1 lists the various classes of the proposed estimator derived from (53).

Properties of the Proposed Estimator

The bias and

M S E

of the proposed estimator

{\hat{S}}_{A}^{2}

are now obtained by rewriting (53) in terms of errors, i.e.,

\begin{matrix} {\hat{S}}_{A}^{2} = S_{y}^{2} (1 + e_{0}) exp {(\frac{- δ_{1} k_{4} e_{1}}{2} (1 + \frac{k_{4} e_{1}}{2}))}^{- 1} exp {(\frac{- δ_{2} k_{5} e_{2}}{2} (1 + \frac{k_{5} e_{2}}{2}))}^{- 1} \end{matrix}

(54)

where

k_{4} = \frac{ξ_{1} S_{x}^{2}}{ξ_{1} S_{x}^{2} + ξ_{2}}

and

k_{5} = \frac{ξ_{3} S_{r}^{2}}{ξ_{3} S_{r}^{2} + ξ_{4}}

.

Using the Taylor series under the first order of approximation, we obtain

\begin{matrix} {\hat{S}}_{A}^{2} - S_{y}^{2} ≅ S_{y}^{2} [e_{0} - \frac{δ_{1} k_{4}}{2} e_{1} - \frac{δ_{2} k_{5}}{2} e_{2} + (\frac{δ_{1} k_{4}^{2}}{4} + \frac{δ_{1}^{2} k_{4}^{2}}{8}) e_{1}^{2} + (\frac{δ_{2} k_{5}^{2}}{4} + \frac{δ_{2}^{2} k_{5}^{2}}{8}) e_{2}^{2} \\ - \frac{δ_{1} k_{4}}{2} e_{0} e_{1} - \frac{δ_{2} k_{5}}{2} e_{0} e_{2} + \frac{δ_{1} δ_{2} k_{4} k_{5}}{2} e_{1} e_{2}] . \end{matrix}

(55)

Using (55), the bias of

{\hat{S}}_{A}^{2}

is given by

\begin{matrix} B i a s ({\hat{S}}_{A}^{2}) ≅ ϕ S_{y}^{2} [(\frac{δ_{1} k_{4}^{2}}{4} + \frac{δ_{1}^{2} k_{4}^{2}}{8}) δ_{040}^{*} + (\frac{δ_{2} k_{5}^{2}}{4} + \frac{δ_{2}^{2} k_{5}^{2}}{8}) δ_{004}^{*} - \frac{δ_{1} k_{4}}{2} δ_{220}^{*} \\ - \frac{δ_{2} k_{5}}{2} δ_{202}^{*} + \frac{δ_{1} δ_{2} k_{4} k_{5}}{2} δ_{022}^{*}] . \end{matrix}

(56)

We derived an

M S E

by squaring both sides of (55) and taking the expectation. The equation is as follows

\begin{matrix} M S E ({\hat{S}}_{A}^{2}) ≅ ϕ S_{y}^{4} [δ_{400}^{*} + \frac{δ_{1}^{2} k_{4}^{2}}{4} δ_{040}^{*} + \frac{δ_{2}^{2} k_{5}^{2}}{4} δ_{004}^{*} - δ_{1} k_{4} δ_{220}^{*} - δ_{2} k_{5} δ_{202}^{*} + \frac{δ_{1} δ_{2} k_{4} k_{5}}{2} δ_{022}] . \end{matrix}

(57)

As the given constant values of

(δ_{1} = δ_{2} = 1)

are substituted into (56) and (57), the bias and

M S E

for

{\hat{S}}_{A}^{2}

can be rewritten. After the some simplifications, we obtain

\begin{matrix} B i a s ({\hat{S}}_{A}^{2}) ≅ ϕ S_{y}^{2} [\frac{3}{8} (k_{4}^{2} δ_{040}^{*} + k_{5}^{2} δ_{004}^{*}) - \frac{1}{2} (k_{4} δ_{220}^{*} + k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*})] \end{matrix}

(58)

and

\begin{matrix} M S E ({\hat{S}}_{A}^{2}) ≅ ϕ S_{y}^{4} [δ_{400}^{*} + \frac{1}{4} (k_{4}^{2} δ_{040}^{*} + k_{5}^{2} δ_{004}^{*}) - \frac{1}{2} (2 k_{4} δ_{220}^{*} + 2 k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*})] . \end{matrix}

(59)

4. Mathematical Comparison

The comparison of the suggested class of estimators

{\hat{S}}_{A}^{2}

, with other existing estimators

{\hat{S}}_{y}^{2}, {\hat{S}}_{t}^{2}, {\hat{S}}_{l r}^{2}, {\hat{S}}_{b t}^{2}, {\hat{S}}_{u s}^{2}

,

{\hat{S}}_{k c_{i}}^{2},

is covered in this section.

Condition (i): By (36) and (59)

V a r ({\hat{S}}_{y}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

(2 k_{4} δ_{220}^{*} + 2 k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}) > \frac{1}{2} (k_{4}^{2} δ_{040}^{*} + k_{5}^{2} δ_{004}^{*}) .

Condition (ii): By (39) and (59)

M S E ({\hat{S}}_{t}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

[2 δ_{220}^{*} (k_{4} - 2) + 2 k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}] > \frac{1}{2} [δ_{040}^{*} (k_{4}^{2} - 4) + k_{5}^{2} δ_{004}^{*}] .

Condition (iii): By (41) and (59)

M S E ({\hat{S}}_{l r}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

(2 k_{4} δ_{220}^{*} + 2 k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}) > \frac{1}{2} (4 ρ_{y x}^{* 2} δ_{400}^{*} + k_{4}^{2} δ_{040}^{*} + k_{5}^{2} δ_{004}^{*}) .

Condition (iv): By (44) and (59)

M S E ({\hat{S}}_{b t}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

[2 δ_{220}^{*} (k_{4} - 1) + 2 k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}] > \frac{1}{2} [δ_{040}^{*} (k_{4}^{2} - 1) + k_{5}^{2} δ_{004}^{*}] .

Condition (v): By (47) and (59)

M S E ({\hat{S}}_{u s}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

[δ_{220}^{*} (k_{4} - 4 v_{1}) + k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}] > \frac{1}{2} [δ_{040}^{*} (k_{4}^{2} - 4 v_{1}^{2}) + k_{5}^{2} δ_{004}^{*}] .

Condition (vi): By (52) and (59)

M S E ({\hat{S}}_{c k_{i}}^{2}) > M S E ({\hat{S}}_{A}^{2}) if

[δ_{220}^{*} (k_{4} - 4 v_{i}) + k_{5} δ_{202}^{*} - k_{4} k_{5} δ_{022}^{*}] > \frac{1}{2} [δ_{040}^{*} (k_{4}^{2} - 4 v_{i}^{2}) + k_{5}^{2} δ_{004}^{*}] .

5. Numerical Comparison

This section compares the mean squared errors MSEs of several estimators, including the proposed class of estimators, using both simulated and actual datasets. The purpose is to evaluate the performance of these estimators. In addition, we compute the percent relative efficiency (PREs) of both the proposed class of estimators and other existing estimators. For more details see Appendix A and Appendix B.

5.1. Simulation Study

We use the approach outlined in refs. [10,11] to perform a simulation study in order to validate the theoretical results reported in Section 4. Using the probability distributions listed below, it is possible to artificially generate the auxiliary variable X into six different populations:

Population 1: $X \sim G a m m a (γ_{1} = 2, γ_{2} = 4),$
Population 2: $X \sim G a m m a (γ_{1} = 3, γ_{2} = 7)$ ,
Population 3: $X \sim E x p o n e n t i a l (μ = 5),$
Population 4: $X \sim E x p o n e n t i a l (μ = 10) .$
Population 5: $X \sim U n i f o r m (α_{1} = 5, α_{2} = 8),$
Population 6: $X \sim U n i f o r m (α_{1} = 6, α_{2} = 10),$

The variable of interest, Y, is calculated using the following formula:

Y = r_{y x} \times X + e,

where the error term is

e \sim N (0, 1)

and the correlation coefficient between the target and research variables is

r_{y x} = 0.77 .

In order to calculate the mean squared errors (

M S E s

) and percent relative efficiencies (

P R E s

) of the proposed class of estimators and other existing estimators, we performed the following procedures in R software:

Step 1: A population of 1000 observations is initially generated by employing the above probability distributions.
Step 2: We obtain the population total from Step 1 along with the smallest and largest values of the supplementary variable.
Step 3: We use SRSWOR to obtain different sizes of samples for each population.
Step 4: For each sample size, calculate the $M S E$ values of all the estimators discussed in this article.
Step 5: subsequently 80,000 repetitions of Steps 3 and 4, Table 2 and Table 3 present the outcomes of the artificial populations, while Table 4 and Table 5 present a summary of the real datasets.

Finally, to obtain MSE and PRE for each estimator over all of the replications, we apply the following formulas:

M S E {({\hat{S}}_{T}^{2})}_{min} = \frac{\sum_{h = 1}^{80000} {({\hat{S}}_{T}^{2} - S_{y}^{2})}^{2}}{80000}

and

P R E = \frac{V ({\hat{S}}_{y}^{2})}{M S E {({\hat{S}}_{T}^{2})}_{m i n}} \times 100,

where

T = t, l r, b t, u s, a, b, c, A_{i} (i = 1, 2, \dots, 8) .

5.2. Numerical Examples

We evaluated the suggested estimator’s performance by comparing the mean squared errors MSEs and PREs among various estimators using three real-life datasets. The following lists the datasets together with summar y statistics:

Data 1. [Source: Ref. [25], p. 135]

Y: Enrollment of students in 2012,

X: The total schools in 2012,

R: Ranks the total schools in 2012.

The summary statistics are as follows:

\begin{matrix} N = 36, n = 15, \bar{X} = 1054.39, \bar{Y} = 148718.70, \bar{R} = 18.50, X_{M} = 2370, X_{m} = 388, R_{M} = 36, \\ R_{m} = 1, S_{x} = 402.61, S_{y} = 182315.10, S_{r} = 10.54, C_{x} = 0.38, C_{y} = 1.23, C_{r} = 0.56, ρ_{y x} = \\ 0.29, ρ_{x r} = 0.94, ρ_{y r} = 0.19, δ_{400} = 3365, δ_{040} = 4698, δ_{004} = 4698, δ_{220} = 2976, δ_{202} = 3298, \\ δ_{022} = 3297 \end{matrix}

Data 2. [Source: Ref. [25], p. 226]

Y: Total number of workers in 2012,

X: Total number of registered factories in 2012,

R: Ranks the total number of registered factories in 2012.

The summary statistics are as follows:

\begin{matrix} N = 36, n = 15, \bar{X} = 335.78, \bar{Y} = 52432.86, \bar{R} = 18.5, X_{M} = 2055, X_{m} = 24, R_{M} = 36, R_{m} \\ = 1, S_{x} = 451.14, S_{y} = 178201.10, S_{r} = 10.54, C_{x} = 1.34, C_{y} = 3.40, C_{r} = 0.57, ρ_{y x} = 0.69, \\ ρ_{y r} = 0.39, ρ_{x r} = 0.84, δ_{400} = 2366, δ_{040} = 4398, δ_{004} = 4068, δ_{220} = 2276, δ_{202} = 2099, \\ δ_{022} = 2098 . \end{matrix}

Data 3. (Source: Ref. [26], p. 24)

Y: Food costs associated with the family’s job,

X: The weekly earnings of families,

R: Ranks the weekly earnings of families.

The summary statistics are as follows:

\begin{matrix} N = 33, n = 5, \bar{X} = 72.55, \bar{Y} = 27.49, \bar{R} = 17, X_{M} = 95, X_{m} = 58, R_{M} = 33, R_{m} = 1, S_{x} = \\ 10.58, S_{y} = 10.13, S_{r} = 9.64, C_{x} = 0.15, C_{y} = 0.37, C_{r} = 0.57, ρ_{y x} = 0.25, ρ_{y r} = 0.20, ρ_{x r} = \\ 0.98, δ_{400} = 5.55, δ_{040} = 3.08, δ_{004} = 1.10, δ_{220} = 2.22, δ_{202} = 1.94, δ_{022} = 2.24 \end{matrix}

To find out how well the suggested class of estimators performed, we employed three real datasets and simulation tests. For comparing various estimators, the

P R E

criteria has been adopted. According to the simulation investigation, the

M S E

and

P R E

values of the suggested and existing estimators can be found in Table 2 and Table 3, respectively. Table 4 and Table 5 demonstrate the results obtained for the actual datasets. Here are some general findings that we found:

Table 2 and Table 4 show that the $M S E$ values of each proposed estimate are less than those of the existing estimators described in the literature for all of the simulated scenarios and real datasets. This validates the better performance of the suggested estimators over the existing estimators.
In addition, the $P R E$ values of each proposed estimator are greater than those of the existing estimators, which are given in Table 3 and Table 5. The suggested class of estimators performs better than the existing estimators.

6. Conclusions

We introduced a class of effective estimators for calculating the finite population variance in this article. These estimators use the auxiliary variable’s known minimum and maximum values, as well as its ranks. In Section 4, we discussed theoretical conditions that illustrate the greater efficiency of the suggested estimators in order to compare their qualities with those of existing estimators. We performed a simulation study and examined various empirical datasets in order to validate these conditions. According to Table 3, the suggested estimators consistently perform better in terms of

(P R E s)

than existing estimators. The theoretical conclusions in Section 4 are further confirmed by the empirical data shown in Table 5. The simulation and empirical data lead us to conclude that the suggested estimators

{\hat{S}}_{A_{i}}^{2}

(i = 1, 2, 3, \dots, 8)

are more efficient than the other estimators under consideration. As

{\hat{S}}_{A_{8}}^{2}

has the lowest

M S E

among these suggested estimators, it is particularly preferred.

We investigated the characteristics of the suggested efficient class of estimators using a simple random sampling technique. Our findings are useful for identifying more efficient estimators with low

M S E s

for stratified random sampling. This topic is useful for future research.

Author Contributions

Methodology, U.D.; Software, U.D.; Validation, U.D. and O.A.; Formal analysis, U.D., M.A.A. and O.A.; Investigation, U.D., M.A.A. and O.A.; Resources, U.D. and O.A.; Data curation, U.D., M.A.A. and O.A.; Writing—original draft, U.D.; Writing—review and editing, U.D.; Visualization, U.D.; Supervision, M.A.A.; Project administration, U.D., M.A.A. and O.A.; Funding acquisition, M.A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant No. KFU241416].

Data Availability Statement

Data are contained within the article.

Acknowledgments

We would like to express our sincere gratitude to the editor and the anonymous reviewers for their valuable feedback and insightful suggestions, which greatly improved the quality of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Numerical Examples

\begin{matrix} m y d a t a = r e a d . c s v (f i l e . c h o o s e ()) \\ a t t a c h (m y d a t a) \\ N = l e n g t h (X) = l e n g t h (Y) \\ N_{1} = N - 1 \\ n = 15 \\ X_{r} = r a n k (X) \\ Y_{r} = r a n k (Y) \\ ϕ = r o u n d ((1 / n) - (1 / N), d i g i t = 4) \\ \bar{X} = r o u n d (m e a n (X), d i g i t = 4) \\ \bar{Y} = r o u n d (m e a n (Y), d i g i t = 4) \\ \bar{R} = r o u n d (m e a n (X_{r}), d i g i t = 4) \\ X_{M} = m a x (X) \\ X_{m} = m i n (X) \\ R_{M} = m a x (X_{r}) \\ R_{m} = m i n (X_{r}) \\ S_{X}^{2} = r o u n d ((s u m (X^{2}) - (N * {\bar{X}}^{2})) / N_{1}, d i g i t = 4) \\ S_{Y}^{2} = r o u n d ((s u m (Y^{2}) - (N * {\bar{Y}}^{2})) / N_{1}, d i g i t = 4) \\ S_{r}^{2} = r o u n d ((s u m (X_{r}^{2}) - (N * {\bar{R}}^{2})) / N_{1}, d i g i t = 4) \\ S_{X} = r o u n d (s q r t (S_{X}^{2}), d i g i t = 4) \\ S_{r} = r o u n d (s q r t (S_{r}^{2}), d i g i t = 4) \\ S_{Y} = r o u n d (s q r t (S_{Y}^{2}), d i g i t = 4) \\ C_{X}^{2} = r o u n d ((S_{X}^{2} / {\bar{X}}^{2}), d i g i t = 4) \\ C_{r}^{2} = r o u n d ((S_{X_{r}}^{2} / {\bar{R}}^{2}), d i g i t = 4) \\ C_{Y}^{2} = r o u n d ((S_{Y}^{2} / {\bar{Y}}^{2}), d i g i t = 4) \\ C_{X} = r o u n d (s q r t (C_{X}), d i g i t = 4) \\ C_{r} = r o u n d (s q r t (C_{r}), d i g i t = 4) \\ C_{Y} = r o u n d (s q r t (C_{Y}), d i g i t = 4) \\ ρ_{Y X} = r o u n d (c o r (Y, X), d i g i t = 4) \\ ρ_{X R} = r o u n d (c o r (X, X_{r}), d i g i t = 4) \\ ρ_{Y R} = r o u n d (c o r (Y, X_{r}), d i g i t = 4) \\ φ_{400} . = r o u n d ((s u m ({(Y - \bar{Y})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{040} = r o u n d ((s u m ({(X - \bar{X})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{004} = r o u n d ((s u m ({(X_{r} - \bar{X_{r}})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{200} = r o u n d ((s u m ({(Y - \bar{Y})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{020} = r o u n d ((s u m ({(X - \bar{X})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{002} = r o u n d ((s u m ({(X_{r} - \bar{X_{r}})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{220} = r o u n d ((s u m ({(Y - \bar{Y})}^{2} * {(X - \bar{X})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{022} = r o u n d ((s u m ({(X - \bar{X})}^{2} * {(X_{r} - \bar{R})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{202} = r o u n d ((s u m ({(Y - \bar{Y})}^{2} * {(X_{r} - \bar{R})}^{2})) / N_{1}, d i g i t = 4) \\ ρ^{*} = r o u n d (\frac{δ_{220}^{*}}{\sqrt{δ_{400}^{*}} \sqrt{δ_{040}^{*}}}, d i g i t = 4) \\ β_{2 (Y)} = r o u n d (φ_{400} / (φ_{200}^{2}), d i g i t = 4) \\ β_{2 (X)} = r o u n d (φ_{040} / (φ_{020}^{2}), d i g i t = 4) \\ β_{2 (r)} = r o u n d (φ_{004} / (φ_{002}^{2}), d i g i t = 4) \\ δ_{400} = β_{2 (Y)} \\ δ_{040} = β_{2 (X)} \\ δ_{004} = β_{2 (r)} \\ δ_{400}^{*} . = (β_{2 (Y)} - 1) \\ δ_{040}^{*} = (β_{2 (X)} - 1) \\ δ_{004}^{*} = (β_{2 (r)} - 1) \\ δ_{220} = r o u n d (φ_{220} / (S_{Y}^{2} * S_{X}^{2}), d i g i t = 4) \\ δ_{022} = r o u n d (φ_{022} / (S_{X}^{2} * S_{r}^{2}), d i g i t = 4) \\ δ_{202} = r o u n d (φ_{202} / (S_{Y}^{2} * S_{r}^{2}), d i g i t = 4) \\ δ_{220}^{*} = (δ_{220} - 1) \\ δ_{022}^{*} = (δ_{022} - 1) \\ δ_{202}^{*} = (δ_{202} - 1) \\ v_{1} = r o u n d (\frac{S_{X}^{2}}{S_{X}^{2} + δ_{040}}, d i g i t = 4) \\ v_{2} = r o u n d (\frac{S_{X}^{2}}{S_{X}^{2} + C_{X}}, d i g i t = 4) \\ v_{3} = r o u n d (\frac{δ_{040} * S_{X}^{2}}{δ_{040} * S_{X}^{2} + C_{X}}, d i g i t = 3) \\ v_{4} = r o u n d (\frac{C_{X} * S_{X}^{2}}{C X * S_{X}^{2} + δ_{040}}, d i g i t = 4) \\ ξ_{11} = 1 \\ ξ_{21} = X_{M} - X_{m} \\ ξ_{12} = X_{M} - X_{m} \\ ξ_{22} = C_{X} \\ ξ_{13} = X_{M} - X_{m} \\ ξ_{23} = 1 \\ ξ_{14} = X_{M} - X_{m} \\ ξ_{24} = β_{2 (X)} \\ ξ_{15} = β_{2 (X)} \\ ξ_{25} = X_{M} - X_{m} \\ ξ_{16} = ρ_{Y X} \\ ξ_{17} = C_{X} \\ ξ_{27} = X_{M} - X_{m} \\ ξ_{18} = ρ_{Y X} \\ ξ_{28} = X_{M} - X_{m} \\ ξ_{3} = 1 \\ ξ_{4} = R_{M} - R_{x} \\ k_{41} = \frac{ξ_{11} * S_{X}^{2}}{ξ_{11} * S_{X}^{2} + ξ_{21}} \\ k_{42} = \frac{ξ_{12} * S_{X}^{2}}{ξ_{12} * S_{X}^{2} + ξ_{22}} \\ k_{43} = \frac{ξ_{13} * S_{X}^{2}}{ξ_{13} * S_{X}^{2} + ξ_{23}} \\ k_{44} = \frac{ξ_{14} * S_{X}^{2}}{ξ_{14} * S_{X}^{2} + ξ_{24}} \\ k_{45} = \frac{ξ_{15} * S_{X}^{2}}{ξ_{15} * S_{X}^{2} + ξ_{25}} \\ k_{46} = \frac{ξ_{16} * S_{X}^{2}}{ξ_{16} * S_{X}^{2} + ξ_{26}} \\ k_{47} = \frac{ξ_{17} * S_{X}^{2}}{ξ_{17} * S_{X}^{2} + ξ_{27}} \\ k_{48} = \frac{ξ_{18} * S_{X}^{2}}{ξ_{18} * S_{X}^{2} + ξ_{28}} \\ k_{5} = \frac{ξ_{3} * S_{r}^{2}}{ξ_{3} * S_{r}^{2} + ξ_{4}} \end{matrix}

Mean squared errors:

\begin{matrix} V a r ({\hat{S}}_{Y}^{2}) = ϕ * S_{Y}^{4} * δ_{400}^{*} \\ M S E ({\hat{S}}_{t}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + δ_{040}^{*} - 2 * δ_{220}^{*}) \\ M S E ({\hat{S}}_{l r}^{2}) = ϕ * S_{Y}^{4} * δ_{400}^{*} * (1 - ρ_{Y X}^{* 2}) \\ M S E ({\hat{S}}_{b t}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{δ_{040}^{*}}{4} - δ_{220}^{*}) \\ M S E ({\hat{S}}_{u s}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + v_{1}^{2} * δ_{040}^{*} - 2 * v_{1} * δ_{220}^{*}) \\ M S E ({\hat{S}}_{a}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + v_{2}^{2} * δ_{040}^{*} - 2 * v_{2} * δ_{220}^{*}) \\ M S E ({\hat{S}}_{b}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + v_{3}^{2} * δ_{040}^{*} - 2 * v_{3} * δ_{220}^{*}) \\ M S E ({\hat{S}}_{c}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + v_{4}^{2} * δ_{040}^{*} - 2 * v_{4} * δ_{220}^{*}) \\ M S E ({\hat{S}}_{A_{1}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{41}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{41} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{41} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{2}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{42}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{42} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{42} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{3}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{43}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{43} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{43} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{4}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{44}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{44} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{44} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{5}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{45}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{45} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{45} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{6}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{46}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{46} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{46} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{7}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{47}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{47} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{47} * k_{5} * δ_{022}^{*})) \\ M S E ({\hat{S}}_{A_{8}}^{2}) = ϕ * S_{Y}^{4} * (δ_{400}^{*} + \frac{1}{4} * (k_{48}^{2} * δ_{040}^{*} + k_{5}^{2} * δ_{004}^{*}) - \frac{1}{2} * (2 * k_{48} * δ_{220}^{*} + 2 * k_{5} * δ_{202}^{*} - k_{48} * k_{5} * δ_{022}^{*})) \end{matrix}

Percent relative efficiency:

\begin{array}{l} \frac{V a r ({\hat{S}}_{Y}^{2})}{V a r ({\hat{S}}_{Y}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{t}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{l r}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{b t}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{u s}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{a}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{b}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{c}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{1}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{2}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{3}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{4}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{5}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{6}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{7}}^{2})} * 100 \\ \frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{8}}^{2})} * 100 \end{array}

Appendix B. Simulation Study

\begin{matrix} l i b r a r y (s a m p l i n g) \\ N = 1000 \\ c o r r e l a t e d V a l u e = f u n c t i o n (X, r) \\ r^{2} = r * * 2 \\ v e = 1 - r^{2} \\ S D = s q r t (v e) \\ e = r n o r m (l e n g t h (X), m e a n = 0, s d = S D) \\ Y = r * X + e \\ r e t u r n (Y) \\ s e t . s e e d (0) \\ X = r g a m m a (N, 2, 4) \\ Y = c o r r e l a t e d V a l u e (X = X, r = 0.77) \\ m y d a t a = d a t a . f r a m e (X, Y) \\ Y = m y d a t a [, 1] \\ X = m y d a t a [, 2] \\ n = 50 \\ ϕ = r o u n d ((1 / n) - (1 / N), d i g i t = 4) \\ N_{1} = N - 1 \\ \bar{X} = r o u n d (m e a n (X), d i g i t = 4) \\ \bar{Y} = r o u n d (m e a n (Y), d i g i t = 4) \\ \bar{R} = r o u n d (m e a n (X_{r}), d i g i t = 4) \\ X_{r} = r a n k (X) \\ Y_{r} = r a n k (Y) \\ X_{M} = m a x (X) \\ X_{m} = m i n (X) \\ R_{M} = m a x (X_{r}) \\ R_{m} = m i n (X_{r}) \\ S_{X}^{2} = r o u n d ((s u m (X^{2}) - (N * {\bar{X}}^{2})) / N_{1}, d i g i t = 4) \\ S_{Y}^{2} = r o u n d ((s u m (Y^{2}) - (N * {\bar{Y}}^{2})) / N_{1}, d i g i t = 4) \\ S_{r}^{2} = r o u n d ((s u m (X_{r}^{2}) - (N * {\bar{R}}^{2})) / N_{1}, d i g i t = 4) \\ S_{X} = r o u n d (s q r t (S_{X}^{2}), d i g i t = 4) \\ S_{r} = r o u n d (s q r t (S_{r}^{2}), d i g i t = 4) \\ S_{Y} = r o u n d (s q r t (S_{Y}^{2}), d i g i t = 4) \\ C_{X}^{2} = r o u n d ((S_{X}^{2} / {\bar{X}}^{2}), d i g i t = 4) \\ C_{r}^{2} = r o u n d ((S_{X_{r}}^{2} / {\bar{R}}^{2}), d i g i t = 4) \\ C_{Y}^{2} = r o u n d ((S_{Y}^{2} / {\bar{Y}}^{2}), d i g i t = 4) \\ C_{X} = r o u n d (s q r t (C_{X}), d i g i t = 4) \\ C_{r} = r o u n d (s q r t (C_{r}), d i g i t = 4) \\ C_{Y} = r o u n d (s q r t (C_{Y}), d i g i t = 4) \\ ρ_{Y X} = r o u n d (c o r (Y, X), d i g i t = 4) \\ ρ_{X R} = r o u n d (c o r (X, X_{r}), d i g i t = 4) \\ ρ_{Y R} = r o u n d (c o r (Y, X_{r}), d i g i t = 4) \\ φ_{400} . = r o u n d ((s u m ({(Y - \bar{Y})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{040} = r o u n d ((s u m ({(X - \bar{X})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{004} = r o u n d ((s u m ({(X_{r} - \bar{X_{r}})}^{4})) / N_{1}, d i g i t = 4) \\ φ_{200} = r o u n d ((s u m ({(Y - \bar{Y})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{020} = r o u n d ((s u m ({(X - \bar{X})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{002} = r o u n d ((s u m ({(X_{r} - \bar{X_{r}})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{220} = r o u n d ((s u m ({(Y - \bar{Y})}^{2} * {(X - \bar{X})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{022} = r o u n d ((s u m ({(X - \bar{X})}^{2} * {(X_{r} - \bar{R})}^{2})) / N_{1}, d i g i t = 4) \\ φ_{202} = r o u n d ((s u m ({(Y - \bar{Y})}^{2} * {(X_{r} - \bar{R})}^{2})) / N_{1}, d i g i t = 4) \\ ρ^{*} = r o u n d (\frac{δ_{220}^{*}}{\sqrt{δ_{400}^{*}} \sqrt{δ_{040}^{*}}}, d i g i t = 4) \\ β_{2 (Y)} = r o u n d (φ_{400} / (φ_{200}^{2}), d i g i t = 4) \\ β_{2 (X)} = r o u n d (φ_{040} / (φ_{020}^{2}), d i g i t = 4) \\ β_{2 (r)} = r o u n d (φ_{004} / (φ_{002}^{2}), d i g i t = 4) \\ δ_{400} = β_{2 (Y)} \\ δ_{040} = β_{2 (X)} \\ δ_{004} = β_{2 (r)} \\ δ_{400}^{*} . = (β_{2 (Y)} - 1) \\ δ_{040}^{*} = (β_{2 (X)} - 1) \\ δ_{004}^{*} = (β_{2 (r)} - 1) \\ δ_{220} = r o u n d (φ_{220} / (S_{Y}^{2} * S_{X}^{2}), d i g i t = 4) \\ δ_{022} = r o u n d (φ_{022} / (S_{X}^{2} * S_{r}^{2}), d i g i t = 4) \\ δ_{202} = r o u n d (φ_{202} / (S_{Y}^{2} * S_{r}^{2}), d i g i t = 4) \\ δ_{220}^{*} = (δ_{220} - 1) \\ δ_{022}^{*} = (δ_{022} - 1) \\ δ_{202}^{*} = (δ_{202} - 1) \\ b_{(} s_{y}^{2}, s_{x}^{2}) = \frac{s_{y}^{2} * δ_{220}^{*}}{s_{x}^{2} * δ_{040}^{*}} \\ v_{1} = r o u n d (\frac{S_{X}^{2}}{S_{X}^{2} + δ_{040}}, d i g i t = 4) \\ v_{2} = r o u n d (\frac{S_{X}^{2}}{S_{X}^{2} + C_{X}}, d i g i t = 4) \\ v_{3} = r o u n d (\frac{δ_{040} * S_{X}^{2}}{δ_{040} * S_{X}^{2} + C_{X}}, d i g i t = 3) \\ v_{4} = r o u n d (\frac{C_{X} * S_{X}^{2}}{C X * S_{X}^{2} + δ_{040}}, d i g i t = 4) \\ ξ_{11} = 1 \\ ξ_{21} = X_{M} - X_{m} \\ ξ_{12} = X_{M} - X_{m} \\ ξ_{22} = C_{X} \\ ξ_{13} = X_{M} - X_{m} \\ ξ_{23} = 1 \\ ξ_{14} = X_{M} - X_{m} \\ ξ_{24} = β_{2 (X)} \\ ξ_{15} = β_{2 (X)} \\ ξ_{25} = X_{M} - X_{m} \\ ξ_{16} = ρ_{Y X} \\ ξ_{17} = C_{X} \\ ξ_{27} = X_{M} - X_{m} \\ ξ_{18} = ρ_{Y X} \\ ξ_{28} = X_{M} - X_{m} \\ ξ_{3} = 1 \\ ξ_{4} = R_{M} - R_{x} \\ k_{41} = \frac{ξ_{11} * S_{X}^{2}}{ξ_{11} * S_{X}^{2} + ξ_{21}} \\ k_{42} = \frac{ξ_{12} * S_{X}^{2}}{ξ_{12} * S_{X}^{2} + ξ_{22}} \\ k_{43} = \frac{ξ_{13} * S_{X}^{2}}{ξ_{13} * S_{X}^{2} + ξ_{23}} \\ k_{44} = \frac{ξ_{14} * S_{X}^{2}}{ξ_{14} * S_{X}^{2} + ξ_{24}} \\ k_{45} = \frac{ξ_{15} * S_{X}^{2}}{ξ_{15} * S_{X}^{2} + ξ_{25}} \\ k_{46} = \frac{ξ_{16} * S_{X}^{2}}{ξ_{16} * S_{X}^{2} + ξ_{26}} \\ k_{47} = \frac{ξ_{17} * S_{X}^{2}}{ξ_{17} * S_{X}^{2} + ξ_{27}} \\ k_{48} = \frac{ξ_{18} * S_{X}^{2}}{ξ_{18} * S_{X}^{2} + ξ_{28}} \\ k_{5} = \frac{ξ_{3} * S_{r}^{2}}{ξ_{3} * S_{r}^{2} + ξ_{4}} \end{matrix}

Now, we perform simulations

\begin{matrix} N_{2} = 8000 \\ S_{Y}^{2} = c (); S_{X}^{2} = c (); s_{x}^{2} = c (); \bar{Y} = c (); \bar{X}; X_{M} = c (); X_{m} = c (); \bar{R} = c (); \bar{y} = c (); \bar{x} = \\ c (); x_{M} = c (), x_{m} = c (); \bar{r} = c (); R_{M} = c (); R_{m} = c (); {\hat{S}}_{t}^{2} = c (); {\hat{S}}_{l r}^{2} = c (); {\hat{S}}_{b t}^{2} = c (); {\hat{S}}_{u s}^{2} = \\ c (); {\hat{S}}_{a}^{2} = c (); {\hat{S}}_{b}^{2} = c (); {\hat{S}}_{c}^{2} = c (); {\hat{S}}_{A_{1}}^{2} = c (); {\hat{S}}_{A_{2}}^{2} = c (); {\hat{S}}_{A_{3}}^{2} = c (); {\hat{S}}_{A_{4}}^{2} = c (); {\hat{S}}_{A_{5}}^{2} = \\ c (); {\hat{S}}_{A_{6}}^{2} = c (); {\hat{S}}_{A_{7}}^{2} = c (); {\hat{S}}_{A_{8}}^{2} = c () \\ for (i in 1 : N_{2}) { \\ N = 1000 \\ n = 150 \\ d 1 = m y d a t a [s a m p l e (1 : n r o w (m y d a t a), N),] \\ d 2 = d 1 [s a m p l e (1 : n r o w (d 1), n),] \\ x_{r} = r a n k (d 2 [, 2]) \\ X_{M} = c (X_{M}, m a x (d 1 [, 2])) \\ X_{m} = c (X_{m}, m i n (d 1 [, 2]) \\ R_{M} = c (R_{M}, m a x (d 1 [, 2])) \\ R_{m} = c (R_{m}, m i n (d 1 [, 2]) \\ x_{M} = c (x_{M}, m a x (d 2 [, 2])) \\ x_{m} = c (x_{m}, m i n (d 2 [, 2])) \\ \bar{X} = c (\bar{X}, m e a n (d 1 [, 2])) \\ \bar{Y} = c (\bar{X}, m e a n (d 1 [, 1])) \\ \bar{R} = c (\bar{R}, m e a n (d 1 [, 2])) \\ \bar{x} = c (\bar{x}, m e a n (d 2 [, 2])) \\ \bar{y} = c (\bar{y}, m e a n (d 2 [, 1])) \\ \bar{r} = c (\bar{r}, m e a n (d 1 [, 2])) \\ S_{X}^{2} = c (s_{X}^{2}, v a r (d 1 [, 2])) \\ S_{Y}^{2} = c (s_{Y}^{2}, v a r (d 1 [, 1])) \\ S_{R}^{2} = c (s_{R}^{2}, v a r (d 1 [, 2])) \\ s_{x}^{2} = c (s_{x}^{2}, v a r (d 2 [, 2])) \\ s_{y}^{2} = c (s_{y}^{2}, v a r (d 2 [, 1])) \\ s_{r}^{2} = c (s_{r}^{2}, v a r (d 2 [, 2])) \\ b_{(} s_{y}^{2}, s_{x}^{2}) = \frac{s_{y}^{2} * δ_{220}^{*}}{s_{x}^{2} * δ_{040}^{*}} \\ {\hat{S}}_{t}^{2} = s_{y}^{2} * (\frac{S_{X}^{2}}{s_{y}^{2}}) \\ {\hat{S}}_{l r}^{2} = s_{y}^{2} + b_{(} s_{y}^{2}, s_{x}^{2}) * (S_{X}^{2} - s_{x}^{2}) \\ {\hat{S}}_{b t}^{2} = s_{y}^{2} * exp (\frac{S_{X}^{2} - s_{x}^{2}}{S_{X}^{2} + s_{x}^{2}}) \\ {\hat{S}}_{u s}^{2} = s_{y}^{2} * (\frac{S_{X}^{2} + δ_{040}}{s_{x}^{2} + δ_{040}}) \\ {\hat{S}}_{a}^{2} = s_{y}^{2} * (\frac{S_{X}^{2} + C_{X}}{s_{x}^{2} + C_{X}}) \\ {\hat{S}}_{b}^{2} = s_{y}^{2} * (\frac{δ_{040} * S_{X}^{2} + C_{X}}{δ_{040} * s_{x}^{2} + C_{X}}) \\ {\hat{S}}_{c}^{2} = s_{y}^{2} * (\frac{C_{X} * S_{X}^{2} + δ_{040}}{C_{X} * s_{x}^{2} + δ_{040}}) \\ {\hat{S}}_{A_{1}}^{2} = s_{2}^{2} * exp (\frac{S_{X}^{2} - s_{x}^{2}}{S_{X}^{2} + s_{x}^{2} + 2 * (x_{M} - x_{m})}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{2}}^{2} = s_{2}^{2} * exp (\frac{(x_{M} - x_{m}) * (S_{X}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) * (S_{X}^{2} + s_{x}^{2}) + 2 * C_{X}}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{3}}^{2} = s_{2}^{2} * exp (\frac{(x_{M} - x_{m}) * (S_{X}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) * (S_{X}^{2} + s_{x}^{2}) + 2}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{4}}^{2} = s_{2}^{2} * exp (\frac{(x_{M} - x_{m}) * (S_{X}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) * (S_{X}^{2} + s_{x}^{2}) + 2 * β_{2 (X)}}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{5}}^{2} = s_{2}^{2} * exp (\frac{β_{2 (X)} * (S_{X}^{2} - s_{x}^{2})}{β_{2 (X)} * (S_{X}^{2} + s_{x}^{2}) + 2 * (x_{M} - x_{m})}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{6}}^{2} = s_{2}^{2} * exp (\frac{(x_{M} - x_{m}) * (S_{X}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) * (S_{X}^{2} + s_{x}^{2}) + 2 * ρ_{Y X}}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{7}}^{2} = s_{2}^{2} * exp (\frac{C_{X} * (S_{X}^{2} - s_{x}^{2})}{C_{X} * (S_{X}^{2} + s_{x}^{2}) + 2 * (x_{M} - x_{m})}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ {\hat{S}}_{A_{7}}^{2} = s_{2}^{2} * exp (\frac{ρ_{Y X} * (S_{X}^{2} - s_{x}^{2})}{ρ_{Y X} * (S_{X}^{2} + s_{x}^{2}) + 2 * (x_{M} - x_{m})}) * exp (\frac{S_{R}^{2} - s_{r}^{2}}{S_{R}^{2} + s_{r}^{2} + 2 * (R_{M} - R_{m})}) \\ } \end{matrix}

Mean squared errors:

\begin{matrix} M S E ({\hat{S}}_{t}^{2}) = m e a n {({\hat{S}}_{t}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{l r}^{2}) = m e a n {({\hat{S}}_{l r}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{b t}^{2}) = m e a n {({\hat{S}}_{b t}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{u s}^{2}) = m e a n {({\hat{S}}_{u s}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{a}^{2}) = m e a n ({\hat{S}}_{a}^{2} - S_{Y}^{2}) \\ M S E ({\hat{S}}_{b}^{2}) = m e a n {({\hat{S}}_{b}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{c}^{2}) = m e a n {({\hat{S}}_{c}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{1}}^{2}) = m e a n {({\hat{S}}_{A_{1}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{2}}^{2}) = m e a n {({\hat{S}}_{A_{2}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{3}}^{2}) = m e a n {({\hat{S}}_{A_{3}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{4}}^{2}) = m e a n {({\hat{S}}_{A_{4}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{5}}^{2}) = m e a n {({\hat{S}}_{A_{5}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{6}}^{2}) = m e a n {({\hat{S}}_{A_{6}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{7}}^{2}) = m e a n {({\hat{S}}_{A_{7}}^{2} - S_{Y}^{2})}^{2} \\ M S E ({\hat{S}}_{A_{8}}^{2}) = m e a n {({\hat{S}}_{A_{8}}^{2} - S_{Y}^{2})}^{2} \end{matrix}

Relative efficiency:

\begin{matrix} r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{t}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{l r}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{b t}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{u s}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{a}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{b}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{c}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{1}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{2}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{3}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{4}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{5}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{6}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{7}}^{2})} * 100, d i g i t = 4) \\ r o u n d (\frac{V a r ({\hat{S}}_{Y}^{2})}{M S E ({\hat{S}}_{A_{8}}^{2})} * 100, d i g i t = 4) \end{matrix}

References

Mohanty, S.; Sahoo, J. A note on improving the ratio method of estimation through linear transformation using certain known population parameters. Sankhyā Indian J. Stat. Ser. 1995, 57, 93–102. [Google Scholar]
Khan, M.; Shabbir, J. Some improved ratio, product, and regression estimators of finite population mean when using minimum and maximum values. Sci. World J. 2013, 2013, 431868. [Google Scholar] [CrossRef] [PubMed]
Daraz, U.; Shabbir, J.; Khan, H. Estimation of finite population mean by using minimum and maximum values in stratified random sampling. J. Mod. Appl. Stat. Methods 2018, 17, 20. [Google Scholar] [CrossRef]
Cekim, H.O.; Cingi, H. Some estimator types for population mean using linear transformation with the help of the minimum and maximum values of the auxiliary variable. Hacet. J. Math. Stat. 2017, 46, 685–694. [Google Scholar]
Chatterjee, S.; Hadi, A.S. Regression Analysis by Example; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
Khan, M. Improvement in estimating the finite population mean under maximum and minimum values in double sampling scheme. J. Stat. Appl. Probab. Lett. 2015, 2, 115–121. [Google Scholar]
Walia, G.S.; Kaur, H.; Sharma, M. Ratio type estimator of population mean through efficient linear transformation. Am. J. Math. Stat. 2015, 5, 144–149. [Google Scholar]
Isaki, C.T. Variance estimation using auxiliary information. J. Am. Stat. Assoc. 1983, 78, 117–123. [Google Scholar] [CrossRef]
Bahl, S.; Tuteja, R. Ratio and product type exponential estimators. J. Inf. Optim. Sci. 1991, 12, 159–164. [Google Scholar] [CrossRef]
Daraz, U.; Khan, M. Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling. Res. Math. Stat. 2021, 8, 1899402. [Google Scholar] [CrossRef]
Daraz, U.; Wu, J.; Albalawi, O. Double exponential ratio estimator of a finite population variance under extreme values in simple random sampling. Mathematics 2024, 12, 1737. [Google Scholar] [CrossRef]
Daraz, U.; Wu, J.; Alomair, M.A.; Aldoghan, L.A. New classes of difference cum-ratio-type exponential estimators for a finite population variance in stratified random sampling. Heliyon 2024, 10, e33402. [Google Scholar] [CrossRef]
Ahmad, S.; Al Mutairi, A.; Nassr, S.G.; Alsuhabi, H.; Kamal, M.; Rehman, M.U. A new approach for estimating variance of a population employing information obtained from a stratified random sampling. Heliyon 2023, 9, 1–13. [Google Scholar] [CrossRef] [PubMed]
Dubey, V.; Sharma, H. On estimating population variance using auxiliary information. Stat. Transit. New Ser. 2008, 9, 7–18. [Google Scholar]
Kadilar, C.; Cingi, H. Ratio estimators for the population variance in simple and stratified random sampling. Appl. Math. Comput. 2006, 173, 1047–1059. [Google Scholar] [CrossRef]
Shabbir, J.; Gupta, S. Some estimators of finite population variance of stratified sample mean. Commun. Stat. Theory Methods 2010, 39, 3001–3008. [Google Scholar] [CrossRef]
Shabbir, J.; Gupta, S. Using rank of the auxiliary variable in estimating variance of the stratified sample mean. Int. J. Comput. Theor. Stat. 2019, 6, 207. [Google Scholar] [CrossRef]
Singh, H.; Chandra, P. An alternative to ratio estimator of the population variance in sample surveys. J. Transp. Stat. 2008, 9, 89–103. [Google Scholar]
Singh, H.P.; Solanki, R.S. A new procedure for variance estimation in simple random sampling using auxiliary information. J. Stat. Pap. 2013, 54, 479–497. [Google Scholar] [CrossRef]
Upadhyaya, L.; Singh, H. An estimator for populationvariance that utilizes the kurtosis of an auxiliary variablein sample surveys. Vikram Math. J. 1999, 19, 14–17. [Google Scholar]
Yadav, S.K.; Kadilar, C.; Shabbir, J.; Gupta, S. Improved family of estimators of population variance in simple random sampling. J. Stat. Theory Pract. 2015, 9, 219–226. [Google Scholar] [CrossRef]
Yasmeen, U.; Noor-ul-Amin, M. Estimation of Finite Population Variance Under Stratified Sampling Technique. J. Reliab. Stat. Stud. 2021, 14, 565–584. [Google Scholar] [CrossRef]
Zaman, T.; Bulut, H. An efficient family of robust-type estimators for the population variance in simple and stratified random sampling. Commun. Stat. Theory Methods 2023, 52, 2610–2624. [Google Scholar] [CrossRef]
Watson, D.J. The estimation of leaf area in field crops. J. Agric. Sci. 1937, 27, 474–483. [Google Scholar] [CrossRef]
Bureau of Statistics. Punjab Development Statistics Government of the Punjab, Lahore, Pakistan; Bureau of Statistics: Lahore, Pakistan, 2013.
Cochran, W.B. Sampling Techniques; John Wiley and Sons: Hoboken, NJ, USA, 1963. [Google Scholar]

Table 1. Some classes of the proposed estimator.

Subsets of the Proposed Estimator ${\hat{S}}_{A}^{2}$	$ξ_{1}$	$ξ_{2}$
${\hat{S}}_{A_{1}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{(S_{x}^{2} - s_{x}^{2})}{(S_{x}^{2} + s_{x}^{2}) + 2 (x_{M} - x_{m})}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	1	$x_{M} - x_{m}$
${\hat{S}}_{A_{2}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{(x_{M} - x_{m}) (S_{x}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) (S_{x}^{2} + s_{x}^{2}) + 2 c_{x}}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$x_{M} - x_{m}$	$c_{x}$
${\hat{S}}_{A_{3}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{(x_{M} - x_{m}) (S_{x}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) (S_{x}^{2} + s_{x}^{2}) + 2}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$x_{M} - x_{m}$	1
${\hat{S}}_{A_{4}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{(x_{M} - x_{m}) (S_{x}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) (S_{x}^{2} + s_{x}^{2}) + 2 β_{2 (x)}}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$x_{M} - x_{m}$	$β_{2 (x)}$
${\hat{S}}_{A_{5}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{β_{2 (x)} (S_{x}^{2} - s_{x}^{2})}{β_{2 (x)} (S_{x}^{2} + s_{x}^{2}) + 2 (x_{M} - x_{m})}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$β_{2 (x)}$	$x_{M} - x_{m}$
${\hat{S}}_{A_{6}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{(x_{M} - x_{m}) (S_{x}^{2} - s_{x}^{2})}{(x_{M} - x_{m}) (S_{x}^{2} + s_{x}^{2}) + 2 ρ_{y x}}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$x_{M} - x_{m}$	$ρ_{y x}$
${\hat{S}}_{A_{7}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{c_{x} (S_{x}^{2} - s_{x}^{2})}{c_{x} (S_{x}^{2} + s_{x}^{2}) + 2 (x_{M} - x_{m})}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$c_{x}$	$x_{M} - x_{m}$
${\hat{S}}_{A_{8}}^{2} = s_{y}^{2} exp [δ_{1} \{\frac{ρ_{y x} (S_{x}^{2} - s_{x}^{2})}{ρ_{y x} (S_{x}^{2} + s_{x}^{2}) + 2 (x_{M} - x_{m})}\}] exp [δ_{2} \{\frac{ξ_{3} (S_{r}^{2} - s_{r}^{2})}{ξ_{3} (S_{r}^{2} + s_{r}^{2}) + 2 ξ_{4}}\}]$	$ρ_{y x}$	$x_{M} - x_{m}$

Table 2. MSEs of all the estimators using simulated data.

Estimator	Pop-I	Pop-II	Pop-III	Pop-IV	Pop-V	Pop-VI
(1) ${\hat{S}}_{y}^{2}$	7.34e−5	9.62e−5	8.82e−4	6.43e−4	7.47e−3	6.00e−3
(2) ${\hat{S}}_{t}^{2}$	5.98e−5	7.90e−5	5.90e−4	4.99e−4	6.00e−3	5.02e−3
(3) ${\hat{S}}_{l r}^{2}$	5.90e−5	7.88e−5	5.89e−4	4.80e−4	5.60e−3	4.80e−3
(4) ${\hat{S}}_{b t}^{2}$	5.31e−5	7.60e−5	5.80e−4	4.65e−4	5.40e−3	4.70e−3
(5) ${\hat{S}}_{u s}^{2}$	5.32e−5	7.58e−5	5.78e−4	4.50e−4	5.20e−3	4.50e−3
(6) ${\hat{S}}_{a}^{2}$	5.30e−5	7.40e−5	5.76e−4	4.20e−4	5.00e−3	4.30e−3
(7) ${\hat{S}}_{b}^{2}$	5.30e−5	7.40e−5	5.76e−4	4.20e−4	5.00e−3	4.30e−3
(8) ${\hat{S}}_{c}^{2}$	5.20e−5	7.35e−5	5.60e−4	4.00e−4	4.90e−3	4.10e−3
(9) ${\hat{S}}_{A_{1}}^{2}$	2.69e−5	5.78e−5	3.80e−4	2.80e−4	2.77e−3	2.00e−3
(10) ${\hat{S}}_{A_{2}}^{2}$	2.98e−5	5.92e−5	3.98e−4	3.00e−4	2.96e−3	2.20e−3
(11) ${\hat{S}}_{A_{3}}^{2}$	2.39e−5	5.39e−5	3.50e−4	2.50e−4	2.60e−3	2.10e−3
(12) ${\hat{S}}_{A_{4}}^{2}$	2.380e−5	5.35e−5	3.35e−4	2.20e−4	2.40e−3	1.90e−3
(13) ${\hat{S}}_{A_{5}}^{2}$	2.50e−5	5.60e−5	3.60e−4	2.70e−4	2.80e−3	1.70e−3
(14) ${\hat{S}}_{A_{6}}^{2}$	2.50e−5	5.61e−5	3.66e−4	2.77e−4	3.00e−3	2.15e−3
(15) ${\hat{S}}_{A_{7}}^{2}$	2.40e−5	5.25e−5	3.20e−4	2.10e−4	2.35e−3	2.40e−3
(16) ${\hat{S}}_{A_{8}}^{2}$	2.30e−5	5.22e−5	3.05e−4	1.90e−4	1.99e−3	1.40e−3

Table 3. PREs of all the estimators using simulated data.

Estimator	Pop-I	Pop-II	Pop-III	Pop-IV	Pop-V	Pop-VI
(1) ${\hat{S}}_{y}^{2}$	100	100	100	100	100	100
(2) ${\hat{S}}_{t}^{2}$	122.74	125.57	149.49	128.86	124.50	119.52
(3) ${\hat{S}}_{l r}^{2}$	124.41	125.88	149.74	133.95	133.39	125.00
(4) ${\hat{S}}_{b t}^{2}$	138.23	130.52	152.07	138.28	138.33	127.66
(5) ${\hat{S}}_{u s}^{2}$	137.97	130.87	52.50	142.89	143.65	133.33
(6) ${\hat{S}}_{a}^{2}$	138.49	134.05	153.13	153.09	149.00	139.53
(7) ${\hat{S}}_{b}^{2}$	138.49	134.05	153,13	153.00	149.00	139.53
(8) ${\hat{S}}_{c}^{2}$	141.15	134.97	157.50	60.75	152.45	146.34
(9) ${\hat{S}}_{A_{1}}^{2}$	272.05	71.63	232.1	229.64	269.68	300.00
(10) ${\hat{S}}_{A_{2}}^{2}$	246.31	167.57	221.61	214.33	252.36	272.73
(11) ${\hat{S}}_{A_{3}}^{2}$	307.11	184.04	252.00	257.20	287.11	285.71
(12) ${\hat{S}}_{A_{4}}^{2}$	308.40	185.42	263.28	292.28	311.25	315.79
(13) ${\hat{S}}_{A_{5}}^{2}$	293.60	177.14	245.00	238.15	266.79	352.94
(14) ${\hat{S}}_{A_{6}}^{2}$	293.60	176.82	240.98	232.13	249.00	279.07
(15) ${\hat{S}}_{A_{7}}^{2}$	305.83	188.85	275.63	306.19	317.87	250.00
(16) ${\hat{S}}_{A_{8}}^{2}$	319.13	189.74	289.18	337.53	375.38	428.57

Table 4. MSEs using empirical datasets.

Estimator	Data 1	Data 2	Data 3
(1) ${\hat{S}}_{y}^{2}$	1.45e+23	9.27e+22	8130.61
(2) ${\hat{S}}_{t}^{2}$	9.07.e+22	8.67e+22	7487.31
(3) ${\hat{S}}_{l r}^{2}$	6.36e+22	4.65e+22	6851.91
(4) ${\hat{S}}_{b t}^{2}$	6.72e+22	4.66e+22	6879.75
(5) ${\hat{S}}_{u s}^{2}$	8.67e+22	8.33e+22	7407.67
(6) ${\hat{S}}_{a}^{2}$	9.07e+22	8.67e+22	7483.48
(7) ${\hat{S}}_{b}^{2}$	9.07e+22	8.67e+22	7486.06
(8) ${\hat{S}}_{c}^{2}$	8.13e+22	8.41e+22	7080.74
(9) ${\hat{S}}_{A_{1}}^{2}$	4.22e+22	3.81e+22	6658.82
(10) ${\hat{S}}_{A_{2}}^{2}$	4.25e+22	3.84e+22	6726.29
(11) ${\hat{S}}_{A_{3}}^{2}$	4.25e+22	3.84e+22	6726.18
(12) ${\hat{S}}_{A_{4}}^{2}$	4.25e+22	3.84e+22	6725.93
(13) ${\hat{S}}_{A_{5}}^{2}$	4.25e+22	3.84e+22	6686.33
(14) ${\hat{S}}_{A_{6}}^{2}$	4.25e+22	3.84e+22	6726.27
(15) ${\hat{S}}_{A_{7}}^{2}$	4.17e+22	3.82e+22	6831.87
(16) ${\hat{S}}_{A_{8}}^{2}$	4.15e+22	3.80e+22	6631.44

Table 5. PREs using empirical datasets.

Estimator	Data 1	Data 2	Data 3
(1) ${\hat{S}}_{y}^{2}$	100	100	100
(2) ${\hat{S}}_{t}^{2}$	159.32	106.92	108.59
(3) ${\hat{S}}_{l r}^{2}$	227.25	199.09	118.66
(4) ${\hat{S}}_{b t}^{2}$	215.92	198.86	118.18
(5) ${\hat{S}}_{u s}^{2}$	166.89	111.34	109.75
(6) ${\hat{S}}_{a}^{2}$	159.32	106.92	108.65
(7) ${\hat{S}}_{b}^{2}$	159.32	106.92	108.61
(8) ${\hat{S}}_{c}^{2}$	177.80	110.22	114.79
(9) ${\hat{S}}_{A_{1}}^{2}$	342.83	243.29	122.10
(10) ${\hat{S}}_{A_{2}}^{2}$	340.27	241.54	120.87
(11) ${\hat{S}}_{A_{3}}^{2}$	340.27	241.55	120.88
(12) ${\hat{S}}_{A_{4}}^{2}$	340.28	241.55	120.88
(13) ${\hat{S}}_{A_{5}}^{2}$	340.27	241.55	121.60
(14) ${\hat{S}}_{A_{6}}^{2}$	340.27	241.54	120.88
(15) ${\hat{S}}_{A_{7}}^{2}$	346.72	242.85	119.01
(16) ${\hat{S}}_{A_{8}}^{2}$	348.52	244.05	122.60

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Daraz, U.; Alomair, M.A.; Albalawi, O. Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data. Symmetry 2024, 16, 957. https://doi.org/10.3390/sym16080957

AMA Style

Daraz U, Alomair MA, Albalawi O. Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data. Symmetry. 2024; 16(8):957. https://doi.org/10.3390/sym16080957

Chicago/Turabian Style

Daraz, Umer, Mohammed Ahmed Alomair, and Olayan Albalawi. 2024. "Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data" Symmetry 16, no. 8: 957. https://doi.org/10.3390/sym16080957

APA Style

Daraz, U., Alomair, M. A., & Albalawi, O. (2024). Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data. Symmetry, 16(8), 957. https://doi.org/10.3390/sym16080957

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Variance Estimation under Some Transformation for Both Symmetric and Asymmetric Data

Abstract

1. Introduction

2. Concepts and Notations

3. Proposed Estimator

Properties of the Proposed Estimator

4. Mathematical Comparison

5. Numerical Comparison

5.1. Simulation Study

5.2. Numerical Examples

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Numerical Examples

Appendix B. Simulation Study

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI