Application of Log-Type Estimators for Addressing Non-Response in Survey Sampling Using Real Datasets

G. R. V. Triveni; Faizan Danish; Melfi Alrasheedi

doi:10.3390/math13071089

,

and

¹

Department of Mathematics, School of Advanced Sciences, VIT-AP University, Inavolu, Beside AP Secretariat, Amaravati AP-522237, India

²

Department of Quantitative Methods, School of Business, King Faisal University, Al-Ahsa 31982, Saudi Arabia

^*

Authors to whom correspondence should be addressed.

Mathematics2025, 13(7), 1089;https://doi.org/10.3390/math13071089

This article belongs to the Special Issue Applied Statistics in Real-World Problems

Version Notes

Order Reprints

Abstract

There is a difficulty in survey sampling when non-response (NR) occurs in the process of estimating the population parameters. This study examines the effectiveness of combined and separate log-type estimators when using bivariate auxiliary information when NR occurs in data. In this study, we propose families of novel log-type estimators under various scenarios. We performed an analysis on the reliability and efficiency of our proposed estimators in situations when NR occurs in both study and auxiliary variables and when NR occurs only in study variables. In this study, we have concentrated on certain issues like how the non-response effects the estimators’ efficiency, how different NR rates effect the precision of estimators, and how the combined and separate types of estimators handle the problem of NR. We proved the efficiency of our proposed estimators by using the bias and mean square error (MSE) metrics under different NR rates, illustrating the positive correlation between higher NR rates and increased errors. To evaluate the impact of NR on MSE values, we took four real datasets, which included a cost of living index dataset for 121 nations and another dataset which is essential for forecasting solar UV radiation hazards influenced by environmental factors, thus enhancing public health awareness and preventive strategies. Additionally, a simulation study comprising 10,000 iterations was also performed. This study provides survey practitioners with valuable guidance on selecting strong estimation methods to enhance the accuracy and efficiency of survey estimates in the context of non-response. This investigation contributes to the domain of survey sampling by demonstrating the robustness and effectiveness of log-type estimators. These estimators enhance survey findings by effectively addressing NR issues.

Keywords:

bias; log-type estimators; mean square error; non-response

MSC:

62D05

1. Introduction

Non-response in sample surveys presents a significant obstacle, as it has the potential to create bias and affect the dependability of survey findings. A sample may contain missing data due to factors such as illness, language barriers, participant reluctance, or unavailability during data collection. When conducting surveys on sensitive subjects such as drug use, abortion, or sexually transmitted diseases, non-response (NR) becomes more noticeable, as respondents may opt not to reveal information or decline to participate in the survey entirely. An NR has consequences that go beyond simply missing data. It can result in either underestimating or overestimating demographic metrics, affecting survey results’ reliability. Conventional methods frequently fail to address NR problems or assume that survey data are both complete and unbiased, disregarding the difficulties presented by individuals who do not respond.

Stratified random sampling is an important methodological strategy in survey research that helps deal with the complications caused by NR. Stratified sampling involves dividing the population into homogeneous subgroups, or strata, based on relevant criteria. This method assures that each subgroup is sufficiently represented in the sample. Stratification enhances the accuracy of estimates and allows researchers to customise tactics to minimise NR effects in each stratum. Within each stratum, it is possible to use particular tactics such as targeted follow-ups, alternate contact methods, or weighting adjustments to increase response rates and make up for missing data. These customised approaches are crucial for preserving the accuracy of survey findings, especially when addressing delicate topics that often have higher rates of NR. Furthermore, stratified random sampling allows for incorporating additional data pertaining to each stratum, such as demographic characteristics or past records. By utilising additional data, researchers can more effectively account for bias caused by NR, thus improving the accuracy and dependability of estimates for population parameters.

Many researchers worked on NR. This issue was initially addressed by [1]. Auxiliary information is essential in statistical estimation, as it offers supplementary data that can enhance the accuracy and precision of estimates. It plays a crucial role in decreasing the variability of estimators and correcting for biases, making it extremely beneficial in a wide range of sampling and estimation procedures. Accurately estimating the population mean is crucial in survey sampling, as it offers valuable insights about the population’s mean. Methods such as ratios, regression estimators, etc., utilise additional information to improve the accuracy and reliability of these estimations. When there is a lack of response, it is very crucial to use supplementary information. By including relevant auxiliary variables, researchers can mitigate biases caused by non-respondents. This application guarantees the resilience of the population mean estimation, even in the presence of missing data in the sample. As a result, the study’s conclusions maintain their integrity and validity. Various authors have addressed the NR issue in stratified random sampling. The authors of [2] suggested estimators for the population mean that combine different types of estimators. A two-phase sampling method was used to estimate the population mean when there is NR [3]. Moreover, ref. [4,5] proposed a family of estimators in the presence of NR, while [6] focused on estimating the population mean by using additional information. An estimator for stratified sampling using a single auxiliary variable was developed by [7].

Moreover, the importance of using bivariate or dual auxiliary information has been highlighted, because it can improve estimation accuracy and reduce biases caused by NR. By employing two correlated variables, dual auxiliary information provides a more precise approach that enables improved adjustments and minimises estimation mistakes. Several researchers investigated NR by utilising dual auxiliary information. In the non-response scenario, ref. [8] estimated the population mean and [9] developed a generalised exponential-type estimator under a stratified sampling scheme. The authors of [10] suggested a family of ratio estimators in the presence of NR issues and addressed them well. An efficient method was introduced by [11] to estimate the population mean based on bivariate auxiliary information. The authors of [12] have studied the problem of NR under simple random sampling. NR and measurement error are significant concerns in survey sampling, which can result in biased and inaccurate results if not well-handled. To address the issues of both the measurement error and NR, ref. [13] proposed some techniques. Further, ref. [14] introduced a ratio estimator in the presence of NR; also, ref. [15] developed a new estimator to estimate population mean, and [16] developed a generalised class of estimators for estimating population means in the presence of such conditions, specifically inside stratified random sampling frameworks. The authors of [17,18] proposed various methods in order to identify the optimum strata boundaries in stratified random sampling, [19] proposed ratio estimators by using regression methods, and [20] developed a modified regression estimator by incorporating regression techniques. Later, ref. [21] proposed a robust regression-type estimator, and [22,23] emphasised the formulation and use of robust regression-based ratio and regression estimators for population mean and variance, utilising auxiliary variables and advanced statistical methodologies in both simple and stratified random sampling. Recently, ref. [24] proposed exponential ratio and regression type estimators by using past sample means, and [25] suggested power and log-transformed ratio estimators. Later, refs. [26,27] proposed calibrated estimators for the issue of NR.

Few studies exist on log-type estimators that cover different situations, such as Optional Randomised Response Technique (ORRT) models, calibration estimators in stratified random sampling, population variance estimation, and information from a single auxiliary variable. There is a significant gap for studies that address various important factors like non-response, the use of bivariate auxiliary information, log-type families of estimators, and stratified random sampling. To fill this gap, we conducted research to develop families of log-type estimators that use dual auxiliary information in the context of stratified random sampling with different non-response rates.

The paper is organised as follows: Section 1 provides notations and procedures for estimating the population mean in the presence of NR. Section 2 presents the proposed classes of log-type estimators for all four cases. Section 3 uses four real datasets to conduct an empirical investigation and demonstrate the results. Section 4 of our study involves conducting a simulation to improve the accuracy of our results and showing the results by using trace plots. In Section 5, we provide an explanation of the results and discussion. Finally, Section 6 provides our study’s conclusions.

Procedure for Estimation of Population Mean When NR Occurs

Notations are as follows:

N =

the total population size;

L =

no. of strata;

N_{h} =

population size of the stratum

h

;

n_{h}

= sample size of the stratum

h

;

n_{1 h}

and

n_{2 h}

= the number of respondent and non-respondent persons in the sample of

h^{t h}

stratum, such that

n_{h} = n_{1 h} + n_{2 h}

;

m_{h}

= sub-sampled units from non-respondent group;

W_{h} = \frac{N_{h}}{N}

denotes the stratum weight;

W_{h (2)} = \frac{N_{2 h}}{N_{h}}

denotes the NR unit weight.

A finite population of size

N

is stratified into

L

homogeneous strata and the size of the

h^{t h}

stratum is

N_{h}

,

h = 1, 2 . . ., L .

Let

y_{h i} (x_{h i}, z_{h i})

represent the observations of the study variable

y

(auxiliary variables

x

and

z

) on the

i^{t h}

unit of

h^{t h}

stratum. Let

{\bar{y}}_{h} ({\bar{x}}_{h}, {\bar{z}}_{h})

represent the sample mean of the

h^{t h}

stratum corresponding to the population mean of

{\bar{Y}}_{h} ({\bar{X}}_{h}, {\bar{Z}}_{h})

. In order to choose a subset of n elements from

N

, we employ the simple random sampling without replacement (SRSWOR) method. We choose

n_{h}

units from the

h^{t h}

stratum such that

\sum_{h = 1}^{L} n_{h} .

Following [1],

m_{h}

units are selected from the non-respondent group

(i . e ., n_{2 h})

, which is random, and the selection is a proportion of the NR sampled units. Consequently, we choose

m_{h}

, such that

m_{h} = \frac{n_{2 h}}{g}

, where

g > 1

or

0 < (1 / g) < 1 .

Thus,

g = \frac{n_{2 h}}{m_{h}}

is treated as a constant chosen priorly.

To obtain the estimates of the stratum population mean, we combine the initial response and the response group

(n_{1 h})

and the data obtained from the NR group

{(n}_{2 h})

, i.e.,

m_{h}

sub-sampled units. The estimators for the population means of the study and auxiliary variables in the NR stratum are as follows:

{\bar{y}}_{h}^{*} = \frac{n_{1 h} {\bar{y}}_{1 h} + n_{2 h} {\bar{y}}_{m h}}{n_{h}}, {\bar{x}}_{h}^{*} = \frac{n_{1 h} {\bar{x}}_{1 h} + n_{2 h} {\bar{x}}_{m h}}{n_{h}} and {\bar{z}}_{h}^{*} = \frac{n_{1 h} {\bar{z}}_{1 h} + n_{2 h} {\bar{z}}_{m h}}{n_{h}}

{\bar{y}}_{1 h}

{(\bar{x}}_{1 h}, {\bar{z}}_{1 h})

is the sample mean of the study variable (auxiliary variables) based on

n_{1 h}

response units in the

h^{t h}

stratum and

{\bar{y}}_{m h} ({\bar{x}}_{m h}, {\bar{z}}_{m h})

is the sample mean of the study variable ((auxiliary variables) based on

m_{h}

response units in the

h^{t h}

stratum, where

m_{h} = \frac{n_{2 h}}{g} (g > 1

).

The estimator

{\bar{y}}_{h}^{*}

({\bar{x}}_{h}^{*}, {\bar{z}}_{h}^{*})

is unbiased for the population mean

{\bar{Y}}_{h}

({\bar{X}}_{h}, {\bar{Z}}_{h})

of the study (auxiliary variables) in the

h^{t h}

stratum. The variances of these estimators, as described by [1], are given by the following:

\begin{array}{c} V ({\bar{y}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{y h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{y h (2)}^{2} \\ V ({\bar{x}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{x h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{x h (2)}^{2} \\ V ({\bar{z}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{z h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{z h (2)}^{2} \end{array}

where

S_{y h}^{2} (S_{x h}^{2}, S_{z h}^{2})

is the population variance of the study variable (auxiliary variables) based on all units of

N_{h}

in the

h^{t h}

stratum.

S_{y h (2)}^{2} (S_{x h (2)}^{2}, S_{z h (2)}^{2})

is the population’s variance of the study variable (auxiliary variables) based on NR

(N_{2 h})

units in the

h^{t h}

stratum, and their weight is given as

W_{h (2)} = \frac{N_{2 h}}{N_{h}}

. The covariances of the estimators are given (in Equation (1)) by the following:

\begin{array}{c} c o v ({\bar{y}}_{h}^{*}, {\bar{x}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{y x h} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{y x h (2)} \\ c o v ({\bar{x}}_{h}^{*}, {\bar{z}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{x z h} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{x z h (2)} \\ c o v ({\bar{y}}_{h}^{*}, {\bar{z}}_{h}^{*}) = (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{y z h} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{y z h (2)} \end{array}

(1)

In the

h^{t h}

stratum,

S_{y x h} (S_{x z h}, S_{y z h})

is the population covariance of the study (auxiliary variables) based on response units

{(N}_{h})

and

S_{y x h (2)} (S_{x z h (2)}, S_{y z h (2)})

is the population covariance of the study (auxiliary variables) based on NR units

{(N}_{2 h})

.

Without using the auxiliary information, the NR-stratified estimator of the population mean of the study variable is given by the following:

{\bar{y}}_{s t}^{*} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*} = \sum_{h = 1}^{L} W_{h} (\frac{n_{1 h} {\bar{y}}_{1 h} + n_{2 h} {\bar{y}}_{m h}}{n_{h}}), where W_{h} = \frac{N_{h}}{N}

The variances of the estimators are given as follows:

\begin{array}{c} V ({\bar{y}}_{s t}^{*}) = \sum_{h = 1}^{L} W_{h}^{2} (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{y h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{y h (2)}^{2} \\ {\bar{x}}_{s t}^{*} = \sum_{h = 1}^{L} W_{h} {\bar{x}}_{h}^{*}, V ({\bar{x}}_{s t}^{*}) = \sum_{h = 1}^{L} W_{h}^{2} (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{x h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{x h (2)}^{2} \\ {\bar{z}}_{s t}^{*} = \sum_{h = 1}^{L} W_{h} {\bar{z}}_{h}^{*}, V ({\bar{z}}_{s t}^{*}) = \sum_{h = 1}^{L} W_{h}^{2} (\frac{1}{n_{h}} - \frac{1}{N_{h}}) S_{z h}^{2} + (\frac{g - 1}{n_{h}}) W_{h (2)} S_{z h (2)}^{2} \end{array}

A modified Hansen and Hurwitz [1] unbiased estimator for stratified sampling may be given as follows:

F_{0} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*}

(2)

where

{\bar{y}}_{h}^{*} = \frac{n_{h_{(1)}}}{n_{h}} {\bar{y}}_{(1) n_{h_{1}}} + \frac{n_{h_{(2)}}}{n_{h}} {\bar{y}}_{(2) r_{h}}

.

Here,

{\bar{y}}_{(1) n_{h_{1}}}

is the mean of

n_{h_{1}}

respondents on the first call,

{\bar{y}}_{(2) r_{h}}

is the mean of

r_{h}

units of respondents on the second call, and

{\bar{y}}_{h}^{*}

denotes the unbiased Hansen–Hurwitz estimator [1] of

{\bar{y}}_{h}

for stratum h. The variance of this estimator is presented in the Equation (3).

The variance of Equation (2) is given as follows:

V a r (F_{0}) = \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y h}^{2} + φ_{h}^{*} S_{y h (2)}^{2})

(3)

where

φ_{h} = (\frac{1}{n_{h}} - \frac{1}{N_{h}})

,

φ_{h}^{*} = (\frac{g - 1}{n_{h}}) W_{h (2)}

.

2. The Proposed Classes of Log-Type Estimators

This paper presents the development of sets of log-type estimators for stratified random sampling. These estimators utilise bivariate auxiliary information in the specified scenarios.

2.1. Non-Response Occurs in Both Study and Auxiliary Variables: Combined Log-Type Estimators

Case (i): NR in study and auxiliary variables complicates survey sampling. Combined log-type estimators use log transformations to improve estimating accuracy and reduce bias in NR. Now, we will thoroughly examine each scenario, analysing novel estimators and their corresponding bias and mean square error (MSE) expressions.

\begin{array}{c} F_{c_{1}} = α_{1} {\bar{y}}_{s t}^{*} (\frac{\bar{X}}{{\bar{x}}_{s t}^{*}}) {[1 + γ_{1} l o g (\frac{{\bar{z}}_{s t}^{*}}{\bar{Z}})]}^{β_{1}} \\ F_{c_{2}} = α_{2} {\bar{y}}_{s t}^{*} (\frac{{\bar{x}}_{s t}^{*}}{\bar{X}}) {[1 + γ_{2} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}^{*}})]}^{β_{2}} \\ F_{c_{3}} = {\bar{y}}_{s t}^{*} {(\frac{\bar{X}}{{\bar{x}}_{s t}^{*}})}^{α_{3}} {[1 - γ_{3} l o g (\frac{{\bar{z}}_{s t}^{*}}{\bar{Z}})]}^{β_{3}} \\ F_{c_{4}} = {\bar{y}}_{s t}^{*} {(\frac{{\bar{x}}_{s t}^{*}}{\bar{X}})}^{α_{4}} {[1 - γ_{4} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}^{*}})]}^{β_{4}} \end{array}

(4)

where the constants

α_{p}, β_{p}

, and

γ_{p}

are chosen in between 0 and 1,

p = 1,2, 3,4 .

Theorem 1.

The features of the proposed estimator

F_{c_{1}}

, including first-order approximation of bias and MSE, are provided below.

\begin{array}{c} {B i a s (F}_{c_{1}}) = \bar{Y} \{α_{1} [1 + χ_{x}^{*} - χ_{y x}^{*} + β_{1} γ_{1} (χ_{y z}^{*} - χ_{x z}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z}^{*})] - 1\} \\ {M S E (F}_{c_{1}}) = {\bar{Y}}^{2} {1 + α_{1}^{2} [1 + χ_{y}^{*} + 3 χ_{x}^{*} + χ_{z}^{*} β_{1} γ_{1} (β_{1} γ_{1} + 4 (χ_{y z}^{*} - χ_{x z}^{*})) - 4 χ_{y x}^{*}] \\ - 2 α_{1} [1 + χ_{x}^{*} - χ_{y x}^{*} + β_{1} γ_{1} (χ_{y z}^{*} - χ_{x z}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z}^{*})]} \end{array}

Proof.

See Appendix A. □

Theorem 2.

The features of the proposed estimator

F_{c_{2}}

, including first-order approximation of bias and MSE, are provided below.

\begin{array}{c} B i a s (F_{c_{2}}) = \bar{Y} \{α_{2} [1 + χ_{y x}^{*} - β_{2} γ_{2} (χ_{y z}^{*} + χ_{x z}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) χ_{z}^{*})] - 1\} \\ M S E (F_{c_{2}}) = {\bar{Y}}^{2} {1 + α_{2}^{2} [1 + χ_{y}^{*} + χ_{x}^{*} + χ_{z}^{*} β_{2} γ_{2} (β_{2} γ_{2} + {(β}_{2} - 1) {(γ}_{2} + 1)) + 4 χ_{y x}^{*}] \\ - 2 α_{2} [1 + χ_{y x}^{*} - β_{2} γ_{2} (χ_{y z}^{*} + χ_{x z}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) χ_{z}^{*})]} \end{array}

Proof.

See Appendix A. □

Theorem 3.

The features of the proposed estimator

F_{c_{3}}

, including first-order approximation of bias and MSE, are provided below.

\begin{array}{c} B i a s (F_{c_{3}}) = \bar{Y} \{- α_{3} χ_{y x}^{*} - β_{3} γ_{3} [χ_{y z}^{*} + α_{3} χ_{x z}^{*} - \frac{1}{2} {(β}_{3} - 1) (γ_{3} + 1) χ_{z}^{*}] + \frac{α_{3} (α_{3} + 1)}{2} χ_{x}^{*}\} \\ M S E (F_{c_{3}}) = {\bar{Y}}^{2} [χ_{y}^{*} + α_{3}^{2} χ_{x}^{*} + β_{3} γ_{3} (β_{3} γ_{3} χ_{z}^{*} - 2 χ_{y z}^{*} + 2 α_{3} χ_{x z}^{*}) - 2 α_{3} χ_{y x}^{*}] \end{array}

Proof.

See Appendix A. □

Theorem 4.

The features of the proposed estimator

F_{c_{4}}

, including first-order approximation of bias and MSE, are provided below.

\begin{array}{c} B i a s (F_{c_{4}}) = \bar{Y} \{α_{4} χ_{y x}^{*} + β_{4} γ_{4} [{χ_{y z}^{*} + α}_{4} χ_{x z}^{*} + \frac{1}{2} {(β}_{4} - 1) (γ_{4} - 1) χ_{z}^{*}] + \frac{α_{4} (α_{4} - 1)}{2} χ_{x}^{*}\} \\ {M S E (F_{c_{4}}) = \bar{Y}}^{2} [χ_{y}^{*} + α_{4}^{2} χ_{x}^{*} + β_{4} γ_{4} (β_{4} γ_{4} χ_{z}^{*} + 2 χ_{y z}^{*} + {2 α}_{4} χ_{x z}^{*}) + 2 α_{4} χ_{y x}^{*}] \end{array}

Proof.

See Appendix A. □

The Bias ((A2), (A5), (A8), (A11)) and MSE ((A3), (A6), (A9), (A12)) for all the estimators presented in (4) are obtained in Appendix A with proofs. Similarly, we will obtain the bias and MSE expressions for other estimators below by using error terms displayed in Table A1 and Table A2 (See Appendix A).

2.2. Non-Response Occurs in Both Study and Auxiliary Variables: Separate Log-Type Estimators

Case (ii): NR in the study and auxiliary variables complicate survey sampling. Separate log-type estimators use log transformations to improve estimating accuracy and reduce bias in NR.

\begin{array}{c} F_{s_{1}} = \sum_{h = 1}^{L} W_{h} α_{1_{h}} {\bar{y}}_{h}^{*} (\frac{{\bar{X}}_{h}}{{\bar{x}}_{h}^{*}}) {[1 + γ_{1_{h}} l o g (\frac{{\bar{z}}_{h}^{*}}{{\bar{Z}}_{h}})]}^{{β_{1}}_{h}} \\ F_{s_{2}} = \sum_{h = 1}^{L} W_{h} α_{2_{h}} {\bar{y}}_{h}^{*} (\frac{{\bar{x}}_{h}^{*}}{{\bar{X}}_{h}}) {[1 + γ_{2_{h}} l o g (\frac{{\bar{Z}}_{h}}{{\bar{z}}_{h}^{*}})]}^{β_{2_{h}}} \\ F_{s_{3}} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*} {(\frac{{\bar{X}}_{h}}{{\bar{x}}_{h}^{*}})}^{α_{3_{h}}} {[1 - γ_{3_{h}} l o g (\frac{{\bar{z}}_{h}^{*}}{{\bar{Z}}_{h}})]}^{β_{3_{h}}} \\ F_{s_{4}} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*} {(\frac{{\bar{x}}_{h}^{*}}{{\bar{X}}_{h}})}^{α_{4_{h}}} {[1 - γ_{4_{h}} l o g (\frac{{\bar{Z}}_{h}}{{\bar{z}}_{h}^{*}})]}^{β_{4_{h}}} \end{array}

(5)

By using expected values from Table A1, we derive the bias ((6), (8), (10), (12)) and MSE ((7), (9), (11), (13)) for the separate-type estimators presented in (5) as follows:

Theorem 5.

The features of the proposed estimator

F_{s_{1}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (F_{s_{1}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{1_{h}} [1 + ϕ_{x_{h}}^{*} - ϕ_{{y x}_{h}}^{*} + β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}}^{*} - ϕ_{{x z}_{h}}^{*} + \frac{1}{2} (β_{1_{h}} - 1) (γ_{1_{h}} - 1) ϕ_{z_{h}}^{*})] - 1\}

(6)

M S E (F_{s_{1}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} \{1 + α_{1_{h}}^{2} [1 + ϕ_{y_{h}}^{*} + 3 ϕ_{x_{h}}^{*} + ϕ_{z_{h}}^{*} β_{1_{h}}^{2} γ_{1_{h}}^{2} + 4 β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}}^{*} - ϕ_{{x z}_{h}}^{*}) - 4 ϕ_{{y x}_{h}}^{*}] - 2 α_{1_{h}} [(1 + ϕ_{x_{h}}^{*} - ϕ_{{y x}_{h}}^{*} + β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}}^{*} - ϕ_{{x z}_{h}}^{*} + \frac{1}{2} (β_{1_{h}} - 1)) (γ_{1_{h}} - 1) ϕ_{z_{h}}^{*})]\}

(7)

Theorem 6.

The features of the proposed estimator

F_{s_{2}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (F_{s_{2}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{2_{h}} [1 + ϕ_{{y x}_{h}}^{*} - β_{2_{h}} γ_{2_{h}} (ϕ_{{y z}_{h}}^{*} + ϕ_{{x z}_{h}}^{*} - \frac{1}{2} (β_{2_{h}} - 1) (γ_{2_{h}} + 1) ϕ_{z_{h}}^{*})] - 1\}

(8)

M S E (F_{s_{2}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} \{1 + α_{2_{h}}^{2} [1 + ϕ_{y_{h}}^{*} + ϕ_{x_{h}}^{*} + ϕ_{z_{h}}^{*} β_{2_{h}} γ_{2_{h}} (β_{2_{h}} γ_{2_{h}} + (β_{2_{h}} - 1) (γ_{2_{h}} + 1)) + 4 ϕ_{{y x}_{h}}^{*}] - 2 α_{2_{h}} [1 + ϕ_{{y x}_{h}}^{*} - β_{2_{h}} γ_{2_{h}} (ϕ_{{y z}_{h}}^{*} + ϕ_{{x z}_{h}}^{*} - \frac{1}{2} (β_{2_{h}} - 1) (γ_{2_{h}} - 1) ϕ_{z_{h}}^{*})]\}

(9)

Theorem 7.

The features of the proposed estimator

F_{s_{3}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (F_{s_{3}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{- α_{3_{h}} ϕ_{{y x}_{h}}^{*} - β_{3_{h}} γ_{3_{h}} [ϕ_{{y z}_{h}}^{*} + α_{3_{h}} ϕ_{{x z}_{h}}^{*} - \frac{1}{2} (β_{3_{h}} - 1) (γ_{3_{h}} + 1) ϕ_{z_{h}}^{*}] + \frac{α_{3_{h}} (α_{3_{h}} + 1)}{2} ϕ_{x_{h}}^{*}\}

(10)

M S E (F_{s_{3}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} [ϕ_{y_{h}}^{*} + α_{3_{h}}^{2} ϕ_{x_{h}}^{*} + β_{3_{h}} γ_{3_{h}} (β_{3_{h}} γ_{3_{h}} ϕ_{z_{h}}^{*} - 2 ϕ_{{y z}_{h}}^{*} + 2 α_{3_{h}} ϕ_{{x z}_{h}}^{*}) - 2 α_{3_{h}} ϕ_{{y x}_{h}}^{*}]

(11)

Theorem 8.

The features of the proposed estimator

F_{s_{4}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (F_{s_{4}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{4_{h}} ϕ_{{y x}_{h}}^{*} + β_{4_{h}} γ_{4_{h}} [ϕ_{{y z}_{h}}^{*} + α_{4_{h}} ϕ_{{x z}_{h}}^{*} + \frac{1}{2} (β_{4_{h}} - 1) (γ_{4_{h}} - 1) ϕ_{z_{h}}^{*}] + \frac{α_{4_{h}} (α_{4_{h}} - 1)}{2} ϕ_{x_{h}}^{*}\}

(12)

M S E (F_{s_{4}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} [ϕ_{y_{h}}^{*} + α_{4_{h}}^{2} ϕ_{x_{h}}^{*} + β_{4_{h}} γ_{4_{h}} (β_{4_{h}} γ_{4_{h}} ϕ_{z_{h}}^{*} + 2 ϕ_{{y z}_{h}}^{*} + 2 α_{4_{h}} ϕ_{{x z}_{h}}^{*}) + 2 α_{4_{h}} ϕ_{{y x}_{h}}^{*}]

(13)

2.3. Non-Response Occurs Only in Study Variable: Combined Log-Type Estimators

Case (iii): In cases where NR occurs only in the study variable, we developed combined log-type estimators. These estimators use log transformations to handle the missing data, enhancing the accuracy and reliability of the estimates despite the absence of responses in the study variable.

\begin{array}{c} T_{c_{1}} = α_{1} {\bar{y}}_{s t}^{*} (\frac{\bar{X}}{{\bar{x}}_{s t}}) {[1 + γ_{1} l o g (\frac{{\bar{z}}_{s t}}{\bar{Z}})]}^{β_{1}} \\ T_{c_{2}} = α_{2} {\bar{y}}_{s t}^{*} (\frac{{\bar{x}}_{s t}}{\bar{X}}) {[1 + γ_{2} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}})]}^{β_{2}} \\ T_{c_{3}} = {\bar{y}}_{s t}^{*} {(\frac{\bar{X}}{{\bar{x}}_{s t}})}^{α_{3}} {[1 - γ_{3} l o g (\frac{{\bar{z}}_{s t}}{\bar{Z}})]}^{β_{3}} \\ T_{c_{4}} = {\bar{y}}_{s t}^{*} {(\frac{{\bar{x}}_{s t}}{\bar{X}})}^{α_{4}} {[1 - γ_{4} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}})]}^{β_{4}} \end{array}

(14)

By using expected values from Table A2, we derive the bias ((15), (17), (19), (21)) and MSE ((16), (18), (20), (22)) for the combined-type estimators presented in (14) as follows:

Theorem 9.

The features of the proposed estimator

T_{c_{1}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{c_{1}}) = \bar{Y} \{α_{1} [1 + χ_{x} - χ_{y x} + β_{1} γ_{1} (χ_{y z} - χ_{x z} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z})] - 1\}

(15)

\begin{array}{c} M S E (T_{c_{1}}) = {\bar{Y}}^{2} {1 + α_{1}^{2} [1 + χ_{y}^{*} + 3 χ_{x} + β_{1}^{2} γ_{1}^{2} χ_{z} + 4 β_{1} γ_{1} (χ_{y z} - χ_{x z}) - 4 χ_{y x}] \\ - 2 α_{1} [(1 + χ_{x} - χ_{y x} + β_{1} γ_{1} (χ_{y z} - χ_{x z} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z})]} \end{array}

(16)

Theorem 10.

The features of the proposed estimator

T_{c_{2}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{c_{2}}) = \bar{Y} \{α_{2} [1 + χ_{y x} - β_{2} γ_{2} (χ_{y z} + χ_{x z} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) χ_{z})] - 1\}

(17)

\begin{array}{c} M S E (T_{c_{2}}) = {\bar{Y}}^{2} {1 + α_{2}^{2} [1 + χ_{y}^{*} + χ_{x} + χ_{z} β_{2} γ_{2} (β_{2} γ_{2} + {(β}_{2} - 1) {(γ}_{2} + 1)) + 4 χ_{y x}] \\ - 2 α_{2} [1 + χ_{y x} - β_{2} γ_{2} (χ_{y z} + χ_{x z} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} - 1) χ_{z})]} \end{array}

(18)

Theorem 11.

The features of the proposed estimator

T_{c_{3}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{c_{3}}) = \bar{Y} \{- α_{3} χ_{y x} - β_{3} γ_{3} [χ_{y z} + α_{3} χ_{x z} - \frac{1}{2} {(β}_{3} - 1) (γ_{3} + 1) χ_{z}] + \frac{α_{3} (α_{3} + 1)}{2} χ_{x}\}

(19)

M S E (T_{c_{3}}) = {\bar{Y}}^{2} [χ_{y}^{*} + α_{3}^{2} χ_{x} + β_{3} γ_{3} (β_{3} γ_{3} χ_{z} - 2 χ_{y z} + 2 α_{3} χ_{x z}) - 2 α_{3} χ_{y x}]

(20)

Theorem 12.

The features of the proposed estimator

T_{c_{4}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{c_{4}}) = \bar{Y} \{α_{4} χ_{y x} + β_{4} γ_{4} [{{χ_{y z} + α}_{4} χ}_{x z} + \frac{1}{2} {(β}_{4} - 1) (γ_{4} - 1) χ_{z}] + \frac{α_{4} (α_{4} - 1)}{2} χ_{x}\}

(21)

M S E {(T}_{c_{4}}) = {\bar{Y}}^{2} [χ_{y}^{*} + α_{4}^{2} χ_{x} + β_{4} γ_{4} (β_{4} γ_{4} χ_{z} + 2 χ_{y z} + {2 α}_{4} χ_{x z}) + 2 α_{4} χ_{y x}]

(22)

2.4. Non-Response Occurs Only on Study Variable: Separate Log-Type Estimators

Case (iv): In cases where NR occurs only in the study variable, we developed separate log-type estimators as follows:

\begin{array}{c} T_{s_{1}} = \sum_{h = 1}^{L} W_{h} α_{1_{h}} {\bar{y}}_{h}^{*} (\frac{{\bar{X}}_{h}}{{\bar{x}}_{h}}) {[1 + γ_{1_{h}} l o g (\frac{{\bar{z}}_{h}}{{\bar{Z}}_{h}})]}^{β_{1_{h}}} \\ T_{s_{2}} = \sum_{h = 1}^{L} W_{h} α_{2_{h}} {\bar{y}}_{h}^{*} (\frac{{\bar{x}}_{h}}{{\bar{X}}_{h}}) {[1 + γ_{2_{h}} l o g (\frac{{\bar{Z}}_{h}}{{\bar{z}}_{h}})]}^{β_{2_{h}}} \\ T_{s_{3}} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*} {(\frac{{\bar{X}}_{h}}{{\bar{x}}_{h}})}^{α_{3_{h}}} {[1 - γ_{3_{h}} l o g (\frac{{\bar{z}}_{h}}{{\bar{Z}}_{h}})]}^{β_{3_{h}}} \\ T_{s_{4}} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}^{*} {(\frac{{\bar{x}}_{h}}{{\bar{X}}_{h}})}^{α_{4_{h}}} {[1 - γ_{4_{h}} l o g (\frac{{\bar{Z}}_{h}}{{\bar{z}}_{h}})]}^{β_{4_{h}}} \end{array}

(23)

By using expected values from Table A2, we derive the bias ((24), (26), (28), (30)) and MSE ((25), (27), (29), (31)) for the separate-type estimators presented in (23) as follows:

Theorem 13.

The features of the proposed estimator

T_{s_{1}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{s_{1}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{1_{h}} [1 + ϕ_{x_{h}} - ϕ_{{y x}_{h}} + β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}} - ϕ_{{x z}_{h}} + \frac{1}{2} (β_{1_{h}} - 1) (γ_{1_{h}} - 1) ϕ_{z_{h}})] - 1\}

(24)

M S E (T_{s_{1}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} \{1 + α_{1_{h}}^{2} [1 + ϕ_{y_{h}}^{*} + 3 ϕ_{x_{h}} + ϕ_{z_{h}}^{*} β_{1_{h}}^{2} γ_{1_{h}}^{2} + 4 β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}} - ϕ_{{x z}_{h}}) - 4 ϕ_{{y x}_{h}}] - 2 α_{1_{h}} [1 + ϕ_{x_{h}} - ϕ_{{y x}_{h}} + β_{1_{h}} γ_{1_{h}} (ϕ_{{y z}_{h}} - ϕ_{{x z}_{h}} + \frac{1}{2} (β_{1_{h}} - 1) (γ_{1_{h}} - 1) ϕ_{z_{h}})]\}

(25)

Theorem 14.

The features of the proposed estimator

T_{s_{2}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{s_{2}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{2_{h}} [1 + ϕ_{{y x}_{h}} - β_{2_{h}} γ_{2_{h}} (ϕ_{{y z}_{h}} + ϕ_{{x z}_{h}} - \frac{1}{2} (β_{2_{h}} - 1) (γ_{2_{h}} + 1) ϕ_{z_{h}})] - 1\}

(26)

M S E (T_{s_{2}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} \{1 + α_{2_{h}}^{2} [1 + ϕ_{y_{h}}^{*} + ϕ_{x_{h}} + ϕ_{z_{h}} β_{2_{h}} γ_{2_{h}} (β_{2_{h}} γ_{2_{h}} + (β_{2_{h}} - 1) (γ_{2_{h}} + 1)) + 4 ϕ_{{y x}_{h}}] - 2 α_{2_{h}} [1 + ϕ_{{y x}_{h}} - β_{2_{h}} γ_{2_{h}} (ϕ_{{y z}_{h}} + ϕ_{{x z}_{h}} - \frac{1}{2} {(β}_{2_{h}} - 1) (γ_{2_{h}} - 1) ϕ_{z_{h}})]\}

(27)

Theorem 15.

The features of the proposed estimator

T_{s_{3}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{s_{3}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{- α_{3_{h}} ϕ_{y x} - β_{3_{h}} γ_{3_{h}} [ϕ_{{y z}_{h}} + α_{3_{h}} ϕ_{{x z}_{h}} - \frac{1}{2} (β_{3_{h}} - 1) (γ_{3_{h}} + 1) ϕ_{z_{h}}] + \frac{α_{3_{h}} (α_{3_{h}} + 1)}{2} ϕ_{x_{h}}\}

(28)

M S E (T_{s_{3}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} [ϕ_{y_{h}}^{*} + α_{3_{h}}^{2} ϕ_{x_{h}} + β_{3_{h}} γ_{3_{h}} (β_{3_{h}} γ_{3_{h}} ϕ_{z_{h}} - 2 ϕ_{{y z}_{h}} + 2 α_{3_{h}} ϕ_{{x z}_{h}}) - 2 α_{3_{h}} ϕ_{{y x}_{h}}]

(29)

Theorem 16.

The features of the proposed estimator

T_{s_{4}}

, including first-order approximation of bias and MSE, are provided below.

B i a s (T_{s_{4}}) = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h} \{α_{4_{h}} ϕ_{{y x}_{h}} + β_{4_{h}} γ_{4_{h}} [ϕ_{{y z}_{h}} + α_{4_{h}} ϕ_{{x z}_{h}} + \frac{1}{2} (β_{4_{h}} - 1) (γ_{4_{h}} - 1) ϕ_{z_{h}}] + \frac{α_{4_{h}} (α_{4_{h}} - 1)}{2} ϕ_{x_{h}}\}

(30)

M S E (T_{s_{4}}) = \sum_{h = 1}^{L} W_{h}^{2} {\bar{Y}}_{h}^{2} [ϕ_{y_{h}}^{*} + α_{4_{h}}^{2} ϕ_{x_{h}} + β_{4_{h}} γ_{4_{h}} (β_{4_{h}} γ_{4_{h}} ϕ_{z_{h}} + 2 ϕ_{{y z}_{h}} + 2 α_{4_{h}} ϕ_{{x z}_{h}}) + 2 α_{4_{h}} ϕ_{{y x}_{h}}]

(31)

3. Empirical Study

The statistics of two stratified populations are provided in Table A3 and Table A4 (see Appendix A) for the purpose of evaluating the performance of the recommended family of estimators under stratified random sampling in the presence of NR. We have taken the references of the datasets for the first and second population from [28]. Each stratum in the two populations has its own sample size determined via Neyman allocation.

3.1. First Population

In 2007, we analysed data from 923 districts in Turkey, which were categorised into six regions: Marmara, Aegean, Mediterranean, Central Anatolia, Black Sea, and East and Southeast Anatolia. The dependent variable

(Y)

in the study represented the number of teachers, while the first auxiliary variable

(X)

represented the number of pupils. The second auxiliary variable

(Z)

represented the number of courses in primary and secondary schools.

3.2. Second Population

In Pakistan’s flood-affected districts, there are 6940 male families and 1678 female families. For our study, we are considering food expenditures as the main variable

(Y),

household wages as an additional variable

(X),

as well as overall spending during May 2011 as another auxiliary variable

(Z) .

The data of the first population and the second population are shown in Table A3 and Table A4, respectively. Both tables contain all the relevant data necessary for calculating the bias and MSE of the suggested estimators. The covariances and correlation coefficients of the variables are displayed, along with the corresponding NR rates of 10%, 20%, and 30%, accordingly.

Table 1 displays the bias and MSE values for the combined

(F_{c_{1}}, F_{c_{2}}, F_{c_{3}}, F_{c_{4}})

and separate

(F_{s_{1}}, F_{s_{2}}, F_{s_{3}}, F_{s_{4}})

log-type classes of estimators when NR occurs in both study and auxiliary variables for the first population. Here, we analyse the occurrence of NR in percentages of 10, 20, and 30. In addition, we examine the various values of

g (2, 2.5, 3)

and record the corresponding metric values. At each stage of

g

, we can observe an increase in both the values and NR rates, indicating a rising inaccuracy. At a value of

g = 2

, the error values for combined estimators are 8095.37, 8157.32, and 8176.98 at 10%, 20%, and 30%, correspondingly. Additionally, we may notice the corresponding change for the separate-type estimators.

Table 1. NR occurs in both study and auxiliary variables: first population.

In the situation of where NR occurs only in the study variable, Table 2 provides a detailed explanation of the bias and MSE values for various estimators under different NR rates and values of the parameter

g .

The estimators are categorised into a combined type

(T_{c_{1}}, T_{c_{2}}, T_{c_{3}}, T_{c_{4}})

and separate type (

T_{s_{1}}, T_{s_{2}}, T_{s_{3}}, T_{s_{4}}) .

The NR rates considered are 10%, 20%, and 30%, while the values of

g

are 2, 2.5, and 3. For each combination of the NR rate and

g

, the table lists the bias and MSE for each estimator.

Table 2. NR occurs only in study variable: first population.

For instance, at a 10% (20%, 30%) NR rate and

g

= 2, the MSE values of combined and separate-type estimators are

T_{c_{1}} = 2094.46 (2587.44, 2649.19)

and

T_{s_{1}} = 315.91 (493.27, 545.86) .

At a 10% (20%, 30%) NR rate and

g

= 2.5, the MSE values of the second class of estimators for combined and separate-type estimators are

T_{c_{2}} = 15,021.27 (15,498.74, 15,695.27)

and

T_{s_{2}} = 4193.40 (4607.71, 4742.26) .

We can observe a constant increase in error when increasing NR rates in all the estimators.

For different combined and separate estimators

{(F}_{c_{1}}, F_{c_{2}}, F_{c_{3}}, F_{c_{4}})

and

(F_{s_{1}}, F_{s_{2}}, F_{s_{3}}, F_{s_{4}})

, the bias and MSE are compared in the table for different NR rates (10%, 20%, 30%) and values of

g

(2, 2.5, 3). In the situation where NR occurs in both study and auxiliary variables, Table 3 provides the bias and MSE for every estimator for every NR rate and

g

value. This trend holds true for both combined and separate forms of estimators; combined types tend to have larger bias and MSE values, while separate types do better with decreasing bias and error as the number of NRs increase.

Table 3. NR occurs in both study and auxiliary variables: second population.

In case where NR occurs only in the study variable, using a variety of NR rates, Table 4 compares the effectiveness of combined and separate-type estimators with regard to bias and MSE metrics. It demonstrates that estimator robustness differs under various circumstances and shows how biases and MSEs change when the NR rates increase. It appears that separate-type estimators may be better at reducing the impact of NR in stratified random sampling than combined-type estimators, since they typically have lower biases and MSEs.

Table 4. NR occurs only in study variable: second population.

3.3. Third Population

This is the real dataset that we obtained from [29]. The dataset includes the cost of living index for 121 nations, covering the period from 31 December 2023 to 29 June 2024. We obtained the data from the Kaggle website on 25 July 2024. The dataset contains columns such as rank, country, cost of living index, rent index, cost of living plus rent index, groceries index, restaurant price index, and local purchasing power index. Our interest lies in estimating the cost of living plus rent index, which we will refer to as the study variable

(Y) .

We have chosen the cost of living index and rent index as auxiliary variables

(X, Z) .

We have categorised the data into three strata based on the cost of living index: low

(< 40),

medium

(41 \leq i n d e x < 65),

and high

(\geq 65) .

Next, we implement Neyman allocation to pick the sample and examine various NR rates. Table 5 and Table 6 contain the bias and MSE results of different NR rates with different

g

values.

Table 5. NR occurs in both study and auxiliary variables: third population.

Table 6. NR occurs only in study variable: third population.

3.4. Fourth Population

We acquired this dataset from [30], which helps to enhance the predictions of UV radiation exposure, and the health risks associated with it. The dataset consists of columns like month, day, solar radiation, cloud cover, ozone level, altitude, UV index, and UV risk level. Out of these, the target variable we choose here is solar radiation and the auxiliary variables are cloud cover and ozone level, which are highly correlated with the target variable. There are a total of 1000 rows, which are divided into three strata (of sizes 304, 356, and 340) and selected samples for each stratum by using Neyman Allocation. Table 7 and Table 8 shows the results of the dataset as follows:

Table 7. NR occurs in both study and auxiliary variables: fourth population.

Table 8. NR occurs only in study variable: fourth population.

4. Simulation Study

A simulation study is conducted to assess the influence of various NR rates on stratified sample estimators across 10,000 iterations. This study comprises six distinct groups, known as strata. In each iteration, the simulation randomly selects a population size between 100 and 200 persons for each stratum. The starting sample sizes for each group range from 20 to 40 individuals. The NR rates are established at 10%, 20%, and 30%, with corresponding sample sizes after NR ranging from 50 to 100. The code generates hypothetical means and standard deviations for three variables

(Y, X, a n d Z)

as well as their correlations for each stratum. The values are sampled from uniform distributions with the following ranges: mean values for

Y

(200 to 800),

X

(5000 to 25,000), and Z (200 to 600). The aggregated means and covariances for the entire population are calculated by applying weighted sums, where the weights are determined by the sizes of each stratum within the population. This comprehensive configuration enables the analysis of how varying degrees of NR impact the precision and dependability of the suggested estimators in stratified sampling. Table 9 provides the bias and MSE values of the suggested families of log-type estimators when NR occurs in both the study and auxiliary variables.

Table 9. NR occurs in both study and auxiliary variables: simulation study.

The findings of the bias and MSE for various NR rates when NR occurs only in the study variable are provided in Table 10, along with various values of

g

, by taking 10,000 iterations. The average values of the metrics for all the iterations are recorded in Table 10 for both the combined and separate log-type family of estimators.

Table 10. NR occurs in only study variable: simulation study.

5. Results and Discussion

This study extensively examines various NR rates and their effects on the bias and MSE (both combined and separate) in the context of stratified sampling. Our discussion mostly focused on four scenarios (case (i)–(iv)), where we developed theorems and proofs for all four estimators proposed in each case. In addition, we conducted this study for comparisons on several factors, including MSE increases with an increase rate in NR (proved through Table 1 and Table 2) and the efficiency of separate-type estimators compared to combined-type estimators (using Table 3 and Table 4). We also conducted a comparison with the MSE. The efficiency is high when NR occurs in both study and auxiliary variables and it is lower when NR occurs only in a study variable (proved by using Table 5 and Table 6). This is discussed further in this section.

In the numerical study, for all four real datasets, as we mentioned earlier, the values of constants

α_{p}, β_{p}

, and

γ_{p}

are chosen in between 0 and 1,

p = 1,2, 3,4 .

In our study, we consider the values of constants

α_{1}, β_{1}

and

γ_{1} a s 0.8, α_{2}, β_{2}

and

γ_{3} a s 0.9, α_{3}, β_{3}

and

γ_{3} a s 0.4, a n d α_{4}, β_{4}

and

γ_{4} a s 0.3

for Table 5, and the values of constants

α_{1}, β_{1}

and

γ_{1} a s 1, α_{2}, β_{2}

and

γ_{3} a s 0.9, α_{3}, β_{3}

and

γ_{3} a s 0.6, a n d α_{4}, β_{4}

and

γ_{4} a s 0.3 .

By utilising Table 1, we place a specific emphasis on case (i), where NR occurs in both the study and auxiliary variables. Table 1 clearly demonstrates that the MSE values rise in parallel with the growth in both the NR rates and the values of the constant

g

. In addition, we displayed this information visually in Figure 1 and Figure 2. The estimators

(F_{c_{1}}, F_{c_{2}}, F_{c_{3}}, F_{c_{4}})

are denoted collectively as (a), (b), (c), and (d) in Figure 1. In Figure 2, the separate estimators

(F_{s_{1}}, F_{s_{2}}, F_{s_{3}}, F_{s_{4}})

are labelled separately as (a), (b), (c), and (d). It is observed that the MSE values consistently increase with the NR rates.

Figure 1. MSE values for different NR rates for the combined families of estimators in the case that NR occurs in both study and auxiliary variables for first population. (a) MSE of

F_{c_{1}}

with increase of NR rates; (b) MSE of

F_{c_{2}}

across different NR rates; (c) MSE of

F_{c_{3}}

under various NR rates; (d) MSE of

F_{c_{4}}

at 10%, 20% and 30% of NR.

Figure 2. MSE values for different NR rates for the separate families of estimators in the case that NR occurs in both study and auxiliary variables for first population. (a) MSE of

F_{s_{1}}

with increase of NR rates; (b) MSE of

F_{s_{2}}

across different NR rates; (c) MSE of

F_{s_{3}}

under various NR rates; (d) MSE of

F_{s_{4}}

at 10%, 20% and 30% of NR.

The efficiencies of combined and separate estimators were compared using the results from Table 2, focusing on case (ii), where NR occurs only in the study variable. It was observed that separate estimators generally exhibit greater efficiency than combined estimators when they are evaluated using MSE metrics. Figure 3a–d illustrate comparisons between the estimators

(T_{c_{1}}, T_{s_{1}}), (T_{c_{2}}, T_{s_{2}}), (T_{c_{3}}, T_{s_{3}}) a n d (T_{c_{4}}, T_{s_{4}})

, respectively. The observed differences in MSE values between combined and separate estimators may depend on the nature and properties of the datasets. Table 3 and Table 4 demonstrate that MSE values are higher when NR occurs in both variables compared to when it only affects the study variable. For example, with a 10% NR rate and

g

= 2, the MSE of

F_{c_{1}}

is greater than

T_{c_{1}}

(58.93 > 34.37), emphasising the significance of response rates in data analysis. Similarly, at

g

= 2.5, the MSE of

F_{c_{1}}

surpasses

T_{c_{1}}

(60.04 compared to 34.67), and at

g

= 3,

F_{c_{1}}

exceeds

T_{c_{1}}

(61.14 compared to 34.98).

Figure 3. Comparing combined-type estimators and separate-type estimators when NR occurs only in study variable for first population. (a) Comparison of MSEs of

T_{c_{1}}

and

T_{s_{1}}

with increase of NR rates; (b) Comparison of MSEs of

T_{c_{2}}

and

T_{s_{2}}

with rise of NR rates; (c) Comparison of MSEs of

T_{c_{3}}

and

T_{s_{3}}

with increase of NR rates; (d) Comparison of MSEs of

T_{c_{4}}

and

T_{s_{4}}

with rise of NR rates.

The analysis indicates that the NR rates significantly impact the bias and MSE of the cost of living plus rent index estimation. Higher NR rates generally lead to an increased bias and MSE (presented in Table 5 and Table 6), indicating the importance of addressing NR in survey designs to ensure accurate and reliable estimates. Overall, the stratification by the cost of living index and the use of Neyman allocation provided a robust framework for sampling. The auxiliary variables (cost of living index and rent index) were effective in improving the estimation of the study variable

Y

. The same results are observed in the fouth population, which predicts the solar UV radiation from Table 7 and Table 8. Further investigations into specific

g

values and their effects on the bias and MSE could provide more insights for optimising sampling strategies in future studies. As the sample size increases, the efficiency of the proposed estimators increases as well.

In the simulation study, we carried out 10,000 iterations to obtain the best values for the bias and MSE; we depict the bias and MSE values for each iteration in Figure 4 and Figure 5. Here, in graphs 4 and 5, we consider the combined-type estimators from Table 9, and the plots are taken for the bias and MSE values of

F_{c_{1}}, F_{c_{2}}, F_{c_{2}}, F_{c_{4}}

(in graph, bias is denoted by Bc1, Bc2, Bc3, and Bc4, and MSEs are denoted by Mc1, Mc2, Mc3, and Mc4) under 10%, 20%, 30% of NR rates.

Figure 4. Trace plots depicting the bias values for various NR rates in the simulation study.

Figure 5. Trace plots depicting the MSE values for various NR rates in the simulation study.

6. Conclusions

In this study, we researched the nature of non-response (NR) in different situations and conditions under a stratified sampling scheme. We proposed combined and separate log-type families of estimators and derived their bias and MSE metric equations in the form of theorems and proofs. In addition, we conducted comparisons on several factors, which are discussed in the points below. For proving these arguments, we have utilised four real datasets and a simulation study with 10,000 iterations. The results presented in various tables clearly showed the results of the arguments and additionally, the graphs clearly represented the results.

The effect of varying NR rates (10% (low), 20% (medium), and 30% (high)) on MSE values was shown.
Evaluations were proivded of how well combined-type and separate-type estimators perform in the presence of NR.
The dissimilarity of MSEs of the same estimators under two NR scenarios was shown: one in which NR is present in both the study and auxiliary variables, and the other in which NR is present only in the study variable.
In this simulated study, we looked at different non-response rates in survey data and compared the biases and MSEs of combined and separate estimators. The performance and reliability of the estimators can be understood by methodically examining these measures across many circumstances. Survey practitioners might use the results as a reference when choosing reliable estimation methods. The results help improve the efficiency and precision of survey estimations when non-response is present.

Scope for Future Work: The study presented log-type estimators across different non-response scenarios, utilising both real datasets and simulations. Future research may investigate systematic methods for selecting tuning parameters and addressing multicollinearity in auxiliary variables. Furthermore, the integration of these estimators with conventional survey weighting methods may improve their applicability. Alternative transformations, such as Box–Cox transformations, may be utilised for the purpose of variance stabilisation. In conclusion, the comparison of log-type estimators with traditional estimators regarding the Percentage Relative Efficiency (PRE) represents a significant avenue for future research.

Author Contributions

Conceptualisation, G.R.V.T. and F.D.; methodology, F.D. and G.R.V.T.; software, G.R.V.T.; validation, G.R.V.T. and F.D.; formal analysis, G.R.V.T. and M.A.; investigation, F.D. and M.A.; resources, F.D. and G.R.V.T.; data curation, G.R.V.T.; writing—original draft preparation, G.R.V.T. and F.D.; writing—review and editing, F.D. and M.A.; visualisation, G.R.V.T. and F.D.; supervision, F.D. and M.A.; project administration, F.D. and M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant No. KFU251230].

Data Availability Statement

Datasets are publicly available at: https://www.numbeo.com/cost-of-living/rankings.jsp & https://www.kaggle.com/datasets/ziya07/solar-uv-radiation (Accessed on 25 July 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. The following error terms and their expectations were used to obtain bias and MSE when NR exists in both the study and auxiliary variables.

For Combined Estimators

For Separate Estimators

δ_{0_{s t}}^{*} = \frac{{\bar{y}}_{s t}^{*} - \bar{Y}}{\bar{Y}}

,

δ_{1_{s t}}^{*} = \frac{{\bar{x}}_{s t}^{*} - \bar{X}}{\bar{X}}

and

δ_{2_{s t}}^{*} = \frac{{\bar{z}}_{s t}^{*} - \bar{Z}}{\bar{Z}}

E (δ_{i_{s t}}^{*}) = 0, i = 0,1, 2

δ_{0_{h}}^{*} = \frac{{\bar{y}}_{h}^{*} - \bar{Y}}{\bar{Y}}

,

δ_{1_{h}}^{*} = \frac{{\bar{x}}_{h}^{*} - \bar{X}}{\bar{X}}

and

δ_{2_{h}}^{*} = \frac{{\bar{z}}_{h}^{*} - \bar{Z}}{\bar{Z}}

E (δ_{i_{h}}^{*}) = 0, i = 0,1, 2

.

E (δ_{0_{s t}}^{* 2}) = \frac{1}{{\bar{Y}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y h}^{2} + φ_{h}^{*} S_{y h (2)}^{2}) = χ_{y}^{*}

E (δ_{1_{s t}}^{* 2}) = \frac{1}{{\bar{X}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{x h}^{2} + φ_{h}^{*} S_{x h (2)}^{2}) = χ_{x}^{*}

E (δ_{2_{s t}}^{* 2}) = \frac{1}{{\bar{Z}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{z h}^{2} + φ_{h}^{*} S_{z h (2)}^{2}) = χ_{z}^{*}

E (δ_{0_{s t}}^{*} δ_{1_{s t}}^{*}) = \frac{1}{\bar{Y} \bar{X}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y x h} + φ_{h}^{*} S_{y x h (2)}) = χ_{y x}^{*}

E (δ_{1_{s t}}^{*} δ_{2_{s t}}^{*}) = \frac{1}{\bar{X} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{x z h} + φ_{h}^{*} S_{x z h (2)}) = χ_{x z}^{*}

E (δ_{0_{s t}}^{*} δ_{2_{s t}}^{*}) = \frac{1}{\bar{Y} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y z h}^{2} + φ_{h}^{*} S_{y z h (2)}^{2}) = χ_{y z}^{*}

E (δ_{0_{h}}^{* 2}) = \frac{1}{{\bar{Y}}_{h}^{2}} (φ_{h} S_{y h}^{2} + φ_{h}^{*} S_{y h (2)}^{2}) = ϕ_{y_{h}}^{*}

E (δ_{1_{h}}^{* 2}) = \frac{1}{{\bar{X}}_{h}^{2}} (φ_{h} S_{x h}^{2} + φ_{h}^{*} S_{x h (2)}^{2}) = ϕ_{x_{h}}^{*}

E (δ_{2_{h}}^{* 2}) = \frac{1}{{\bar{Z}}_{h}^{2}} (φ_{h} S_{z h}^{2} + φ_{h}^{*} S_{z h (2)}^{2}) = ϕ_{z_{h}}^{*}

E (δ_{0_{h}}^{*} δ_{1_{h}}^{*}) = \frac{1}{{\bar{Y}}_{h} {\bar{X}}_{h}} (φ_{h} S_{y x h}^{2} + φ_{h}^{*} S_{y x h (2)}^{2}) = ϕ_{{y x}_{h}}^{*}

E (δ_{1_{h}}^{*} δ_{2_{h}}^{*}) = \frac{1}{{\bar{X}}_{h} {\bar{Z}}_{h}} (φ_{h} S_{x z h}^{2} + φ_{h}^{*} S_{x z h (2)}^{2}) = ϕ_{{x z}_{h}}^{*}

E (δ_{0_{h}}^{*} δ_{2_{h}}^{*}) = \frac{1}{{\bar{Y}}_{h} {\bar{Z}}_{h}} (φ_{h} S_{y z h}^{2} + φ_{h}^{*} S_{y z h (2)}^{2}) = ϕ_{{y z}_{h}}^{*}

Table A2. The following error terms and their expectations were used to obtain bias and MSE when NR exists only in the study variable.

For Combined Estimators

For Separate Estimators

δ_{y_{s t}}^{*} = \frac{{\bar{y}}_{s t}^{*} - \bar{Y}}{\bar{Y}}

,

δ_{x_{s t}} = \frac{{\bar{x}}_{s t} - \bar{X}}{\bar{X}}

and

δ_{z_{s t}} = \frac{{\bar{z}}_{s t} - \bar{Z}}{\bar{Z}}

E (δ_{y_{s t}}^{*}) = E (δ_{i_{s t}}) = 0, i = x, z

.

δ_{y_{h}}^{*} = \frac{{\bar{y}}_{h}^{*} - \bar{Y}}{\bar{Y}}

,

δ_{x_{h}} = \frac{{\bar{x}}_{h} - \bar{X}}{\bar{X}}

and

δ_{z_{h}} = \frac{{\bar{z}}_{h} - \bar{Z}}{\bar{Z}}

E (δ_{y_{h}}^{*}) = E (δ_{i_{h}}^{*}) = 0, i = x, z

.

E (δ_{y_{s t}}^{* 2}) = \frac{1}{{\bar{Y}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y h}^{2} + φ_{h}^{*} S_{y h (2)}^{2}) = χ_{y}^{*}

E (δ_{x_{s t}}^{2}) = \frac{1}{{\bar{X}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{x h}^{2}) = χ_{x}

E (δ_{z_{s t}}^{2}) = \frac{1}{{\bar{Z}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{z h}^{2}) = χ_{z}

E (δ_{y_{s t}}^{*} δ_{x_{s t}}) = \frac{1}{\bar{Y} \bar{X}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y x h}) = χ_{y x}

E (δ_{x_{s t}} δ_{z_{s t}}) = \frac{1}{\bar{X} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{x z h}) = χ_{x z}

E (δ_{y_{s t}}^{*} δ_{z_{s t}}) = \frac{1}{\bar{Y} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} (φ_{h} S_{y z h}) = χ_{y z}

E (δ_{y_{h}}^{* 2}) = \frac{1}{{\bar{Y}}_{h}^{2}} (φ_{h} S_{y h}^{2} + φ_{h}^{*} S_{y h}^{2}) = ϕ_{y_{h}}^{*}

E (δ_{x_{h}}^{2}) = \frac{1}{{\bar{X}}_{h}^{2}} (φ_{h} S_{x h}^{2}) = ϕ_{x_{h}}

E (δ_{z_{h}}^{2}) = \frac{1}{{\bar{Z}}_{h}^{2}} (φ_{h} S_{z h}^{2}) = ϕ_{z_{h}}

E (δ_{y_{h}}^{*} δ_{x_{h}}) = \frac{1}{{\bar{Y}}_{h} {\bar{X}}_{h}} (φ_{h} S_{y x h}) = ϕ_{{y x}_{h}}

E (δ_{x_{h}} δ_{z_{h}}) = \frac{1}{{\bar{X}}_{h} {\bar{Z}}_{h}} (φ_{h} S_{x z h}) = ϕ_{{x z}_{h}}

E (δ_{y_{h}}^{*} δ_{z_{h}}) = \frac{1}{{\bar{Y}}_{h} {\bar{Z}}_{h}} (φ_{h} S_{y z h}) = ϕ_{{y z}_{h}}

Proof of Theorem 1.

To obtain the expressions of bias and MSE for

F_{c_{1}},

we need to transform the

F_{c_{1}}

in terms of

δ_{s t}^{*}

’s (by using Table A1) as follows:

F_{c_{1}} = α_{1} {\bar{y}}_{s t}^{*} (\frac{\bar{X}}{{\bar{x}}_{s t}^{*}}) {[1 + γ_{1} l o g (\frac{{\bar{z}}_{s t}^{*}}{\bar{Z}})]}^{β_{1}} = α_{1} \bar{Y} (1 + δ_{0_{s t}}^{*}) (1 + δ_{1_{s t}}^{*}) {[1 + γ_{1} l o g (1 + δ_{2_{s t}}^{*})]}^{β_{1}}

(by using Table A1).

By deviating

\bar{Y}

on both sides of the above equation, we obtain the following:

F_{c_{1}} - \bar{Y} = \bar{Y} \{α_{1} [1 + δ_{0_{s t}}^{*} - δ_{1_{s t}}^{*} - δ_{0_{s t}}^{*} δ_{1_{s t}}^{*} + β_{1} γ_{1} (δ_{2_{s t}}^{*} + {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} - {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) δ_{2_{s t}}^{* 2})] - 1\}

(A1)

By applying expectations on both sides of the (A1), we acquire the bias expression as follows:

{B i a s (F}_{c_{1}}) = \bar{Y} \{α_{1} [1 + χ_{x}^{*} - χ_{y x}^{*} + β_{1} γ_{1} (χ_{y z}^{*} - χ_{x z}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z}^{*})] - 1\}

(A2)

On squaring both sides of the (A1),

{(F_{c_{1}} - \bar{Y})}^{2} = {\bar{Y}}^{2} \{1 + α_{1}^{2} [1 + δ_{0_{s t}}^{* 2} + 3 δ_{1_{s t}}^{* 2} + δ_{2_{s t}}^{* 2} β_{1} γ_{1} (β_{1} γ_{1} + 4 ({δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} - {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*})) - 4 {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*}] - 2 α_{1} [1 + δ_{1_{s t}}^{* 2} - {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*} + β_{1} γ_{1} ({δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} - {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) δ_{2_{s t}}^{* 2})]\}

by applying expectation both sides of above equation, we obtain the MSE as follows:

{M S E (F}_{c_{1}}) = {\bar{Y}}^{2} \{1 + α_{1}^{2} [1 + χ_{y}^{*} + 3 χ_{x}^{*} + χ_{z}^{*} β_{1} γ_{1} (β_{1} γ_{1} + 4 (χ_{y z}^{*} - χ_{x z}^{*})) - 4 χ_{y x}^{*}] - 2 α_{1} [1 + χ_{x}^{*} - χ_{y x}^{*} + β_{1} γ_{1} (χ_{y z}^{*} - χ_{x z}^{*} + \frac{1}{2} {(β}_{1} - 1) (γ_{1} - 1) χ_{z}^{*})]\}

(A3)

□

Proof of Theorem 2.

To obtain the expressions of the bias and MSE for

F_{c_{2}},

we need to transform the

F_{c_{2}}

in terms of

δ_{s t}^{*}

’s (by using Table 1) as follows:

F_{c_{2}} = α_{2} {\bar{y}}_{s t}^{*} (\frac{{\bar{x}}_{s t}^{*}}{\bar{X}}) {[1 + γ_{2} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}^{*}})]}^{β_{2}} = α_{2} \bar{Y} (1 + δ_{0_{s t}}^{*}) (1 + δ_{1_{s t}}^{*}) {[1 + γ_{2} l o g ({1 + δ_{2_{s t}}^{*})}^{- 1}]}^{β_{2}}

(by using Table A1).

By deviating

\bar{Y}

on both sides of the above equation, we obtain the following:

F_{c_{2}} - \bar{Y} = \bar{Y} \{α_{2} [1 + δ_{0_{s t}}^{*} + δ_{1_{s t}}^{*} + δ_{0_{s t}}^{*} δ_{1_{s t}}^{*} - β_{2} γ_{2} (δ_{2_{s t}}^{*} + {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} + {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) δ_{2_{s t}}^{* 2})] - 1\}

(A4)

By applying expectations on both sides of (A4), we acquire the bias expression as follows:

{B i a s (F}_{c_{2}}) = \bar{Y} \{α_{2} [1 + χ_{y x}^{*} - β_{2} γ_{2} (χ_{y z}^{*} + χ_{x z}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) χ_{z}^{*})] - 1\}

(A5)

On squaring both sides of (A4),

{(F_{c_{2}} - \bar{Y})}^{2} = {\bar{Y}}^{2} \{1 + α_{2}^{2} [1 + δ_{0_{s t}}^{* 2} + δ_{1_{s t}}^{* 2} + δ_{2_{s t}}^{* 2} β_{2} γ_{2} (β_{2} γ_{2} + {(β}_{2} - 1) {(γ}_{2} + 1)) + 4 {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*}] - 2 α_{2} [1 + {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*} - β_{2} γ_{2} ({δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} + {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) δ_{2_{s t}}^{* 2})]\}

by applying expectations on both sides of the above equation, we obtain the MSE as follows:

M S E (F_{c_{2}}) = {\bar{Y}}^{2} \{1 + α_{2}^{2} [1 + χ_{y}^{*} + χ_{x}^{*} + χ_{z}^{*} β_{2} γ_{2} (β_{2} γ_{2} + {(β}_{2} - 1) {(γ}_{2} + 1)) + 4 χ_{y x}^{*}] - 2 α_{2} [1 + χ_{y x}^{*} - β_{2} γ_{2} (χ_{y z}^{*} + χ_{x z}^{*} - \frac{1}{2} {(β}_{2} - 1) (γ_{2} + 1) χ_{z}^{*})]\}

(A6)

□

Proof of Theorem 3.

To obtain the expressions of the bias and MSE for

F_{c_{3}},

we need to transform the

F_{c_{3}}

in terms of

δ_{s t}^{*}

’s (by using Table 1) as follows:

F_{c_{3}} = {\bar{y}}_{s t}^{*} {(\frac{\bar{X}}{{\bar{x}}_{s t}^{*}})}^{α_{3}} {[1 - γ_{3} l o g (\frac{{\bar{z}}_{s t}^{*}}{\bar{Z}})]}^{β_{3}} = \bar{Y} (1 + δ_{0_{s t}}^{*}) (1 - δ_{1_{s t}}^{*} α_{3} + \frac{α_{3} (α_{3} + 1)}{2} δ_{1_{s t}}^{* 2}) {[1 - γ_{3} l o g ({1 + δ_{2_{s t}}^{*})}^{- 1}]}^{β_{3}}

(by using Table A1).

By deviating

\bar{Y}

on both sides of the above equation, we obtain the following:

F_{c_{3}} - \bar{Y} = \bar{Y} [δ_{0_{s t}}^{*} - α_{3} δ_{1_{s t}}^{*} - β_{3} γ_{3} δ_{2_{s t}}^{*} - α_{3} δ_{0_{s t}}^{*} δ_{1_{s t}}^{*} + α_{3} β_{3} δ_{1_{s t}}^{*} δ_{2_{s t}}^{*} - β_{3} γ_{3} {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} + \frac{α_{3} (α_{3} + 1)}{2} δ_{1_{s t}}^{* 2} + \frac{1}{2} β_{3} γ_{3} {(β}_{3} - 1) (γ_{3} + 1) δ_{2_{s t}}^{* 2}]

(A7)

By applying expectations on both sides of (A7), we acquire the bias expression as follows:

B i a s (F_{c_{3}}) = \bar{Y} \{- α_{3} χ_{y x}^{*} - β_{3} γ_{3} [χ_{y z}^{*} + α_{3} χ_{x z}^{*} - \frac{1}{2} {(β}_{3} - 1) (γ_{3} + 1) χ_{z}^{*}] + \frac{α_{3} (α_{3} + 1)}{2} χ_{x}^{*}\}

(A8)

On squaring both sides of (A7),

{(F_{c_{3}} - \bar{Y})}^{2} = {\bar{Y}}^{2} [δ_{0_{s t}}^{* 2} + α_{3}^{2} δ_{1_{s t}}^{* 2} + {β_{3}^{2} γ_{3}^{2} δ}_{2_{s t}}^{* 2} - 2 α_{3} {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*} - 2 β_{3} γ_{3} {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} + α_{3} β_{3} γ_{3} {δ_{1_{s t}}^{*} δ}_{2_{s t}}^{*}]

by applying expectation both sides of the above equation, we obtain the MSE as follows:

M S E (F_{c_{3}}) = {\bar{Y}}^{2} [χ_{y}^{*} + α_{3}^{2} χ_{x}^{*} + β_{3} γ_{3} (β_{3} γ_{3} χ_{z}^{*} - 2 χ_{y z}^{*} + 2 α_{3} χ_{x z}^{*}) - 2 α_{3} χ_{y x}^{*}]

(A9)

□

Proof of Theorem 4.

To obtain the expressions of the bias and MSE for

F_{c_{4}},

we need to transform the

F_{c_{4}}

in terms of

δ_{s t}^{*}

’s (by using Table 1) as follows:

F_{c_{4}} = {\bar{y}}_{s t}^{*} {(\frac{{\bar{x}}_{s t}^{*}}{\bar{X}})}^{α_{4}} {[1 - γ_{4} l o g (\frac{\bar{Z}}{{\bar{z}}_{s t}^{*}})]}^{β_{4}}

= \bar{Y} (1 + δ_{0_{s t}}^{*}) {(1 + δ_{1_{s t}}^{*})}^{α_{4}} [1 + β_{4} γ_{4} δ_{2_{s t}}^{*} + \frac{1}{2} β_{4} γ_{4} (β_{4} - 1) (γ_{4} - 1) δ_{2_{s t}}^{* 2}]

(from Table A1).

By deviating

\bar{Y}

on both sides of the above equation, we obtain the following:

F_{c_{4}} - \bar{Y} = \bar{Y} [δ_{0_{s t}}^{*} + α_{4} δ_{1_{s t}}^{*} + β_{4} γ_{4} δ_{2_{s t}}^{*} + α_{4} δ_{0_{s t}}^{*} δ_{1_{s t}}^{*} + α_{4} β_{4} γ_{4} δ_{1_{s t}}^{*} δ_{2_{s t}}^{*} + β_{4} γ_{4} {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*} + \frac{α_{4} (α_{4} - 1)}{2} δ_{1_{s t}}^{* 2} + \frac{1}{2} β_{4} γ_{4} {(β}_{4} - 1) (γ_{4} - 1) δ_{2_{s t}}^{* 2}]

(A10)

By applying expectations on both sides of (A10), we acquire the bias expression as follows:

B i a s (F_{c_{4}}) = \bar{Y} \{α_{4} χ_{y x}^{*} + β_{4} γ_{4} [{χ_{y z}^{*} + α}_{4} χ_{x z}^{*} + \frac{1}{2} {(β}_{4} - 1) (γ_{4} - 1) χ_{z}^{*}] + \frac{α_{4} (α_{4} - 1)}{2} χ_{x}^{*}\}

(A11)

On squaring both sides of (A10),

{(F_{c_{4}} - \bar{Y})}^{2} = {\bar{Y}}^{2} [δ_{0_{s t}}^{* 2} + α_{4}^{2} δ_{1_{s t}}^{* 2} + {β_{4}^{2} γ_{4}^{2} δ}_{2_{s t}}^{* 2} + 2 α_{4} {δ_{0_{s t}}^{*} δ}_{1_{s t}}^{*} + 2 α_{4} β_{3} γ_{3} δ_{1_{s t}}^{*} δ_{2_{s t}}^{*} + 2 β_{4} γ_{4} {δ_{0_{s t}}^{*} δ}_{2_{s t}}^{*}]

by applying expectation both sides, we obtain the MSE as follows:

{M S E (F_{c_{4}}) = \bar{Y}}^{2} [χ_{y}^{*} + α_{4}^{2} χ_{x}^{*} + β_{4} γ_{4} (β_{4} γ_{4} χ_{z}^{*} + 2 χ_{y z}^{*} + {2 α}_{4} χ_{x z}^{*}) + 2 α_{4} χ_{y x}^{*}]

(A12)

□

Table A3. Description of first population.

$Stratum (h)$		1	2	3	4	5	6
$N_{h}$		127	117	103	170	205	201
$n_{h}$		31	21	29	38	32	29
$S_{y h}$		883.84	644.92	1033.40	810.58	403.65	711.72
$S_{x h}$		30,486.70	15,180.77	27,549.69	18,218.93	8497.77	23,094.14
$S_{z h}$		555.58	365.46	612.95	458.03	260.85	397.05
${\bar{Y}}_{h}$		703.74	413	573.17	424.66	267.03	393.84
${\bar{X}}_{h}$		20,804.59	9211.79	14,309.30	9478.85	5569.95	12,997.59
${\bar{Z}}_{h}$		498.28	318.33	431.36	311.32	227.2	313.71
$ρ_{x y h}$		0.94	0.99	0.99	0.98	0.98	0.97
$ρ_{x z h}$		0.94	0.97	0.98	0.98	0.97	0.99
$ρ_{y z h}$		0.98	0.98	0.98	0.98	0.96	0.98
10%	$S_{y h (2)}$	510.57	386.77	1872.88	1603.30	264.19	497.84
	$S_{x h (2)}$	9446.93	9198.29	52,429.99	34,794.9	4972.56	12,485.10
	$S_{z h (2)}$	303.29	278.51	960.11	821.46	285.09	287.99
	$ρ_{x y h (2)}$	0.9961	0.9975	0.9998	0.9741	0.9855	0.9324
	$ρ_{x z h (2)}$	0.9931	0.997	0.9972	0.9912	0.985	0.9647
	$ρ_{y z h (2)}$	0.9931	0.9871	0.9972	0.9942	0.985	0.9647
20%	$S_{y h (2)}$	396.77	406.15	1654.40	1333.35	335.83	903.91
	$S_{x h (2)}$	7439.16	8880.46	4574.78	2219.30	6540.43	8411.44
	$S_{z h (2)}$	274.42	274.42	965.42	812.28	188.02	469.86
	$ρ_{x y h (2)}$	0.9954	0.994	0.995	0.9761	0.9879	0.9869
	$ρ_{x z h (2)}$	0.9897	0.9884	0.9789	0.9629	0.982	0.9869
	$ρ_{y z h (2)}$	0.9898	0.9798	0.9846	0.9918	0.9818	0.9874
30%	$S_{y h (2)}$	500.26	356.95	1 383.70	1 193.47	289.41	825.24
	$S_{x h (2)}$	14,017.99	8174.62	38,379.77	26,090.60	6511.32	21,571.95
	$S_{z h (2)}$	284.44	247.63	811.21	731.28	180.51	437.9
	$ρ_{x y h (2)}$	0.9963	0.9985	0.9957	0.9741	0.9903	0.9902
	$ρ_{x z h (2)}$	0.9937	0.9848	0.9771	0.965	0.9799	0.9901
	$ρ_{y z h (2)}$	0.9739	0.9793	0.9839	0.9904	0.9799	0.9829

Table A4. Description of wecond population.

$Stratum (h)$		1	2	3
$N_{h}$		21	34	26
$n_{h}$		06	04	02
$S_{y h}$		12.14	8.34	5.47
$S_{x h}$		76.71	31.94	49.55
$S_{z h}$		19.48	07.10	13.21
${\bar{Y}}_{h}$		37.55	37.25	26.39
${\bar{X}}_{h}$		116.57	93.00	26.39
${\bar{Z}}_{h}$		114.14	106.50	118.88
$ρ_{x y h}$		0.7914	0.8339	0.7696
$ρ_{x z h}$		0.9894	0.8820	0.9669
$ρ_{y z h}$		0.7781	0.6651	0.5935
10%	$S_{y h (2)}$	08.66	10.05	03.95
	$S_{x h (2)}$	42.14	13.28	74.22
	$S_{z h (2)}$	6.25	5.20	20.53
	$ρ_{x y h (2)}$	0.9997	0.9995	0.9840
	$ρ_{x z h (2)}$	0.9707	1.0000	0.9999
	$ρ_{y z h (2)}$	0.9649	0.9996	0.9819
20%	$S_{y h (2)}$	7.96	8.47	4.06
	$S_{x h (2)}$	36.50	25.82	59.32
	$S_{z h (2)}$	5.20	8.18	16.54
	$ρ_{x y h (2)}$	0.9905	0.8026	0.8601
	$ρ_{x z h (2)}$	0.9623	0.9858	0.9956
	$ρ_{y z h (2)}$	0.9297	0.8062	0.8112
30%	$S_{y h (2)}$	12.70	09.86	4.50
	$S_{x h (2)}$	37.69	24.02	52.26
	$S_{z h (2)}$	9.42	6.83	14.54
	$ρ_{x y h (2)}$	0.9288	0.8335	0.8275
	$ρ_{x z h (2)}$	0.9062	0.8859	0.9907
	$ρ_{y z h (2)}$	0.9696	0.5877	0.7542

References

Hansen, M.H.; Hurwitz, W.N. The Problem of Non-Response in Sample Surveys. J. Am. Stat. Assoc. 1946, 41, 517–529. [Google Scholar] [PubMed]
Chaudhary, M.K.; Singh, V.K.; Shukla, R.K. Combined-Type Family of Estimators of Population Mean in Stratified Random Sampling under Non-Response. J. Reliab. Stat. Stud. 2012, 5, 133–142. [Google Scholar]
Chaudhary, M.K.; Kumar, A. Estimating the Population Mean in Stratified Random Sampling Using Two-Phase Sampling in the Presence of Non-Response. World Appl. Sci. J. 2015, 33, 874–882. [Google Scholar] [CrossRef]
Rachokarn, T.; Lawson, N. An Efficient Family of Estimators for the Population Mean Using Auxiliary Information in the Presence of Missing Observations. AIP Conf. Proc. 2016, 1775, 030010. [Google Scholar] [CrossRef]
Rachokarn, T.; Lawson, N. An Efficient General Family of Estimators for Population Mean in the Presence of Non-Response. J. Math. Fund. Sci. 2017, 49, 283–293. [Google Scholar]
Onyeka, A.C.; Ogbumuo, D.T.; Izunobi, C. Estimation of Population Mean in Stratified Random Sampling When Using Auxiliary Information in the Presence of Non-Response. Far East J. Theor. Stat. 2019, 55, 151–167. [Google Scholar] [CrossRef]
Anieting, A. Two-Phase Stratified Sampling Estimator for Population Mean in the Presence of Nonresponse Using Single Auxiliary Variable. Math. J. Interdiscip. Sci. 2020, 8, 49–56. [Google Scholar] [CrossRef]
Yaqub, M.; Shabbir, J.; Gupta, S.N. Estimation of Population Mean Based on Dual Use of Auxiliary Information in Non-Response. Commun. Stat. Theory Methods 2017, 46, 12130–12151. [Google Scholar] [CrossRef]
Sanaullah, A.; Amin, M.N.; Hanif, M.; Koyuncu, N. Generalized Exponential-Type Estimators for Population Mean Taking Two Auxiliary Variables for Unknown Means in Stratified Sampling with Sub-Sampling the Non-Respondents. Int. J. Appl. Comput. Math. 2018, 4, 56. [Google Scholar] [CrossRef]
Shahzad, U.; Hanif, M.; Koyuncu, N.; Luengo, A.G. A Family of Ratio Estimators in Stratified Random Sampling Utilizing Auxiliary Attribute Alongside the Nonresponse Issue. J. Stat. Theory Appl. 2019, 18, 12–25. [Google Scholar] [CrossRef]
Singh, H.P.; Nigam, P. Efficient Method of Estimating the Finite Population Mean Based on Two Auxiliary Variables in the Presence of Non-Response under Stratified Sampling. J. Reliab. Stat. Stud. 2021, 14, 223–242. [Google Scholar] [CrossRef]
Almulhim, F.A.; Aljohani, H.M.; Aldallal, R.; Mustafa, M.S.; Alsolmi, M.M.; Elshenawy, A.; Alrashidi, A. Estimation of Finite Population Mean Using Dual Auxiliary Information under Non-Response with Simple Random Sampling. Alex. Eng. J. 2024, 100, 286–299. [Google Scholar] [CrossRef]
Zahid, E.; Shabbir, J. Estimation of Population Mean in the Presence of Measurement Error and Non-Response under Stratified Random Sampling. PLoS ONE 2018, 13, e0191572. [Google Scholar] [CrossRef]
Nderitu, C.W.; Imboga, H.; Wanjoya, A. Estimation of Finite Population Mean Using Ratio Estimator Based on Known Median of Auxiliary Variable in the Presence of Non-Response. Am. J. Theor. Appl. Stat. 2022, 11, 75–82. [Google Scholar]
Unal, C.; Kadilar, C. A New Population Mean Estimator under Non-Response Cases. J. Taibah Univ. Sci. 2022, 16, 111–119. [Google Scholar] [CrossRef]
Zahid, E.; Shabbir, J.; Alamri, O.A. A Generalized Class of Estimators for Sensitive Variable in the Presence of Measurement Error and Non-Response under Stratified Random Sampling. J. King Saud Univ. Sci. 2022, 34, 101741. [Google Scholar] [CrossRef]
Danish, F.; Rizvi, S.E.H.; Bouza, C. On Approximately Optimum Strata Boundaries Using Two Auxiliary Variables. Investig. Oper. 2020, 41, 445–461. [Google Scholar]
Danish, F.; Rizvi, S.E.H. Approximately Optimum Strata Boundaries for Two Concomitant Stratification Variables under Proportional Allocation. Stat. Transit. New Ser. 2021, 22, 19–40. [Google Scholar]
Kadılar, C.; Cıngı, H. Ratio Estimators Using Robust Regression. Hacet. J. Math. Stat. 2007, 36, 181–188. [Google Scholar]
Zaman, T.; Bulut, H. Modified Regression Estimators Using Robust Regression Methods and Covariance Matrices in Stratified Random Sampling. Commun. Stat. Theory Methods 2020, 49, 3407–3420. [Google Scholar] [CrossRef]
Zaman, T.; Dünder, E.; Audu, A.; Alilah, D.A.; Shahzad, U.; Hanif, M. Robust Regression-Ratio-Type Estimators of the Mean Utilizing Two Auxiliary Variables: A Simulation Study. Math. Probl. Eng. 2021, 2021, 6383927. [Google Scholar] [CrossRef]
Zaman, T.; Bulut, H.; Yadav, S.K. Robust Ratio-Type Estimators for Finite Population Mean in Simple Random Sampling: A Simulation Study. Concurr. Comput. Pract. Exp. 2022, 34, e7273. [Google Scholar] [CrossRef]
Zaman, T.; Bulut, H. An Efficient Family of Robust-Type Estimators for the Population Variance in Simple and Stratified Random Sampling. Commun. Stat. Theory Methods 2023, 52, 2610–2624. [Google Scholar]
Kocyigit, E.G. Using past sample means in exponential ratio and regression type estimators under a simple random sampling. Soft Comput. 2025, 29, 1389–1406. [Google Scholar]
Lakshmi, N.V.; Danish, F.; Alrasheedi, M. Enhanced estimation of finite population mean via power and log-transformed ratio estimators using an auxiliary variable in solar radiation data. J. Radiat. Res. Appl. Sci. 2025, 18, 101379. [Google Scholar]
Singh, G.N.; Pandey, M.K.; Bandyopadhyay, A.; Meetei, M.Z.; Zaagan, A.A.; Mahnashi, A.M.; Al-Marzouki, S. Estimation of Population Mean Using Calibrated Weights in Stratified Random Successive Sampling in Presence of Incomplete Data. J. Math. 2025, 2025, 6778010. [Google Scholar]
Patel, A.; Ray, B.K.; Garg, N. Generalized Calibration Estimator of Population Mean for Stratified Sampling in the Presence of Non-response. J. Indian Soc. Probab. Stat. 2025, 1–23. [Google Scholar] [CrossRef]
Saleem, I.; Sanaullah, A.; Hanif, M. A Generalized Class of Estimators for Estimating Population Mean in the Presence of Non-Response. J. Stat. Theory Appl. 2018, 17, 616–626. [Google Scholar] [CrossRef]
Cost of Living Index by City 2024 Mid-Year. Available online: https://www.numbeo.com/cost-of-living/rankings.jsp (accessed on 25 July 2024).
Solar UV Radiation. Available online: https://www.kaggle.com/datasets/ziya07/solar-uv-radiation (accessed on 1 February 2025).

Figure 1. MSE values for different NR rates for the combined families of estimators in the case that NR occurs in both study and auxiliary variables for first population. (a) MSE of

F_{c_{1}}

with increase of NR rates; (b) MSE of

F_{c_{2}}

across different NR rates; (c) MSE of

F_{c_{3}}

under various NR rates; (d) MSE of

F_{c_{4}}

at 10%, 20% and 30% of NR.

Figure 2. MSE values for different NR rates for the separate families of estimators in the case that NR occurs in both study and auxiliary variables for first population. (a) MSE of

F_{s_{1}}

with increase of NR rates; (b) MSE of

F_{s_{2}}

across different NR rates; (c) MSE of

F_{s_{3}}

under various NR rates; (d) MSE of

F_{s_{4}}

at 10%, 20% and 30% of NR.

Figure 3. Comparing combined-type estimators and separate-type estimators when NR occurs only in study variable for first population. (a) Comparison of MSEs of

T_{c_{1}}

and

T_{s_{1}}

with increase of NR rates; (b) Comparison of MSEs of

T_{c_{2}}

and

T_{s_{2}}

with rise of NR rates; (c) Comparison of MSEs of

T_{c_{3}}

and

T_{s_{3}}

with increase of NR rates; (d) Comparison of MSEs of

T_{c_{4}}

and

T_{s_{4}}

with rise of NR rates.

Figure 4. Trace plots depicting the bias values for various NR rates in the simulation study.

Figure 5. Trace plots depicting the MSE values for various NR rates in the simulation study.

Table 1. NR occurs in both study and auxiliary variables: first population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$F_{c_{1}}$	−86.87	8095.37	$F_{s_{1}}$	−15.22	384.79
		$F_{c_{2}}$	−45.14	16,709.43	$F_{s_{2}}$	−9.02	4862.22
		$F_{c_{3}}$	−1.87	675.12	$F_{s_{3}}$	−1.78	150.12
		$F_{c_{4}}$	1.75	5014.22	$F_{s_{4}}$	1.71	1071.23
	$g = 2.5$	$F_{c_{1}}$	−86.87	8117.63	$F_{s_{1}}$	−15.22	386.46
		$F_{c_{2}}$	−45.25	17,828.86	$F_{s_{2}}$	−9.05	5387.35
		$F_{c_{3}}$	−2.03	740.11	$F_{s_{3}}$	−1.83	156.01
		$F_{c_{4}}$	1.89	5417.19	$F_{s_{4}}$	1.75	1108.83
	$g = 3$	$F_{c_{1}}$	−86.87	8139.93	$F_{s_{1}}$	−15.22	388.15
		$F_{c_{2}}$	−45.36	18,948.29	$F_{s_{2}}$	−9.09	5989.87
		$F_{c_{3}}$	−2.19	805.10	$F_{s_{3}}$	−1.88	161.91
		$F_{c_{4}}$	2.03	5820.15	$F_{s_{4}}$	1.80	1147.57
20%	$g = 2$	$F_{c_{1}}$	−86.82	8157.32	$F_{s_{1}}$	−15.22	402.16
		$F_{c_{2}}$	−45.29	18,885.82	$F_{s_{2}}$	−9.14	7015.30
		$F_{c_{3}}$	−2.13	765.58	$F_{s_{3}}$	−2.33	194.22
		$F_{c_{4}}$	2.01	5759.47	$F_{s_{4}}$	2.27	1440.57
	$g = 2.5$	$F_{c_{1}}$	−86.79	8210.57	$F_{s_{1}}$	−15.23	412.61
		$F_{c_{2}}$	−45.46	21,093.44	$F_{s_{2}}$	−9.24	9166.30
		$F_{c_{3}}$	−2.43	875.80	$F_{s_{3}}$	−2.67	222.25
		$F_{c_{4}}$	2.28	6535.10	$F_{s_{4}}$	2.59	1675.79
	$g = 3$	$F_{c_{1}}$	−86.76	8263.86	$F_{s_{1}}$	−15.23	423.15
		$F_{c_{2}}$	−45.64	23,301.07	$F_{s_{2}}$	−9.34	11,909.23
		$F_{c_{3}}$	−2.72	986.01	$F_{s_{3}}$	−2.99	250.35
		$F_{c_{4}}$	2.55	7310.66	$F_{s_{4}}$	2.91	1922.58
30%	$g = 2$	$F_{c_{1}}$	−86.81	8176.98	$F_{s_{1}}$	−15.24	405.68
		$F_{c_{2}}$	−45.39	19,724.77	$F_{s_{2}}$	−9.30	7730.85
		$F_{c_{3}}$	−2.25	810.00	$F_{s_{3}}$	−2.51	214.92
		$F_{c_{4}}$	2.12	6050.79	$F_{s_{4}}$	2.40	1538.53
	$g = 2.5$	$F_{c_{1}}$	−86.78	8240.13	$F_{s_{1}}$	−15.25	418.17
		$F_{c_{2}}$	−45.63	22,351.87	$F_{s_{2}}$	−9.48	10,547.17
		$F_{c_{3}}$	−2.61	942.43	$F_{s_{3}}$	−2.92	253.35
		$F_{c_{4}}$	2.44	6972.04	$F_{s_{4}}$	2.80	1828.98
	$g = 3$	$F_{c_{1}}$	−86.74	8303.35	$F_{s_{1}}$	−15.26	430.92
		$F_{c_{2}}$	−45.86	24,978.97	$F_{s_{2}}$	−9.66	14,276.00
		$F_{c_{3}}$	−2.96	1074.86	$F_{s_{3}}$	−3.34	291.89
		$F_{c_{4}}$	2.76	7893.30	$F_{s_{4}}$	3.19	2136.34

Table 2. NR occurs only in study variable: first population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	0.44	2094.46	$T_{s_{1}}$	0.12	315.91
		$T_{c_{2}}$	−44.93	14,837.70	$T_{s_{2}}$	−8.94	4131.95
		$T_{c_{3}}$	−2.96	586.69	$T_{s_{3}}$	−3.11	66.95
		$T_{c_{4}}$	1.47	4661.54	$T_{s_{4}}$	1.62	1041.55
	$g = 2.5$	$T_{c_{1}}$	0.44	2321.09	$T_{s_{1}}$	0.12	332.25
		$T_{c_{2}}$	−44.93	15,021.27	$T_{s_{2}}$	−8.94	4193.40
		$T_{c_{3}}$	−2.96	813.31	$T_{s_{3}}$	−3.11	82.19
		$T_{c_{4}}$	1.47	4888.17	$T_{s_{4}}$	1.62	1063.17
	$g = 3$	$T_{c_{1}}$	0.44	2547.72	$T_{s_{1}}$	0.12	348.69
		$T_{c_{2}}$	−44.93	15,204.84	$T_{s_{2}}$	−8.94	4256.18
		$T_{c_{3}}$	−2.96	1039.94	$T_{s_{3}}$	−3.11	97.47
		$T_{c_{4}}$	1.47	5114.80	$T_{s_{4}}$	1.62	1085.12
20%	$g = 2$	$T_{c_{1}}$	0.44	2587.44	$T_{s_{1}}$	0.12	493.27
		$T_{c_{2}}$	−44.93	15,156.01	$T_{s_{2}}$	−8.94	4402.40
		$T_{c_{3}}$	−2.96	979.66	$T_{s_{3}}$	−3.11	237.45
		$T_{c_{4}}$	1.47	5879.66	$T_{s_{4}}$	1.62	1239.94
	$g = 2.5$	$T_{c_{1}}$	0.44	2910.55	$T_{s_{1}}$	0.12	600.12
		$T_{c_{2}}$	−44.93	15,498.74	$T_{s_{2}}$	−8.94	4607.71
		$T_{c_{3}}$	−2.96	1402.78	$T_{s_{3}}$	−3.11	339.19
		$T_{c_{4}}$	1.47	5477.63	$T_{s_{4}}$	1.62	1364.23
	$g = 3$	$T_{c_{1}}$	0.44	3333.67	$T_{s_{1}}$	0.12	708.47
		$T_{c_{2}}$	−44.93	15,841.46	$T_{s_{2}}$	−8.94	4820.49
		$T_{c_{3}}$	−2.96	1825.89	$T_{s_{3}}$	−3.11	441.96
		$T_{c_{4}}$	1.47	5900.75	$T_{s_{4}}$	1.62	1491.43
30%	$g = 2$	$T_{c_{1}}$	0.44	2649.19	$T_{s_{1}}$	0.12	545.86
		$T_{c_{2}}$	−44.93	15,287.03	$T_{s_{2}}$	−8.94	4489.25
		$T_{c_{3}}$	−2.96	1141.42	$T_{s_{3}}$	−3.11	287.94
		$T_{c_{4}}$	1.47	5916.27	$T_{s_{4}}$	1.62	1299.41
	$g = 2.5$	$T_{c_{1}}$	0.44	3153.18	$T_{s_{1}}$	0.12	679.90
		$T_{c_{2}}$	−44.93	15,695.27	$T_{s_{2}}$	−8.94	4742.26
		$T_{c_{3}}$	−2.96	1645.41	$T_{s_{3}}$	−3.11	415.51
		$T_{c_{4}}$	1.47	5920.27	$T_{s_{4}}$	1.62	1455.15
	$g = 3$	$T_{c_{1}}$	0.44	3657.18	$T_{s_{1}}$	0.12	816.15
		$T_{c_{2}}$	−44.93	16,103.50	$T_{s_{2}}$	−8.94	5005.83
		$T_{c_{3}}$	−2.96	2149.40	$T_{s_{3}}$	−3.11	544.61
		$T_{c_{4}}$	1.47	6224.26	$T_{s_{4}}$	1.62	1615.12

Table 3. NR occurs in both study and auxiliary variables: second population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$F_{c_{1}}$	−6.08	58.93	$F_{s_{1}}$	7.16	54.80
		$F_{c_{2}}$	−3.30	83.34	$F_{s_{2}}$	−0.35	188.90
		$F_{c_{3}}$	0.20	4.10	$F_{s_{3}}$	4.18	16.80
		$F_{c_{4}}$	−0.02	17.69	$F_{s_{4}}$	−1.37	23.86
	$g = 2.5$	$F_{c_{1}}$	−6.03	60.04	$F_{s_{1}}$	8.00	60.23
		$F_{c_{2}}$	−3.28	87.76	$F_{s_{2}}$	−0.35	207.78
		$F_{c_{3}}$	0.22	4.50	$F_{s_{3}}$	4.61	18.67
		$F_{c_{4}}$	−0.03	18.63	$F_{s_{4}}$	−1.51	25.94
	$g = 3$	$F_{c_{1}}$	−5.98	61.14	$F_{s_{1}}$	8.83	65.68
		$F_{c_{2}}$	−3.28	92.18	$F_{s_{2}}$	−0.35	226.87
		$F_{c_{3}}$	0.24	4.88	$F_{s_{3}}$	5.04	20.55
		$F_{c_{4}}$	−0.03	19.57	$F_{s_{4}}$	−1.66	28.04
20%	$g = 2$	$F_{c_{1}}$	−6.04	59.63	$F_{s_{1}}$	7.63	57.70
		$F_{c_{2}}$	−3.30	87.89	$F_{s_{2}}$	−0.33	203.50
		$F_{c_{3}}$	0.21	4.25	$F_{s_{3}}$	4.41	17.69
		$F_{c_{4}}$	−0.02	18.79	$F_{s_{4}}$	−1.44	25.36
	$g = 2.5$	$F_{c_{1}}$	−5.98	61.08	$F_{s_{1}}$	8.70	64.61
		$F_{c_{2}}$	−3.27	94.58	$F_{s_{2}}$	−0.32	230.53
		$F_{c_{3}}$	0.24	4.71	$F_{s_{3}}$	4.95	20.02
		$F_{c_{4}}$	−0.03	20.29	$F_{s_{4}}$	−1.62	28.22
	$g = 3$	$F_{c_{1}}$	−5.91	62.53	$F_{s_{1}}$	9.78	71.54
		$F_{c_{2}}$	−3.26	101.27	$F_{s_{2}}$	−0.31	258.36
		$F_{c_{3}}$	0.26	5.16	$F_{s_{3}}$	5.49	22.36
		$F_{c_{4}}$	−0.03	21.79	$F_{s_{4}}$	−1.81	31.13
30%	$g = 2$	$F_{c_{1}}$	−6.03	59.97	$F_{s_{1}}$	7.94	59.47
		$F_{c_{2}}$	−3.25	92.77	$F_{s_{2}}$	−0.28	215.86
		$F_{c_{3}}$	0.21	4.75	$F_{s_{3}}$	4.56	18.22
		$F_{c_{4}}$	−0.02	20.92	$F_{s_{4}}$	−1.48	27.03
	$g = 2.5$	$F_{c_{1}}$	−5.96	61.59	$F_{s_{1}}$	9.17	67.28
		$F_{c_{2}}$	−3.23	101.90	$F_{s_{2}}$	−0.24	250.03
		$F_{c_{3}}$	0.23	5.46	$F_{s_{3}}$	5.17	20.83
		$F_{c_{4}}$	−0.02	23.49	$F_{s_{4}}$	−1.69	30.81
	$g = 3$	$F_{c_{1}}$	−5.88	63.21	$F_{s_{1}}$	10.39	75.13
		$F_{c_{2}}$	−3.22	111.03	$F_{s_{2}}$	−0.21	285.63
		$F_{c_{3}}$	0.26	6.17	$F_{s_{3}}$	5.79	23.46
		$F_{c_{4}}$	−0.02	26.05	$F_{s_{4}}$	−1.89	34.68

Table 4. NR occurs only in study variable: second population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	0.67	34.37	$T_{s_{1}}$	12.09	117.35
		$T_{c_{2}}$	−3.28	75.00	$T_{s_{2}}$	−0.34	152.48
		$T_{c_{3}}$	0.30	8.90	$T_{s_{3}}$	5.74	34.92
		$T_{c_{4}}$	−0.01	16.41	$T_{s_{4}}$	−1.07	19.95
	$g = 2.5$	$T_{c_{1}}$	0.68	34.67	$T_{s_{1}}$	12.09	117.74
		$T_{c_{2}}$	−3.28	75.25	$T_{s_{2}}$	−0.34	152.85
		$T_{c_{3}}$	0.30	9.21	$T_{s_{3}}$	5.74	35.07
		$T_{c_{4}}$	−0.01	16.71	$T_{s_{4}}$	−1.07	20.06
	$g = 3$	$T_{c_{1}}$	0.68	34.98	$T_{s_{1}}$	12.10	118.14
		$T_{c_{2}}$	−3.28	75.50	$T_{s_{2}}$	−0.34	153.21
		$T_{c_{3}}$	0.30	9.51	$T_{s_{3}}$	5.74	35.21
		$T_{c_{4}}$	−0.01	17.02	$T_{s_{4}}$	−1.07	20.16
20%	$g = 2$	$T_{c_{1}}$	0.68	34.70	$T_{s_{1}}$	12.10	117.74
		$T_{c_{2}}$	−3.28	75.27	$T_{s_{2}}$	−0.34	152.85
		$T_{c_{3}}$	0.30	9.24	$T_{s_{3}}$	5.74	35.10
		$T_{c_{4}}$	−0.01	16.74	$T_{s_{4}}$	−1.07	20.10
	$g = 2.5$	$T_{c_{1}}$	0.68	35.18	$T_{s_{1}}$	12.10	118.33
		$T_{c_{2}}$	−3.28	75.66	$T_{s_{2}}$	−0.34	153.40
		$T_{c_{3}}$	0.30	9.71	$T_{s_{3}}$	5.74	35.34
		$T_{c_{4}}$	−0.02	17.21	$T_{s_{4}}$	−1.07	20.28
	$g = 3$	$T_{c_{1}}$	0.68	35.65	$T_{s_{1}}$	12.10	118.92
		$T_{c_{2}}$	−3.28	76.04	$T_{s_{2}}$	−0.34	153.94
		$T_{c_{3}}$	0.30	10.18	$T_{s_{3}}$	5.74	35.58
		$T_{c_{4}}$	−0.02	17.69	$T_{s_{4}}$	−1.07	20.45
30%	$g = 2$	$T_{c_{1}}$	0.68	35.90	$T_{s_{1}}$	12.10	118.93
		$T_{c_{2}}$	−3.28	76.24	$T_{s_{2}}$	−0.34	153.97
		$T_{c_{3}}$	0.30	10.43	$T_{s_{3}}$	5.74	35.56
		$T_{c_{4}}$	−0.02	17.94	$T_{s_{4}}$	−1.07	20.43
	$g = 2.5$	$T_{c_{1}}$	0.68	36.97	$T_{s_{1}}$	12.10	120.13
		$T_{c_{2}}$	−3.28	77.11	$T_{s_{2}}$	−0.35	155.90
		$T_{c_{3}}$	0.30	11.50	$T_{s_{3}}$	5.74	36.04
		$T_{c_{4}}$	−0.12	19.01	$T_{s_{4}}$	−1.07	20.77
	$g = 3$	$T_{c_{1}}$	0.68	38.04	$T_{s_{1}}$	12.09	121.33
		$T_{c_{2}}$	−3.28	77.98	$T_{s_{2}}$	−0.34	156.20
		$T_{c_{3}}$	0.30	12.57	$T_{s_{3}}$	5.74	36.52
		$T_{c_{4}}$	−0.02	20.08	$T_{s_{4}}$	−1.07	21.12

Table 5. NR occurs in both study and auxiliary variables: third population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$F_{c_{1}}$	−5.94	40.60	$F_{s_{1}}$	−0.35	5.04
		$F_{c_{2}}$	−3.21	40.02	$F_{s_{2}}$	−0.58	12.44
		$F_{c_{3}}$	−0.09	0.62	$F_{s_{3}}$	−0.18	0.25
		$F_{c_{4}}$	0.04	4.68	$F_{s_{4}}$	0.08	1.12
	$g = 2.5$	$F_{c_{1}}$	−5.99	42.53	$F_{s_{1}}$	$- 0.37$	8.16
		$F_{c_{2}}$	−3.29	54.68	$F_{s_{2}}$	−0.69	21.04
		$F_{c_{3}}$	−0.14	0.91	$F_{s_{3}}$	−0.27	0.38
		$F_{c_{4}}$	0.06	6.84	$F_{s_{4}}$	0.12	1.74
	$g = 3$	$F_{c_{1}}$	−6.10	44.55	$F_{s_{1}}$	−0.39	12.20
		$F_{c_{2}}$	−3.38	69.33	$F_{s_{2}}$	−0.80	31.39
		$F_{c_{3}}$	−0.18	1.20	$F_{s_{3}}$	−0.36	0.51
		$F_{c_{4}}$	0.08	9.01	$F_{s_{4}}$	0.16	2.41
20%	$g = 2$	$F_{c_{1}}$	−6.02	41.45	$F_{s_{1}}$	−0.41	5.01
		$F_{c_{2}}$	−3.31	60.97	$F_{s_{2}}$	−0.53	19.96
		$F_{c_{3}}$	−0.14	1.13	$F_{s_{3}}$	−0.17	0.39
		$F_{c_{4}}$	0.08	9.50	$F_{s_{4}}$	0.08	2.02
	$g = 2.5$	$F_{c_{1}}$	−6.04	43.98	$F_{s_{1}}$	−0.42	8.36
		$F_{c_{2}}$	−3.44	86.10	$F_{s_{2}}$	−0.60	36.93
		$F_{c_{3}}$	−0.21	1.67	$F_{s_{3}}$	−0.24	0.59
		$F_{c_{4}}$	0.11	14.07	$F_{s_{4}}$	0.13	3.29
	$g = 3$	$F_{c_{1}}$	−6.08	46.73	$F_{s_{1}}$	−0.49	12.94
		$F_{c_{2}}$	−3.58	111.23	$F_{s_{2}}$	−0.66	58.93
		$F_{c_{3}}$	−0.28	2.22	$F_{s_{3}}$	−0.32	0.80
		$F_{c_{4}}$	0.15	18.64	$F_{s_{4}}$	0.17	4.73
30%	$g = 2$	$F_{c_{1}}$	−6.10	46.10	$F_{s_{1}}$	−0.51	9.84
		$F_{c_{2}}$	−3.47	88.66	$F_{s_{2}}$	−0.67	32.42
		$F_{c_{3}}$	−0.23	1.53	$F_{s_{3}}$	−0.30	0.26
		$F_{c_{4}}$	0.12	13.30	$F_{s_{4}}$	0.12	1.27
	$g = 2.5$	$F_{c_{1}}$	−6.12	51.40	$F_{s_{1}}$	−0.53	17.84
		$F_{c_{2}}$	−3.68	127.63	$F_{s_{2}}$	−0.78	63.41
		$F_{c_{3}}$	−0.34	2.28	$F_{s_{3}}$	−0.44	0.39
		$F_{c_{4}}$	0.17	19.78	$F_{s_{4}}$	0.19	2.14
	$g = 3$	$F_{c_{1}}$	−6.16	57.22	$F_{s_{1}}$	−0.56	29.21
		$F_{c_{2}}$	−3.89	166.60	$F_{s_{2}}$	−0.87	105.18
		$F_{c_{3}}$	−0.46	3.03	$F_{s_{3}}$	−0.58	0.53
		$F_{c_{4}}$	0.23	26.25	$F_{s_{4}}$	0.25	3.18

Table 6. NR occurs only in study variable: third population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	0.01	2.54	$T_{s_{1}}$	0.01	0.57
		$T_{c_{2}}$	−3.05	12.35	$T_{s_{2}}$	−0.35	0.83
		$T_{c_{3}}$	−0.01	2.14	$T_{s_{3}}$	−0.01	0.54
		$T_{c_{4}}$	0.01	2.50	$T_{s_{4}}$	0.01	0.55
	$g = 2.5$	$T_{c_{1}}$	0.01	3.62	$T_{s_{1}}$	0.01	0.86
		$T_{c_{2}}$	−3.06	13.22	$T_{s_{2}}$	−0.35	1.09
		$T_{c_{3}}$	−0.01	3.21	$T_{s_{3}}$	−0.01	0.83
		$T_{c_{4}}$	0.02	3.56	$T_{s_{4}}$	0.01	0.84
	$g = 3$	$T_{c_{1}}$	0.01	4.68	$T_{s_{1}}$	0.01	1.16
		$T_{c_{2}}$	−3.07	14.08	$T_{s_{2}}$	−0.36	1.36
		$T_{c_{3}}$	−0.01	4.28	$T_{s_{3}}$	−0.01	1.13
		$T_{c_{4}}$	0.02	4.62	$T_{s_{4}}$	0.01	1.15
20%	$g = 2$	$T_{c_{1}}$	0.01	5.04	$T_{s_{1}}$	0.01	0.99
		$T_{c_{2}}$	−3.08	14.37	$T_{s_{2}}$	−0.35	1.21
		$T_{c_{3}}$	−0.01	4.63	$T_{s_{3}}$	−0.01	0.96
		$T_{c_{4}}$	0.01	4.98	$T_{s_{4}}$	0.01	0.98
	$g = 2.5$	$T_{c_{1}}$	0.01	7.36	$T_{s_{1}}$	0.01	1.55
		$T_{c_{2}}$	−3.10	16.25	$T_{s_{2}}$	−0.36	1.70
		$T_{c_{3}}$	−0.01	6.95	$T_{s_{3}}$	−0.01	1.52
		$T_{c_{4}}$	0.01	7.30	$T_{s_{4}}$	0.01	1.54
	$g = 3$	$T_{c_{1}}$	0.01	9.67	$T_{s_{1}}$	0.01	2.16
		$T_{c_{2}}$	−3.12	18.12	$T_{s_{2}}$	−0.38	2.22
		$T_{c_{3}}$	−0.01	9.26	$T_{s_{3}}$	−0.01	2.12
		$T_{c_{4}}$	0.01	9.61	$T_{s_{4}}$	0.01	2.14
30%	$g = 2$	$T_{c_{1}}$	0.01	6.79	$T_{s_{1}}$	0.01	0.47
		$T_{c_{2}}$	−3.14	15.80	$T_{s_{2}}$	−0.36	0.76
		$T_{c_{3}}$	−0.01	6.38	$T_{s_{3}}$	−0.01	0.44
		$T_{c_{4}}$	0.01	6.73	$T_{s_{4}}$	0.01	0.46
	$g = 2.5$	$T_{c_{1}}$	0.01	9.98	$T_{s_{1}}$	0.01	0.74
		$T_{c_{2}}$	−3.16	18.37	$T_{s_{2}}$	−0.38	1.01
		$T_{c_{3}}$	−0.01	9.57	$T_{s_{3}}$	−0.01	0.71
		$T_{c_{4}}$	0.01	9.92	$T_{s_{4}}$	0.01	0.73
	$g = 3$	$T_{c_{1}}$	0.01	13.17	$T_{s_{1}}$	0.01	1.05
		$T_{c_{2}}$	−3.18	20.96	$T_{s_{2}}$	−0.40	1.28
		$T_{c_{3}}$	−0.01	12.76	$T_{s_{3}}$	−0.01	1.02
		$T_{c_{4}}$	0.01	13.11	$T_{s_{4}}$	0.01	1.04

Table 7. NR occurs in both study and auxiliary variables: fourth population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	−108.48	1,178,992	$T_{s_{1}}$	−29.19	50,811,719
		$T_{c_{2}}$	−57.27	1,484,842	$T_{s_{2}}$	−19.56	100,199,820
		$T_{c_{3}}$	−1.84	1,818,392	$T_{s_{3}}$	−2.06	182,454,545
		$T_{c_{4}}$	0.63	1,818,310	$T_{s_{4}}$	0.75	182,426,899
	$g = 2.5$	$T_{c_{1}}$	−108.14	1,762,392	$T_{s_{1}}$	−28.68	164,367,045
		$T_{c_{2}}$	−58.51	2,225,550	$T_{s_{2}}$	−20.98	327,638,612
		$T_{c_{3}}$	−2.84	2,727,461	$T_{s_{3}}$	−3.15	600,552,403
		$T_{c_{4}}$	0.97	2,727,349	$T_{s_{4}}$	1.15	600,466,091
	$g = 3$	$T_{c_{1}}$	−107.81	2,345,809	$T_{s_{1}}$	−28.17	381,498,749
		$T_{c_{2}}$	−59.75	2,966,259	$T_{s_{2}}$	−22.39	764,503,302
		$T_{c_{3}}$	−3.84	3,636,530	$T_{s_{3}}$	−4.23	1,405,916,555
		$T_{c_{4}}$	1.31	3,636,388	$T_{s_{4}}$	1.56	1,405,720,104
20%	$g = 2$	$T_{c_{1}}$	−107.36	2,399,597	$T_{s_{1}}$	−27.99	408,219,322
		$T_{c_{2}}$	−60.42	3,035,665	$T_{s_{2}}$	−22.68	819,321,760
		$T_{c_{3}}$	−4.02	3,720,971	$T_{s_{3}}$	−4.34	1,506,445,624
		$T_{c_{4}}$	1.40	3,721,078	$T_{s_{4}}$	1.63	1,506,532,751
	$g = 2.5$	$T_{c_{1}}$	−106.46	3,593,439	$T_{s_{1}}$	−26.87	1,350,803,822
		$T_{c_{2}}$	−63.24	4,551,786	$T_{s_{2}}$	−25.60	2,724,260,768
		$T_{c_{3}}$	−6.11	5,581,330	$T_{s_{3}}$	−6.53	5,024,624,948
		$T_{c_{4}}$	2.13	5,581,502	$T_{s_{4}}$	2.48	5,024,940,823
	$g = 3$	$T_{c_{1}}$	−105.56	4,787,392	$T_{s_{1}}$	−25.75	3,171,249,140
		$T_{c_{2}}$	−66.05	6,067,906	$T_{s_{2}}$	−28.47	6,409,751,068
		$T_{c_{3}}$	−8.19	7,441,689	$T_{s_{3}}$	−8.67	11,840,483,518
		$T_{c_{4}}$	2.87	7,441,925	$T_{s_{4}}$	3.34	11,841,257,886
30%	$g = 2$	$T_{c_{1}}$	−106.34	3,680,702	$T_{s_{1}}$	−27.07	1,454,097,664
		$T_{c_{2}}$	−63.29	4,662,315	$T_{s_{2}}$	−25.22	2,934,501,665
		$T_{c_{3}}$	−6.00	5,718,268	$T_{s_{3}}$	−6.29	5,417,089,085
		$T_{c_{4}}$	2.11	5,718,534	$T_{s_{4}}$	2.39	5,417,712,678
	$g = 2.5$	$T_{c_{1}}$	−104.93	5,515,322	$T_{s_{1}}$	−25.48	4,849,644,828
		$T_{c_{2}}$	−67.54	6,991,760	$T_{s_{2}}$	−29.29	9,813,156,971
		$T_{c_{3}}$	−9.07	8,577,276	$T_{s_{3}}$	−9.38	18,150,249,032
		$T_{c_{4}}$	3.19	8,577,685	$T_{s_{4}}$	3.63	18,152,396,990
	$g = 3$	$T_{c_{1}}$	−103.52	7,350,202	$T_{s_{1}}$	−23.87	11,431,154,565
		$T_{c_{2}}$	−71.79	9,321,206	$T_{s_{2}}$	−33.24	23,154,407,818
		$T_{c_{3}}$	−12.14	11,436,283	$T_{s_{3}}$	−12.38	42,867,290,974
		$T_{c_{4}}$	4.27	11,436,836	$T_{s_{4}}$	4.89	42,872,433,423

Table 8. NR occurs only in study variable: fourth population.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	0.55	1,818,413	$T_{s_{1}}$	0.53	182,458,420
		$T_{c_{2}}$	−54.79	1,475,901	$T_{s_{2}}$	−16.68	98,883,524
		$T_{c_{3}}$	0.26	1,818,195	$T_{s_{3}}$	0.25	182,394,274
		$T_{c_{4}}$	−0.06	1,818,111	$T_{s_{4}}$	−0.06	182,369,395
	$g = 2.5$	$T_{c_{1}}$	0.55	2,727,353	$T_{s_{1}}$	0.53	600,472,655
		$T_{c_{2}}$	−54.79	2,212,142	$T_{s_{2}}$	−16.68	323,284,693
		$T_{c_{3}}$	0.26	2,727,135	$T_{s_{3}}$	0.25	600,330,586
		$T_{c_{4}}$	−0.06	2,727,050	$T_{s_{4}}$	−0.06	600,275,485
	$g = 3$	$T_{c_{1}}$	0.55	3,636,292	$T_{s_{1}}$	0.53	1,405,620,109
		$T_{c_{2}}$	−54.79	2,948,383	$T_{s_{2}}$	−16.68	754,286,088
		$T_{c_{3}}$	0.26	3,636,075	$T_{s_{3}}$	0.25	1,405,369,526
		$T_{c_{4}}$	−0.06	3,635,990	$T_{s_{4}}$	−0.06	1,405,272,341
20%	$g = 2$	$T_{c_{1}}$	0.55	3,720,887	$T_{s_{1}}$	0.53	1,506,317,194
		$T_{c_{2}}$	−54.79	3,016,906	$T_{s_{2}}$	−16.68	807,973,503
		$T_{c_{3}}$	0.26	3,720,670	$T_{s_{3}}$	0.25	1,506,054,376
		$T_{c_{4}}$	−0.06	3,720,586	$T_{s_{4}}$	−0.05	1,505,953,106
	$g = 2.5$	$T_{c_{1}}$	0.55	5,581,064	$T_{s_{1}}$	0.53	5,023,813,395
		$T_{c_{2}}$	−54.79	4,523,650	$T_{s_{2}}$	−16.68	2,686,324,032
		$T_{c_{3}}$	0.26	5,580,847	$T_{s_{3}}$	0.25	5,023,226,535
		$T_{c_{4}}$	−0.06	5,580,763	$T_{s_{4}}$	−0.05	5.02 × 10⁹
	$g = 3$	$T_{c_{1}}$	0.55	7,441,242	$T_{s_{1}}$	0.53	11,838,116,873
		$T_{c_{2}}$	−54.79	6,030,393	$T_{s_{2}}$	−16.68	6,320,254,203
		$T_{c_{3}}$	0.26	7,441,024	$T_{s_{3}}$	0.25	11,837,077,522
		$T_{c_{4}}$	−0.06	7,440,940	$T_{s_{4}}$	−0.05	11,836,677,033
30%	$g = 2$	$T_{c_{1}}$	0.55	5,718,081	$T_{s_{1}}$	0.53	5,416,475,894
		$T_{c_{2}}$	−54.79	4,634,633	$T_{s_{2}}$	−16.68	2,895,122,033
		$T_{c_{3}}$	0.26	5,717,863	$T_{s_{3}}$	0.25	5,415,857,716
		$T_{c_{4}}$	−0.06	5,717,779	$T_{s_{4}}$	−0.06	5,415,619,323
	$g = 2.5$	$T_{c_{1}}$	0.55	8,576,855	$T_{s_{1}}$	0.53	18,147,287,412
		$T_{c_{2}}$	−54.79	6,950,240	$T_{s_{2}}$	−16.68	9,681,054,564
		$T_{c_{3}}$	0.26	8,576,637	$T_{s_{3}}$	0.25	18,145,903,210
		$T_{c_{4}}$	−0.06	8,576,553	$T_{s_{4}}$	−0.06	18,145,369,405
	$g = 3$	$T_{c_{1}}$	0.55	11,435,629	$T_{s_{1}}$	0.53	42,859,221,223
		$T_{c_{2}}$	−54.79	9,265,847	$T_{s_{2}}$	−16.68	22,842,223,498
		$T_{c_{3}}$	0.26	11,435,411	$T_{s_{3}}$	0.25	42,856,766,348
		$T_{c_{4}}$	−0.06	11,435,327	$T_{s_{4}}$	−0.06	42,855,819,645

Table 9. NR occurs in both study and auxiliary variables: simulation study.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$F_{c_{1}}$	−99.37	11,462.40	$F_{s_{1}}$	−84.13	14,280.74
		$F_{c_{2}}$	−51.86	21,585.80	$F_{s_{2}}$	−61.69	24,029.11
		$F_{c_{3}}$	−2.17	2604.27	$F_{s_{3}}$	−9.73	2627.21
		$F_{c_{4}}$	1.89	7838.75	$F_{s_{4}}$	10.84	8420.32
	$g = 2.5$	$F_{c_{1}}$	−99.16	11,625.62	$F_{s_{1}}$	−81.55	14,580.20
		$F_{c_{2}}$	−51.96	23,621.92	$F_{s_{2}}$	−62.77	26,668.49
		$F_{c_{3}}$	−2.32	2804.80	$F_{s_{3}}$	−10.21	2863.64
		$F_{c_{4}}$	2.07	8587.78	$F_{s_{4}}$	11.82	9277.22
	$g = 3$	$F_{c_{1}}$	−98.96	11,788.81	$F_{s_{1}}$	−77.56	14,946.17
		$F_{c_{2}}$	−52.06	25,658.04	$F_{s_{2}}$	−63.42	29,582.64
		$F_{c_{3}}$	−2.46	3005.32	$F_{s_{3}}$	−10.22	3096.01
		$F_{c_{4}}$	2.24	9336.81	$F_{s_{4}}$	12.67	10,157.83
20%	$g = 2$	$F_{c_{1}}$	−98.64	12,022.85	$F_{s_{1}}$	−77.63	14,901.42
		$F_{c_{2}}$	−52.03	27,944.70	$F_{s_{2}}$	−63.07	29,421.31
		$F_{c_{3}}$	−2.74	3646.18	$F_{s_{3}}$	−10.38	3679.22
		$F_{c_{4}}$	2.47	10,582.09	$F_{s_{4}}$	12.77	11,124.93
	$g = 2.5$	$F_{c_{1}}$	−98.19	12,490.81	$F_{s_{1}}$	−71.18	15,532.90
		$F_{c_{2}}$	−52.30	33,210.38	$F_{s_{2}}$	−64.62	34,754.21
		$F_{c_{3}}$	−3.18	4369.76	$F_{s_{3}}$	−10.86	4533.10
		$F_{c_{4}}$	2.93	12,714.47	$F_{s_{4}}$	14.52	12,797.77
	$g = 3$	$F_{c_{1}}$	−97.74	12,959.26	$F_{s_{1}}$	−64.56	13,191.06
		$F_{c_{2}}$	−52.58	38,476.07	$F_{s_{2}}$	−65.94	40,223.23
		$F_{c_{3}}$	−3.62	5093.33	$F_{s_{3}}$	−11.09	5959.75
		$F_{c_{4}}$	3.40	14,846.84	$F_{s_{4}}$	16.25	14,968.57
30%	$g = 2$	$F_{c_{1}}$	−98.00	13,207.60	$F_{s_{1}}$	−71.38	15,555.58
		$F_{c_{2}}$	−53.16	37,975.07	$F_{s_{2}}$	−64.88	38,885.40
		$F_{c_{3}}$	−3.70	5568.62	$F_{s_{3}}$	−10.88	6540.35
		$F_{c_{4}}$	3.34	14,982.56	$F_{s_{4}}$	14.55	15,830.15
	$g = 2.5$	$F_{c_{1}}$	−97.16	14,252.70	$F_{s_{1}}$	−61.75	16,418.48
		$F_{c_{2}}$	−53.96	48,208.97	$F_{s_{2}}$	−66.53	49,618.41
		$F_{c_{3}}$	−4.62	7252.00	$F_{s_{3}}$	−11.71	7298.92
		$F_{c_{4}}$	4.24	19,304.24	$F_{s_{4}}$	17.28	20,287.76
	$g = 3$	$F_{c_{1}}$	−96.32	15,302.16	$F_{s_{1}}$	−51.88	17,325.78
		$F_{c_{2}}$	−54.76	58,442.88	$F_{s_{2}}$	−68.52	59,654.11
		$F_{c_{3}}$	−5.54	8935.37	$F_{s_{3}}$	−12.41	9899.75
		$F_{c_{4}}$	5.13	23,625.93	$F_{s_{4}}$	19.98	26,830.34

Table 10. NR occurs in only study variable: simulation study.

NR		Estimator	Combined Type		Estimator	Separate Type
NR		Estimator	Bias	MSE	Estimator	Bias	MSE
10%	$g = 2$	$T_{c_{1}}$	0.83	3095.51	$T_{s_{1}}$	0.66	6005.05
		$T_{c_{2}}$	−51.65	18,226.08	$T_{s_{2}}$	−41.46	21,084.33
		$T_{c_{3}}$	−6.89	2490.08	$T_{s_{3}}$	−13.92	2510.18
		$T_{c_{4}}$	1.55	7220.33	$T_{s_{4}}$	10.89	7889.09
	$g = 2.5$	$T_{c_{1}}$	0.83	3535.33	$T_{s_{1}}$	0.62	6432.76
		$T_{c_{2}}$	−51.65	18,582.33	$T_{s_{2}}$	−42.65	21,386.90
		$T_{c_{3}}$	−6.89	2929.90	$T_{s_{3}}$	−14.90	2996.04
		$T_{c_{4}}$	1.55	7660.15	$T_{s_{4}}$	11.08	8208.33
	$g = 3$	$T_{c_{1}}$	0.83	3975.15	$T_{s_{1}}$	0.58	6985.89
		$T_{c_{2}}$	−51.65	18,938.58	$T_{s_{2}}$	−15.62	21,569.48
		$T_{c_{3}}$	−6.89	3369.72	$T_{s_{3}}$	−14.96	3380.67
		$T_{c_{4}}$	1.55	8099.97	$T_{s_{4}}$	12.68	8670.03
20%	$g = 2$	$T_{c_{1}}$	−51.49	4915.58	$T_{s_{1}}$	0.62	7264.21
		$T_{c_{2}}$	6.87	19,613.14	$T_{s_{2}}$	−43.48	21,039.72
		$T_{c_{3}}$	1.54	4317.75	$T_{s_{3}}$	−14.55	4398.77
		$T_{c_{4}}$	0.84	9033.16	$T_{s_{4}}$	12.79	10,670.03
	$g = 2.5$	$T_{c_{1}}$	0.81	6273.49	$T_{s_{1}}$	0.54	7598.65
		$T_{c_{2}}$	−51.49	20,713.05	$T_{s_{2}}$	−44.90	21,112.33
		$T_{c_{3}}$	−6.87	5675.66	$T_{s_{3}}$	−15.30	5890.26
		$T_{c_{4}}$	1.54	10,391.07	$T_{s_{4}}$	13.04	10,876.35
	$g = 3$	$T_{c_{1}}$	0.81	7631.39	$T_{s_{1}}$	0.46	8784.58
		$T_{c_{2}}$	−51.49	21,812.95	$T_{s_{2}}$	−46.51	21,563.02
		$T_{c_{3}}$	−6.87	7033.57	$T_{s_{3}}$	−15.99	7088.90
		$T_{c_{4}}$	1.54	11,748.98	$T_{s_{4}}$	13.50	11,789.07
30%	$g = 2$	$T_{c_{1}}$	0.82	8006.62	$T_{s_{1}}$	0.60	9012.43
		$T_{c_{2}}$	−51.57	22,184.82	$T_{s_{2}}$	−44.01	23,224.42
		$T_{c_{3}}$	−6.87	7398.69	$T_{s_{3}}$	−15.76	7648.04
		$T_{c_{4}}$	1.54	12,113.96	$T_{s_{4}}$	13.07	12,055.13
	$g = 2.5$	$T_{c_{1}}$	0.82	10,894.01	$T_{s_{1}}$	0.52	11,028.41
		$T_{c_{2}}$	−51.57	29,523.61	$T_{s_{2}}$	−45.53	31,358.95
		$T_{c_{3}}$	−6.87	10,286.07	$T_{s_{3}}$	−16.49	10,762.10
		$T_{c_{4}}$	1.54	15,001.34	$T_{s_{4}}$	13.95	15,856.02
	$g = 3$	$T_{c_{1}}$	0.82	13,781.40	$T_{s_{1}}$	0.54	14,638.69
		$T_{c_{2}}$	−51.57	26,862.39	$T_{s_{2}}$	−45.09	33,986.01
		$T_{c_{3}}$	−6.87	13,173.46	$T_{s_{3}}$	−16.84	13,703.60
		$T_{c_{4}}$	1.54	17,888.73	$T_{s_{4}}$	14.11	17,909.84

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Application of Log-Type Estimators for Addressing Non-Response in Survey Sampling Using Real Datasets

Abstract

1. Introduction

Procedure for Estimation of Population Mean When NR Occurs

2. The Proposed Classes of Log-Type Estimators

2.1. Non-Response Occurs in Both Study and Auxiliary Variables: Combined Log-Type Estimators

2.2. Non-Response Occurs in Both Study and Auxiliary Variables: Separate Log-Type Estimators

2.3. Non-Response Occurs Only in Study Variable: Combined Log-Type Estimators

2.4. Non-Response Occurs Only on Study Variable: Separate Log-Type Estimators

3. Empirical Study

3.1. First Population

3.2. Second Population

3.3. Third Population

3.4. Fourth Population

4. Simulation Study

5. Results and Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics