Next Article in Journal
The Generalized Symmetric Non-Metric Connection and Its Applications to ∗-Conformal η-Ricci–Yamabe Solitons on α-Cosymplectic Manifolds
Previous Article in Journal
Element-Oriented Construction Methods for Nullnorms on Bounded Lattices
Previous Article in Special Issue
An Exponentiated Inverse Exponential Distribution Properties and Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statistical Formulation for Average Parameter Determination via Quantile-Linked Auxiliary Characteristics

by
Huda M. Alshanbari
1,
Malik Muhammad Anas
2,* and
Soofia Iftikhar
3
1
Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
2
Department of Economics and Statistics, University of Salerno, Fisciano, 84084 Salerno, Italy
3
Department of Statistics, Shaheed Benazir Bhutto Women University, Peshawar 25000, Pakistan
*
Author to whom correspondence should be addressed.
Axioms 2025, 14(12), 857; https://doi.org/10.3390/axioms14120857 (registering DOI)
Submission received: 15 October 2025 / Revised: 14 November 2025 / Accepted: 20 November 2025 / Published: 23 November 2025
(This article belongs to the Special Issue Probability, Statistics and Estimations, 2nd Edition)

Abstract

This article suggests a brief and integrated methodology for enhancing the mean estimation in a finite population using the pre-specified median of the target variable in conjunction with quantile-related auxiliary measurements. Although a number of conventional ratio and regression estimators are reliant on the average of the auxiliary variable, the current study presents new exponential-type families that combine the variable’s known median under investigation with the dispersion-related characteristics of supplementary information. The estimators are constructed and analyzed theoretically, under Simple Random Sampling Without Replacement (SRSWOR), and the minimum expression of the MSE under optimal conditions is obtained. The results are confirmed using simulated Kumaraswamy–Gamma populations and through several real-world datasets, such as those on education, wheat production, U.S. cereal consumption, and solar radiation data (HI-SEAS weather station, September–December 2016). The findings consistently demonstrate that the developed estimators provide a substantially lower MSE and greater percentage relative efficiency (PRE) relative to conventional estimators. These applications indicate that the median-based unified framework can be used to give more precise and efficient mean estimation in several fields, comprising agriculture, nutrition, and education, as well as meteorological and environmental research.

1. Introduction

Sample surveys are very fundamental in research since they offer a convenient and useful method of gathering representative data from large groups. There are multiple methods of carrying out surveys, such as online questionnaires, face-to-face interviews, group surveys, or mailed or phone surveys. They are applied extensively in the various fields of education, healthcare, labor markets, business, agriculture, radiation, environmental studies, and energy research. The effectiveness of a survey’s data collection approach has a significant impact on its efficacy. Ill-conceived research can lead to an erroneous finding, but a well-crafted survey that employs the best statistical tools can lead to valid and solid findings. Adding more information to survey-based estimates in the form of auxiliary variables is one of the strategies that can be used to strengthen the survey-based estimates. To enhance the accuracy of estimates of study parameters, the addition of auxiliary variables to estimation procedures can go a long way (Alomair and Shahzad [1]).
Regression, ratio, or modified versions of these estimators are typically utilized when there is a favorable association between the supplementary variable X and the research variable Y. Under SRSWOR, these methods consistently outperform the conventional ones. Besides these methods, utilizing the median information regarding the research parameter provides yet another dimension in terms of the construction of efficient estimators. Using the known median in ratio-type or regression-type estimators, the precision of estimation in means can be improved further.
The arithmetic mean is one of the most popular summary measures in the scientific, social, and applied world. Due to the fact that mean estimation is the centerpiece of sampling theory, there has been a continuous effort to enhance its accuracy. Ratio and regression-based techniques have been extended to many studies. For example, Chutiman et al. [2] used linear transformations of auxiliary variables in layered sampling, Alomair and Shahzad [3] created modified regression estimators in the framework of median ranked set sampling, and Koc and Koc [4] developed more efficient mean estimators in the context of simple random sampling using quantile regression coefficients. Despite this development, the majority of current techniques mainly depend on the average of the auxiliary variables, and there has not been much research devoted to exploring the possibility of using the pre-specified median of the target variable as a basis for achieving efficiency gains. To fill this gap, it is essential to have a single framework that incorporates information such as the median along with supplementary variables to enhance the estimation of a population’s true mean value.
In this paper, we focus on the use of supplementary information, specifically, the pre-specified median of the target variable and the average of an auxiliary variable, to derive an innovative family of ratio exponential estimators. A known population median is often assumed in survey situations; median values are frequently available from previous censuses or administrative reports or summaries of categorized results. This is also feasible because in many cases, it is possible to identify the median without having the entire dataset, as it is sufficient to have grouped or frequency data. This assumption provides a unified theoretical framework for the proposed estimators that are mathematically tractable and allow for closed-form representations of bias and mean squared error (MSE). In order to illustrate realistic situations in which such an assumption applies, several applied cases are given below:
  • In a university exam, the task was to estimate the average marks of 5000 students. The median marks, which fell between 60 and 75, were the only category provided.
  • For 800 university faculty members, the aim was to estimate the average salary. The median salary (RMB 10,000) was known from categorized payroll bands provided by the HR department.
  • In a hospital-based BMI survey of 350 patients, the median BMI (21.75) was known from categories ranging from underweight to very severely obese.
  • In a blood pressure monitoring program with 202 patients, the median systolic pressure (104.5 mmHg) was assumed to be known from medical categories such as hypotension and hypertensive emergency.
  • In a farming area, 1000 farmers produce wheat, and their wheat is pooled according to the number of quintals produced per acre. Government survey results indicate that the average yield per acre is 12 quintals. This is why calculating the median value helps estimate the yield per acre more accurately.
  • A housing office of a city keeps records of 2500 houses pooled by valuation in the city. The median house price (USD 350,000) is made publicly available in the summaries of real estate data and can be used as a known value to estimate the average price.
  • The categorical Air Quality Index (AQI) classes are frequently reported in air quality monitoring at 500 stations. The median AQI value obtained using these categories provides a good known value to estimate the mean levels of the AQI.
  • As part of an income distribution study conducted on 3000 workers, administrative data categorizes workers by monthly income levels. A good example of known information, which can effectively be used to estimate average income levels, is the median income (USD 4000), which can be obtained either by conducting employment surveys or using payroll data.
Hence, the current paper constructs such a framework by creating new types of exponential ratio estimators that explicitly make use of the pre-specified median of the target variable as well as raw dispersion measures of auxiliary variables. The reason behind the motivation is that even though means or variances are frequently simpler to compute in practice than medians, they are not always available to the researcher, especially when the data are coded into groups or categories. In order to illustrate the usefulness of the constructed estimators, we provide examples of their applications in various areas, such as education, healthcare, agriculture, and nutrition, plus a real-life application to meteorological data on solar radiation collected at the HI-SEAS weather station. In the second usage, solar radiation (W/m2) is considered the response variable, and temperature (°F) is considered supplementary information. This not only emphasizes the generality of the new method, but it also serves to indicate its relevance to pressing environmental and energy-related studies.
The subsequent sections of this article are arranged as follows: Section 2 surveys the related literature on mean estimation using auxiliary information and identifies key gaps motivating this study. The Quantile-Linked Measures of Dispersion and Adapted Family Section introduces an adapted class of ratio exponential estimators based on pre-specified medians and auxiliary dispersion measures such as the interquartile range, semi-interquartile range, mid-range, and interdecile range. Section 3 extends this to a generalized exponential-type family, with analytical expressions for bias and MSE, including optimal parameter values for the minimum MSE. Section 4 evaluates estimator performance through simulation studies under Kumaraswamy–Gamma distributions and empirical applications to datasets from education, agriculture, nutrition, and solar radiation. Section 5 offers a conclusion.

2. Literature Review on Median-Oriented Studies and Adapted Family

Over the past few years, accurate estimates of population means and parameters have become increasingly important in research. One reason for studying this topic is that researchers aim to create useful and accurate estimators. Ref. [5] introduced a ratio estimator for the median parameter. The idea of applying the study variable’s median in a ratio-based estimator for the population mean, especially in situations lacking auxiliary variables, was first proposed by Subramani [6,7]. His estimator showed that utilizing the known features of the target variable can improve the results of estimation. Additionally, estimators that use the median are important because they handle outliers well and require less complete data to work accurately. Lamichhane et al. [8] improved the median-based method by developing a population average estimator utilizing the median of an additional variable. Irfan et al. [9] and Yusuf et al. [10] defined an improved class of median-based power-type mean estimators. They expanded on previous studies by recommending estimators that help reduce bias and mean squared error when samples are taken without replacement. Their suggested classes outperform traditional ratio and regression estimators, as well as those developed by Subramani [6,7], as supported by theoretical derivations and empirical analyses. Refs. [11,12] introduced exponential families of median-based estimators under SRSWOR, building on earlier works by Subramani [6,7] and Irfan et al. [9]. Their proposed estimators demonstrated better performance in terms of MSE and PRE. Empirical evidence confirmed that the constructed family of estimators consistently outperformed their traditional and contemporary counterparts, reinforcing the utility of median-based modifications in practical survey applications. Abdullahi and Ugwuowo [13] developed median-based regression-type mean estimators with two tuning parameters. Singh and Tiwari [14] defined median-based mean estimators with bivariate auxiliary information under the neutrosophic framework. Kurbah et al. [15] extended this study on median-based mean estimators with double ranked set sampling.
According to the literature, few studies have focused on creating mean estimators that use both median-based linear regression and a median-based exponential component. This gap underscores the motivation for the present study. Therefore, in the following subsection, we define an exponential family of adapted median-based mean estimators that utilize both of these study variable median-based components with known raw measures of dispersion from the auxiliary variable.

Quantile-Linked Measures of Dispersion and Adapted Family

This section defines a generalized family of median-based estimators for the mean parameter, specifically designed for situations where certain quantile-linked raw measures of dispersion related to an auxiliary variable are readily available. These estimators make use of simple statistical methods to enhance the estimation process under SRSWOR. The dispersion measures incorporated into the development of these estimators include the following:
  • IQR:
    R i q = Q 3 ( X ) Q 1 ( X )
  • SIQR:
    R s q = Q 3 ( X ) Q 1 ( X ) 2
  • MR:
    R m i d = max ( X ) + min ( X ) 2
  • IDR:
    R i d = D 9 ( X ) D 1 ( X ) .
  • Measuring the spread of data is crucial, and the interquartile range (IQR), semi-interquartile range (SIQR), mid-range (MR), and interdecile range (IDR) are among the most practical and widely used raw measures for this purpose (Soni and Pandey [16]; Naz et al. [17]). The values obtained from these metrics play a significant role in constructing new estimators for the population mean. When SRSWOR is used, population dispersion estimation accuracy is enhanced by the inclusion of crucial features of the auxiliary variable. The theoretical basis and statistical characteristics of these dispersion measures are well-researched topics in the academic literature.
By following the Cetin and Koyuncu [18,19,20] structure of estimators, we define the known median-based mean family of adapted estimators by including raw measures (IQR, SIQR, MR, IDR), as given below:
Y ^ G i = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp a 1 μ ˜ M μ ˜ m a 1 μ ˜ M + μ ˜ m + 2 b 1 ,
where w 1 and w 2 correspond to optimally defined constants. ( μ ^ y , μ ˜ m ) denote the sample mean and median of the response variable (Y), respectively. μ ˜ M is the population median of (Y). Further, a 1 and b 1 are either raw measures of the dispersion of the auxiliary variable (X) or unity. It is worth noting that the exponential formulation is based on Shahzad et al. [11], who revealed that exponential ratio-type structures provide lower bias and more stable MSE when the variables in the survey are positively correlated. The current research extends this argument further by anchoring quantile-related and dispersion metrics to exponential-type estimators.
The lowest MSE of Y ^ G i achieved under optimum settings,
w 1 ( o p t ) = 8 λ θ 2 C ˜ M 2 8 1 + λ C y 2 1 r y m 2 and
w 2 ( o p t ) = μ y λ θ 3 C ˜ M 3 + 8 C y r y m λ θ 2 C ˜ M 2 C y r y m 4 θ C ˜ M 1 λ C y 2 1 r y m 2 8 μ ˜ M C ˜ M 1 + λ C y 2 1 r y m 2 ,
is given by
M S E min Y ^ G i = λ μ y 2 64 C y 2 1 r y m 2 λ θ 4 C ˜ M 4 16 λ θ 2 C ˜ M 2 C y 2 1 r y m 2 64 1 + λ C y 2 1 r y m 2
where λ = 1 f n , f = n N , and θ = a 1 μ M a 1 μ M + b 1 . All details of the estimator family are listed in Table 1, where a 1 and b 1 are chosen constants in light of references [16,17].

3. Generalized Family of Median-Based Mean Estimators Using Raw Dispersion Measures

It is well known that ratio [21], regression [22,23], and exponential [24,25] methods are useful for estimating the mean parameter in survey sampling when a correlation exists between the supplementary and study variables. Building on mid-nineteenth-century developments, these techniques are now vital in statistics. Their use began in agriculture and later expanded to other industries. Although linear regression with the known mean of (X) is widely used, the current literature shows that not many studies have considered using both the median of the target variable and the mean of the supplementary information at the same time. Because of this gap, more research in this area is required. Using Alharbi et al.’s [26] framework, we can form a difference-based estimator for the average parameter utilizing both the median of the target variable and the mean of the auxiliary variable:
Y ^ p = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x ,
where ϖ a , ϖ b , ϖ c are suitably chosen constants.
Following Y ^ G i and Y ^ p , we formulate a generalized class of median-based mean estimators Y ^ p i using raw dispersion measures, given by
Y ^ p i = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp a 1 μ ˜ M μ ˜ m a 1 μ ˜ M + μ ˜ m + 2 b 1 ,
where a 1 and b 1 are as explained earlier, and ( μ ^ x , μ x ) are the sample and population means of (X), respectively. Some members of the generalized family of median-based mean estimators Y ^ p i are provided in Table 2 using different choices of a 1 and b 1 . All exponential formulations of estimators presented in Table 1 and Table 2 are expressed in a ratio-type form such that any homogeneous change in the measurement unit of (X) or (Y) has no effect on the estimator, making scale and unit invariance practically achievable.
Let
ξ a = μ ^ y μ y μ y , ξ b = μ ˜ m μ ˜ M μ ˜ M , ξ c = μ ^ x μ x μ x ,
such that E ( ξ a ) = E ( ξ b ) = E ( ξ c ) = 0 . Further,
E ( ξ a 2 ) = λ C y 2 , E ( ξ b 2 ) = λ C ˜ M 2 , E ( ξ c 2 ) = λ C x 2 , E ( ξ a ξ b ) = λ r y m C y C ˜ M ,
E ( ξ a ξ c ) = λ r y x C y C x , and E ( ξ b ξ c ) = λ r m x C ˜ M C x .
From Equation (4), rewriting Y ^ p i in terms of ξ a , ξ b , and ξ c , we get
Y ^ p i = ϖ a μ y ( 1 + ξ a ) ϖ b μ ˜ M ξ b ϖ c μ x ξ c 1 θ ξ b 2 + 3 θ 2 ξ b 2 8 + .
By expanding Equation (5) and keeping terms only up to order two in ξ s, we can write
Y ^ p i μ y = μ y + μ y ϖ a + μ y ξ a ϖ a 1 2 μ y θ ξ b ϖ a μ ˜ M ξ b ϖ b μ x ξ c ϖ c 1 2 μ y θ ξ a ξ b ϖ a + 3 8 μ y θ 2 ξ b 2 ϖ a + 1 2 μ ˜ M θ ξ b 2 ϖ b + 1 2 μ x θ ξ b ξ c ϖ c .
From Equation (6)’s expectation applicability,
B i a s Y ^ p i 1 8 8 μ y + 4 λ θ C ˜ M μ ˜ M C ˜ M ϖ b + μ x C x ϖ c r m x + μ y ϖ a 8 + λ θ C ˜ M 3 θ C ˜ M 4 C y r y m ,
M S E Y ^ p i μ y 2 + λ μ ˜ M C ˜ M 2 ϖ b μ y θ + μ ˜ M ϖ b + λ μ x 2 C x 2 ϖ c 2 + λ μ x C ˜ M C x μ y θ + 2 μ ˜ M ϖ b ϖ c r m x + μ y 2 ϖ a 2 1 + λ C y 2 + θ C ˜ M θ C ˜ M 2 C y r y m + 1 4 μ y ϖ a 8 μ y + λ C ˜ M θ C ˜ M 3 μ y θ + 8 μ ˜ M ϖ b + 8 μ x θ C x ϖ c r m x + 4 C y μ y θ 2 μ ˜ M ϖ b r y m 8 μ x λ C y C x ϖ c r y x .
By minimizing Equation (8), the optimum versions of ϖ a , ϖ b , and ϖ c are
ϖ a ( o p t ) = 8 λ θ 2 C ˜ M 2 8 1 + λ C y 2 1 Q y , m x 2 ,
ϖ b ( o p t ) = μ y λ θ 3 C ˜ M 3 ( 1 + r m x 2 ) + ( 8 C y + λ θ 2 C ˜ M 2 C y ) ( r y m r m x r y x ) + 4 θ C ˜ M ( 1 + r m x 2 ) 1 + λ C y 2 ( 1 Q y , m x 2 ) 8 μ ˜ M C ˜ M ( 1 + r m x 2 ) 1 + λ C y 2 ( 1 Q y , m x 2 ) ,
ϖ c ( o p t ) = μ y ( 8 λ θ 2 C ˜ M 2 ) C y ( r m x r y m r y x ) 8 μ x C x ( 1 + r m x 2 ) 1 + λ C y 2 ( 1 Q y , m x 2 ) ,
where
Q y , m x 2 = r y m 2 + r y x 2 2 r y m r y x r m x 1 r m x 2
is the coefficient of multiple determination with range [0,1] (Lamichhane et al. [8]). Further, ( C y , C x ) are the coefficients of variation of Y and X, respectively. The correlation between ( μ ^ y , μ ˜ m ) is r y m , ( μ ^ y , μ ^ x ) is r y x , and ( μ ˜ m , μ ^ x ) is r m x . Following Lamichhane et al. [8], C ˜ M = C o v ( μ ^ y , μ ˜ m ) μ y μ ˜ M .
We substitute the optimum values of ϖ a , ϖ b , and ϖ c in Equation (8), and after some simplifications, we get the minimum MSE of Y ^ p i , given by
M S E min Y ^ p i = λ μ y 2 64 C y 2 ( 1 Q y , m x 2 ) λ θ 4 C ˜ M 4 16 λ θ 2 C ˜ M 2 C y 2 ( 1 Q y , m x 2 ) 64 1 + λ C y 2 ( 1 Q y , m x 2 ) .
Finally, comparing M S E min Y ^ p i < M S E min Y ^ G i , we immediately observe the following efficiency condition:
λ μ y 2 C y 2 r y x r y m r m x 2 8 + λ θ 2 C ˜ M 2 2 64 ( 1 r m x 2 ) 1 + λ C y 2 ( 1 r y m 2 ) 1 + λ C y 2 ( 1 Q y , m x 2 ) > 0 .
Note that the exponential ratio-type estimators ( Y ^ G i , Y ^ p ) are recommended for use only when X and Y exhibit a positive association. At a value of r y x above zero, the constructed expressions of the minimum MSE are always strictly non-negative, as expected theoretically. Further, the proposed method assumes a pre-specified median of the target variable and the raw dispersion values of the supplementary variable. If these quantities are unknown or only partially known, the method can be modified by substituting the required parameters with their corresponding sample values, which can be obtained through pilot surveys, historical data, or supplementary registers.

4. Numerical Illustration

4.1. Simulation Study

In our simulation experiment, we initially produced an X of size N = 100 by drawing from a Uniform(0,1) distribution. Then we defined the four parameters associated with the Kumaraswamy–Gamma distribution, namely the following: the Gamma shape of Kumaraswamy denoted by α , the Gamma scale of Kumaraswamy denoted by β , and finally the two shape parameters a and b of Kumaraswamy. We subsequently generated Y using the inverse cumulative distribution (quantile) method, where the same Uniform(0,1) sequence was employed to induce dependence between X and Y. Thus, we used the same Uniform(0,1) and used the inverse Kumaraswamy cumulative distribution function,
w = [ 1 ( 1 u ) 1 / b ] 1 / a ,
so that the probability of w is expressed by the Gamma CDF G ( x ) a , and G indicates the Gamma CDF. We determined Y by taking the inverse of the Gamma CDF:
Y = G 1 ( w ) = qgamma ( w , shape = α , scale = β ) .
Therefore, Y is a strictly increasing transform of the same u that defines X, which ensures C o v ( X , Y ) > 0 . The induced associations of X and Y in the three simulated populations were around 0.94, 0.96, and 0.97, which support the strong positive dependence required by the ratio exponential estimators.
With the application of the Kumaraswamy–Gamma model, the pair of { ( X i , Y i ) } is obtained. For more details about Kumaraswamy–Gamma, see [27,28]. The three datasets were generated using the Kumaraswamy–Gamma model, each with different parameter values, as follows:
  • For Simulated Population-1:
    α = 2.0 , β = 1.0 , a = 1.5 , b = 2.0 .
  • For Simulated Population-2:
    α = 3.0 , β = 1.5 , a = 2.0 , b = 3.0 .
  • For Simulated Population-3:
    α = 4.0 , β = 2.0 , a = 3.0 , b = 4.0 .

Interpretation of Simulated Population Results (Table 3, Table 4, Table 5 and Table 6)

  • Substantial reduction in MSE with Y ^ p i estimators:
    The Y ^ p i (developed) estimators consistently achieve significantly smaller MSE values compared to the traditional estimators Y ^ G 1 to Y ^ G 8 . For instance, in Simulated Population-1, Y ^ G i estimators have MSEs around 0.0299 (e.g., Y ^ G 1 = 0.02992718), whereas Y ^ p i estimators are around 0.0090 (e.g., Y ^ p 1 = 0.009089002). This demonstrates an approximately 70% reduction in estimation error, indicating the improved accuracy of the proposed TP method.
  • Consistency and robustness across populations:
    Y ^ p i estimators are very consistent in all three populations. In Simulated Population-2, Y ^ p i MSEs are tightly grouped around 0.00568 (e.g., Y ^ p 3 = 0.005681118; Y ^ p 5 = 0.005681200), while Y ^ G i estimators vary slightly more, around 0.02824 (e.g., Y ^ G 5 = 0.02824520). This suggests that Y ^ p i methods are more robust to changes in data structure or distribution.
  • Lowest MSE achieved:
    The lowest MSE of all three estimators (in Population-1 and Population-3) is seen in Population-3, which implies that Population-3 has better properties for desirable mean estimation. To illustrate, the MSE of Y ^ p 3 in Population-3 is 0.003936279, whereas that of Y ^ G 3 is 0.02560332. This indicates a significant difference and a lower overall error floor in Population-3. This trend points to the effect of population characteristics on estimators.
  • Highest PRE achieved:
    The Y ^ p i (proposed) estimators consistently yield significantly higher PRE values compared to the adapted estimators Y ^ G 1 to Y ^ G 8 in all three simulated populations. Note that PRE was calculated as the ratio of the MSE of a baseline estimator to that of the corresponding estimator under comparison, multiplied by 100. The proposed estimators were compared against all the adapted ones to perform a thorough efficiency analysis.
  • Dependence between X and Y:
    In practice, there is high positive correlation between these simulated populations. Using the parameter sets of the three simulated populations, the actual correlations are around 0.94, 0.95, and 0.97, respectively. Such values verify the fact that the reported PRE gains are actually due to the nonzero auxiliary information correlation instead of noise.
Table 3. MSE for simulated datasets.
Table 3. MSE for simulated datasets.
θ ^ SP-1SP-2SP-3
Y ^ G 1 0.029927180.028244890.02560332
Y ^ G 2 0.029904570.028239640.02560211
Y ^ G 3 0.029927180.028244890.02560332
Y ^ G 4 0.029942350.028249220.02560441
Y ^ G 5 0.029929120.028245200.02560338
Y ^ G 6 0.029906060.028239830.02560214
Y ^ G 7 0.029929110.028245200.02560338
Y ^ G 8 0.029944340.028249600.02560449
Y ^ p 1 0.0090890020.0056811180.003936279
Y ^ p 2 0.0090810330.0056796910.003935994
Y ^ p 3 0.0090890020.0056811180.003936279
Y ^ p 4 0.0090941890.0056822640.003936531
Y ^ p 5 0.0090896700.0056812000.003936292
Y ^ p 6 0.0090815660.0056797430.003936002
Y ^ p 7 0.0090896700.0056812000.003936292
Y ^ p 8 0.0090948590.0056823640.003936549
Table 4. PRE for Simulated Population-1.
Table 4. PRE for Simulated Population-1.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 329.2681329.0193329.2681329.4349329.2894329.0357329.2893329.4568
Y ^ G 2 329.5571329.3081329.5570329.7241329.5783329.3245329.5783329.7459
Y ^ G 3 329.2681329.0194329.2681329.4350329.2894329.0358329.2894329.4568
Y ^ G 4 329.0803328.8317329.0803329.2470329.1015328.8481329.1015329.2689
Y ^ G 5 329.2439328.9952329.2439329.4107329.2652329.0115329.2651329.4326
Y ^ G 6 329.5377329.2887329.5377329.7047329.5590329.3051329.5590329.7266
Y ^ G 7 329.2439328.9952329.2439329.4108329.2652329.0116329.2652329.4326
Y ^ G 8 329.0560328.8074329.0560329.2228329.0773328.8238329.0773329.2446
Table 5. PRE for Simulated Population-2.
Table 5. PRE for Simulated Population-2.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 497.1713497.0789497.1713497.2476497.1767497.0823497.1767497.2543
Y ^ G 2 497.2962497.2038497.2962497.3725497.3017497.2071497.3016497.3793
Y ^ G 3 497.1713497.0790497.1713497.2476497.1768497.0823497.1768497.2544
Y ^ G 4 497.0710496.9787497.0710497.1473497.0764496.9820497.0764497.1540
Y ^ G 5 497.1641497.0717497.1641497.2403497.1695497.0750497.1695497.2471
Y ^ G 6 497.2917497.1993497.2916497.3679497.2971497.2026497.2971497.3747
Y ^ G 7 497.1641497.0717497.1641497.2403497.1695497.0750497.1695497.2471
Y ^ G 8 497.0622496.9699497.0622497.1385497.0677496.9732497.0677497.1452
Table 6. PRE for Simulated Population-3.
Table 6. PRE for Simulated Population-3.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 650.4448650.4141650.4448650.4726650.4462650.4149650.4462650.4745
Y ^ G 2 650.4918650.4611650.4918650.5196650.4933650.4619650.4932650.5215
Y ^ G 3 650.4448650.4141650.4448650.4726650.4463650.4149650.4462650.4745
Y ^ G 4 650.4031650.3724650.4031650.4309650.4045650.3732650.4045650.4328
Y ^ G 5 650.4427650.4120650.4427650.4704650.4441650.4128650.4441650.4724
Y ^ G 6 650.4906650.4599650.4906650.5184650.4920650.4607650.4920650.5203
Y ^ G 7 650.4427650.4120650.4427650.4705650.4441650.4128650.4441650.4724
Y ^ G 8 650.4002650.3696650.4002650.4280650.4017650.3703650.4017650.4299

4.2. Real-Life Applications

  • Education data:
  • Researchers in education must accurately determine how much is invested in education to make fair and effective decisions. The relationship between public education spending per person and personal income per person is typically quite clear. Estimating the means of these variables reveals what influences educational spending in various parts of the world. Reliable estimation is essential when analyzing datasets such as the Education dataset, originally compiled for modeling public spending on education (Chatterjee and Hadi, [29]). The robustbase package in R software (Ref. [30]) includes this dataset, with per capita personal income (in USD) as the auxiliary variable, X, and per capita spending on public education as the primary variable of interest, Y.
  • Wheat data:
  • To analyze and guide agricultural decisions on wheat crops, scientists in research rely heavily on understanding population means. With datasets from the past, such as the one in Sukhatme and Sukhatme [31], the correct method of estimation is crucial. In the data, the variable Y represents the yield (in quintals per acre) of wheat harvested in 1937, and the variable X represents the area under wheat cultivation in 1936.
  • UScereal data:
  • In nutritional analysis, determining the mean of important dietary elements is necessary to assess food quality and help shape public health advice. This research utilizes the UScereals database, which provides detailed nutritional information for 65 common breakfast cereals consumed in the United States (U.S.). The dataset was compiled for the Statistical Graphics Exposition by the ASA and then organized by Venables and Ripley [32], using one U.S. cup as the reference serving size. Given the wide range of variables in the datasets, researchers chose “Grams of fiber per serving” as the supplementary variable (X) and “grams of potassium per serving” as the main variable (Y).
  • Meteorological Solar Radiation data:
  • The research data used in the given study consist of meteorological measurements from the HI-SEAS weather station for four months, between September and December 2016. It contains the entire record of environmental conditions that are pertinent to the study of solar energy and incorporates the following measured parameters: wind direction, wind speed, humidity, and temperature. One of the response variables is solar radiation (in watts per square meter, W/m2), which serves as the main variable to be estimated and modeled. Along with it, the temperature (in degrees Fahrenheit, i.e., °F) is regarded as the auxiliary variable, as it is not only theoretically but also empirically correlated with solar radiation, and thus, it can also increase the efficiency of estimations. This structured information provides a solid platform on which the estimation methods are to be tested, and this is especially useful in mean estimation methods where auxiliary information is used to enhance precision. The data is publicly accessible on the Kaggle platform, and as such, this ensures the accessibility and reproducibility of its results. We took the initial 100 observations of the data for the purposes of this article.
  • To investigate the observed performance of the developed estimators under realistic conditions, we considered a set of applied datasets with differing sample sizes, units, and strengths of association between the target and supplementary variables. For each population, the corresponding sample sizes and actual correlation between variables were as follows: n = 10 % of the Education dataset with r y x = 0.60 , n = 15 % of the Wheat dataset with r y x = 0.41 , n = 6 % of the U.S. Cereal dataset with r y x = 0.96 , and n = 5 % of the Solar Radiation dataset with r y x = 0.83 . In all cases, the dispersion characteristics of the auxiliary variable (interquartile range, semi-interquartile range, mid-range, and interdecile range) were calculated based on the definitions given in the Quantile-Linked Measures of Dispersion and Adapted Family Section, and the median-based quantities were derived following standard practice in the survey sampling literature, explained by Lamichhane et al. [8] and Shahzad et al. [11].

Interpretation of Real-Life Application Results (Table 7, Table 8, Table 9, Table 10 and Table 11)

  • Significant MSE reduction with Y ^ p i estimators across all datasets:
    On all datasets, the Y ^ p i (proposed) estimators show smaller MSE values compared to the adapted ones, Y ^ G 1 Y ^ G 8 . For adapted methods, the results show that MSE values range from 15.85618 for Y ^ G 2 to 15.93169 for Y ^ G 7 . On the other hand, Y ^ p i approaches reported MSE values ranging from 2.291056 for Y ^ p 2 to 2.307062 for Y ^ p 7 . These findings indicate that Y ^ p i methods provide very precise results, reducing estimation errors by more than 85%.
  • Improved estimation consistency in Education and Wheat datasets:
    In the Education dataset, the adapted methods produce MSEs of around 0.9672 (for instance, Y ^ G 1 = 0.9672025), whereas Y ^ p i estimators have MSEs of around 0.8513 (for example, Y ^ G 1 = 0.8513346), representing a 12% improvement. The values of the MSE of the Wheat dataset in Y ^ G 8 and Y ^ p 2 are 124.8 and 121.7, respectively, indicating a decrease of about 2.5%. These enhancements, although small, show that the Y ^ p i method is more accurate and reliable.
  • Low variability and high stability among Y ^ p i estimators: The Y ^ p i methods exhibit an extremely high level of stability in every dataset with little variation. For example, in the Education dataset, the proposed estimators’ MSEs range narrowly from 0.8512927 ( Y ^ p 2 ) to 0.8513584 ( Y ^ p 3 ). Similarly, in the Wheat dataset, values stay close between 121.6912 ( Y ^ p 2 ) and 122.1281 ( Y ^ p 8 ). Due to this consistency, the Y ^ p i estimators are highly dependable for application across multiple datasets.
  • Highest efficiency achieved: The Y ^ p i (proposed) estimators consistently yield significantly higher PRE values compared to the adapted estimators Y ^ G 1 to Y ^ G 8 in all three real-world datasets.
Additionally, the visual representation of the PRE results is provided in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. In this visual representation, note that there are eight blocks labeled t1 to t8. Each block t represents a pair of adapted and proposed estimators, such as t 1 ϵ ( Y ^ G 1 , Y ^ p 1 ) , t 2 ϵ ( Y ^ G 2 , Y ^ p 2 ) , t 3 ϵ ( Y ^ G 3 , Y ^ p 3 ) , t 4 ϵ ( Y ^ G 4 , Y ^ p 4 ) , t 5 ϵ ( Y ^ G 5 , Y ^ p 5 ) , t 6 ϵ ( Y ^ G 6 , Y ^ p 6 ) , t 7 ϵ ( Y ^ G 7 , Y ^ p 7 ) , and t 8 ϵ ( Y ^ G 8 , Y ^ p 8 ) .
Table 7. MSE for real-life datasets.
Table 7. MSE for real-life datasets.
θ ^ EducationWheatUScerealRadiation
Y ^ G 1 0.9672025124.581215.8709526.07291
Y ^ G 2 0.9671549124.367115.8561825.45828
Y ^ G 3 0.9672294124.731015.9290726.38176
Y ^ G 4 0.9672211124.753615.8909726.23066
Y ^ G 5 0.9672183124.750515.8720126.15663
Y ^ G 6 0.9671926124.628415.8567525.67837
Y ^ G 7 0.9672304124.807015.9316926.38337
Y ^ G 8 0.9672270124.813715.8926426.27420
Y ^ p 1 0.8513346121.90072.29427914.40656
Y ^ p 2 0.8512927121.69122.29105614.05724
Y ^ p 3 0.8513584122.04732.30652714.57852
Y ^ p 4 0.8513511122.06942.29857714.49470
Y ^ p 5 0.8513486122.06632.29451014.45342
Y ^ p 6 0.8513259121.94692.29118214.18339
Y ^ p 7 0.8513592122.12162.30706214.57941
Y ^ p 8 0.8513562122.12812.29893214.51891
Table 8. PRE for Education data.
Table 8. PRE for Education data.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 113.6101113.6045113.6133113.6123113.6120113.6090113.6134113.6130
Y ^ G 2 113.6157113.6101113.6189113.6179113.6176113.6146113.6190113.6186
Y ^ G 3 113.6070113.6014113.6101113.6092113.6088113.6058113.6103113.6099
Y ^ G 4 113.6079113.6024113.6111113.6101113.6098113.6068113.6112113.6108
Y ^ G 5 113.6083113.6027113.6114113.6105113.6101113.6071113.6116113.6112
Y ^ G 6 113.6113113.6057113.6145113.6135113.6132113.6101113.6146113.6142
Y ^ G 7 113.6069113.6013113.6100113.6090113.6087113.6057113.6101113.6097
Y ^ G 8 113.6073113.6017113.6104113.6095113.6091113.6061113.6105113.6101
Table 9. PRE for Wheat data.
Table 9. PRE for Wheat data.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 102.1989102.0233102.3218102.3404102.3378102.2377102.3842102.3896
Y ^ G 2 102.3749102.1990102.4980102.5165102.5140102.4137102.5604102.5659
Y ^ G 3 102.0762101.9008102.1989102.2174102.2149102.1149102.2612102.2667
Y ^ G 4 102.0577101.8823102.1805102.1989102.1964102.0964102.2427102.2482
Y ^ G 5 102.0603101.8849102.1830102.2015102.1989102.0989102.2453102.2507
Y ^ G 6 102.1603101.9847102.2831102.3016102.2990102.1989102.3454102.3509
Y ^ G 7 102.0140101.8387102.1367102.1552102.1526102.0527102.1989102.2044
Y ^ G 8 102.0086101.8333102.1313102.1497102.1472102.0472102.1935102.1989
Table 10. PRE for UScereal data.
Table 10. PRE for UScereal data.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 691.7618691.1182694.2953692.6344691.8083691.1432694.4095692.7072
Y ^ G 2 692.7350692.0905695.2721693.6088692.7815692.1155695.3864693.6817
Y ^ G 3 688.0886687.4484690.6087688.9565688.1348687.4732690.7222689.0290
Y ^ G 4 690.4683689.8259692.9971691.3393690.5147689.8509693.1111691.4120
Y ^ G 5 691.6922691.0487694.2255692.5647691.7386691.0736694.3396692.6375
Y ^ G 6 692.6969692.0525695.2339693.5707692.7435692.0775695.3482693.6436
Y ^ G 7 687.9291687.2890690.4486688.7968687.9753687.3139690.5621688.8692
Y ^ G 8 690.3617689.7194692.8901691.2325690.4080689.7443693.0040691.3052
Table 11. PRE for Meteorological Solar Radiation data.
Table 11. PRE for Meteorological Solar Radiation data.
θ ^ Y ^ p 1 Y ^ p 2 Y ^ p 3 Y ^ p 4 Y ^ p 5 Y ^ p 6 Y ^ p 7 Y ^ p 8
Y ^ G 1 180.9794176.7131183.1232182.0744181.5605178.2408183.1344182.3766
Y ^ G 2 185.4767181.1043187.6738186.5989186.0723182.6700187.6853186.9086
Y ^ G 3 178.8448174.6287180.9633179.9268179.4190176.1384180.9743180.2255
Y ^ G 4 179.8790175.6386182.0097180.9673180.4566177.1570182.0209181.2677
Y ^ G 5 180.3927176.1402182.5296181.4842180.9720177.6630182.5407181.7854
Y ^ G 6 183.8271179.4936186.0046184.9393184.4174181.0454186.0160185.2463
Y ^ G 7 178.8338174.6181180.9522179.9158179.4081176.1277180.9633180.2145
Y ^ G 8 179.5790175.3456181.7062180.6655180.1556176.8615181.7173180.9654
The MSE plots for the simulations and Education, Wheat, UScereal, and Meteorological Solar Radiation datasets provide a visual confirmation of the dominant performance of Y ^ p 1 Y ^ p 8 .

5. Conclusions

In this study, a unified class of mean estimation methods was established employing the median as the pre-specified value of the target variable along with the raw measures of the dispersion of auxiliary variables, including the interquartile range, semi-interquartile range, mid-range, and interdecile range. The methodology offers a new addition to survey sampling, where traditional methods usually use the average of auxiliary variables and underutilize the median, even though this is a more robust statistic against distributional aberrations. To fill this gap, two broad classes of estimators were suggested: adapted exponential-type (AET) estimators and generalized exponential-type (GET) estimators, each of which aims to use the joint power of the known median and auxiliary information. Theoretical derivations showed that under ideal circumstances, the GET estimators exhibit lower MSE, and the results were consistently confirmed by simulation studies using Kumaraswamy–Gamma populations. Moreover, the empirical results based on datasets from the education, agriculture, and nutrition fields, as well as a real-world meteorological test with solar radiation data of the HI-SEAS(HS) weather station, indicate that the proposed estimators outperform the current methods by a significant margin. These findings underscore the applicability of the proposed framework in practice, indicating that it may be applied not only in conventional areas of surveys but also in the development of environmental and energy-associated research in which solar radiation is a central figure. Furthermore, the suggested framework can be expanded to complex sampling schemes like stratified, two-stage, and double sampling and to models involving non-response and measurement error in future research.

Author Contributions

Conceptualization, H.M.A.; methodology, H.M.A. and S.I.; software, H.M.A.; validation, H.M.A.; formal analysis, H.M.A.; investigation, M.M.A.; resources, M.M.A. and S.I.; data curation, M.M.A. and S.I.; writing—original draft, H.M.A., M.M.A. and S.I.; writing—review and editing, H.M.A., M.M.A. and S.I.; visualization, H.M.A.; supervision, M.M.A.; project administration, H.M.A.; funding acquisition, H.M.A. All authors have read and agreed to the published version of the manuscript.

Funding

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R 299), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alomair, A.M.; Shahzad, U. Tweedie-Poisson regression-based mean estimation of dose equivalence rates using gamma count auxiliary information. J. Radiat. Res. Appl. Sci. 2025, 18, 101929. [Google Scholar] [CrossRef]
  2. Chutiman, N.; Nathomthong, A.; Wichitchan, S.; Guayjarernpanishk, P. Improved Estimator Using Auxiliary Information in Adaptive Cluster Sampling with Networks Selected Without Replacement. Symmetry 2025, 17, 375. [Google Scholar] [CrossRef]
  3. Alomair, A.M.; Shahzad, U. Optimizing mean estimators with calibrated minimum covariance determinant in median ranked set sampling. Symmetry 2023, 15, 1581. [Google Scholar] [CrossRef]
  4. Koc, T.; Koc, H. A new class of quantile regression ratio-type estimators for finite population mean in stratified random sampling. Axioms 2023, 12, 713. [Google Scholar] [CrossRef]
  5. Kuk, Y.A.; Mak, K.T. Median estimation in the presence of auxiliary information. J. R. Stat. Soc. Ser. (Methodol.) 1989, 51, 261–269. [Google Scholar] [CrossRef]
  6. Subramani, J. A new modified ratio estimator for estimation of population mean when median of the auxiliary variable is known. Pak. J. Stat. Oper. Res. 2013, 9, 137–145. [Google Scholar] [CrossRef]
  7. Subramani, J. A new median based ratio estimator for estimation of the finite population mean. Stat. Transit. New Ser. 2016, 17, 591–604. [Google Scholar] [CrossRef]
  8. Lamichhane, R.; Singh, S.; Diawara, N. Improved estimation of population mean using known median of auxiliary variable. Commun. Stat. Simul. Comput. 2017, 46, 2821–2828. [Google Scholar] [CrossRef]
  9. Irfan, M.; Javed, M.; Abid, M.; Lin, Z. Improved ratio type estimators of population mean based on median of a study variable and an auxiliary variable. Hacet. J. Math. Stat. 2018, 47, 659–673. [Google Scholar] [CrossRef]
  10. Yusuf, A.Y.; Audu, A.; Yunusa, M.A. Power median-based estimators of finite population mean. Fudma J. Sci. 2024, 8, 296–300. [Google Scholar] [CrossRef]
  11. Shahzad, U.; Al-Noor, N.H.; Hanif, M.; Sajjad, I. An exponential family of median based estimators for mean estimation with simple random sampling scheme. Commun. Stat. Theory Methods 2021, 50, 4890–4899. [Google Scholar] [CrossRef]
  12. Saini, M.; Jitendrakumar, B.R.; Kumar, A. Improved ratio estimator under simple and stratified random sampling. Life Cycle Reliab. Saf. Eng. 2024, 13, 181–187. [Google Scholar] [CrossRef]
  13. Abdullahi, U.K.; Ugwuowo, F.I. On efficient median-based linear regression estimator for population mean. Commun. Stat. Theory Methods 2022, 51, 5012–5024. [Google Scholar] [CrossRef]
  14. Singh, R.; Tiwari, S.N. Improved Estimator for Population Mean Utilizing Known Medians of Two Auxiliary Variables under Neutrosophic Framework. Neutrosophic Syst. Appl. 2025, 25, 38–52. [Google Scholar] [CrossRef]
  15. Kurbah, R.E.; Khongji, P.; Hynniewta, B.C. Efficient rank set sampling estimators of mean using median under double sampling with two auxiliary variables. Adv. Appl. Stat. 2024, 91, 943–968. [Google Scholar] [CrossRef]
  16. Soni, S.S.; Pandey, H. Generalized estimator of population variance utilizing auxiliary information in simple random sampling scheme. J. Probab. Stat. Sci. 2023, 21, 23–33. [Google Scholar] [CrossRef]
  17. Naz, F.; Nawaz, T.; Pang, T.; Abid, M. Use of Nonconventional Dispersion Measures to Improve the Efficiency of Ratio-Type Estimators of Variance in the Presence of Outliers. Symmetry 2020, 12, 16. [Google Scholar] [CrossRef]
  18. Cetin, A.E.; Koyuncu, N. Calibration estimator of population mean in stratified extreme ranked set sampling with simulation study. Filomat 2024, 38, 599–608. [Google Scholar] [CrossRef]
  19. Cetin, A.E.; Koyuncu, N. Robust regression type estimators for body mass index under extreme ranked set and quartile ranked set sampling. Commun. Fac. Sci. Univ. Ank. Ser. Math. Stat. 2024, 73, 336–348. [Google Scholar]
  20. Cetin, A.E.; Koyuncu, N. New robust class of estimators for population mean under different sampling designs. J. Comput. Appl. Math. 2024, 441, 115669. [Google Scholar] [CrossRef]
  21. Zaman, T.; Bulut, H. A simulation study: Robust ratio double sampling estimator of finite population mean in the presence of outliers. Sci. Iran. 2024, 31, 1330–1341. [Google Scholar] [CrossRef]
  22. Zaman, T.; Shazad, U.; Yadav, V.K. An efficient Hartley–Ross type estimators of nonsensitive and sensitive variables using robust regression methods in sample surveys. J. Comput. Appl. Math. 2024, 440, 115645. [Google Scholar] [CrossRef]
  23. Rashedi, K.A.; Abdulrahman, A.T.; Alshammari, T.S.; Alshammari, K.M.; Shahzad, U.; Shabbir, J.; Ahmad, I. Robust Särndal-Type Mean Estimators with Re-Descending Coefficients. Axioms 2025, 14, 261. [Google Scholar] [CrossRef]
  24. Mishra, R.; Singh, R.; Adichwal, N.K. A novel ratio cum product type exponential class of estimators of finite population mean in Adaptive cluster Sampling. Braz. J. Biom. 2025, 43, e43745. [Google Scholar] [CrossRef]
  25. Daraz, U.; Wu, J.; Albalawi, O. Double exponential ratio estimator of a finite population variance under extreme values in simple random sampling. Mathematics 2024, 12, 1737. [Google Scholar] [CrossRef]
  26. Alharbi, R.; Mustafa, M.S.; Al Mutairi, A.; Hussein, M.; Yusuf, M.; Elshenawy, A.; Nassr, S.G. Enhancing mean estimators in median ranked set sampling with dual auxiliary information. Heliyon 2023, 9, e21427. [Google Scholar] [CrossRef] [PubMed]
  27. De Pascoa, M.A.; Ortega, E.M.; Cordeiro, G.M. The Kumaraswamy generalized gamma distribution with application in survival analysis. Stat. Methodol. 2011, 8, 411–433. [Google Scholar] [CrossRef]
  28. Arshad, R.M.I.; Tahir, M.H.; Chesneau, C.; Jamal, F. The Gamma Kumaraswamy-G family of distributions: Theory, inference and applications. Stat. Transit. New Ser. 2020, 21, 17–40. [Google Scholar] [CrossRef]
  29. Chatterjee, S.; Hadi, S.A. Regression Analysis by Example; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
  30. Maechler, M.; Rousseeuw, P.; Croux, C.; Todorov, V.; Ruckstuhl, A.; Salibian-Barrera, M.; di Palma, M.A. Robustbase: Basic Robust Statistics R Package Version 0.92-7. 2016. Available online: http://CRAN.R-project.org/package=robustbase (accessed on 5 February 2025).
  31. Sukhatme, P.V.; Sukhatme, B.V. Sampling Theory of Surveys with Application; Iowa Statistical University Press: Iowa City, IA, USA, 1970. [Google Scholar]
  32. Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S-PLUS, 3rd ed.; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
Figure 1. The MSE plot for SP-1.
Figure 1. The MSE plot for SP-1.
Axioms 14 00857 g001
Figure 2. The MSE plot for SP-2.
Figure 2. The MSE plot for SP-2.
Axioms 14 00857 g002
Figure 3. The MSE plot for SP-3.
Figure 3. The MSE plot for SP-3.
Axioms 14 00857 g003
Figure 4. The MSE plot for the Education dataset.
Figure 4. The MSE plot for the Education dataset.
Axioms 14 00857 g004
Figure 5. The MSE plot for the Wheat dataset.
Figure 5. The MSE plot for the Wheat dataset.
Axioms 14 00857 g005
Figure 6. The MSE plot for the UScereal dataset.
Figure 6. The MSE plot for the UScereal dataset.
Axioms 14 00857 g006
Figure 7. The MSE plot for the Metrological Solar Radiation dataset.
Figure 7. The MSE plot for the Metrological Solar Radiation dataset.
Axioms 14 00857 g007
Table 1. Members of adapted family.
Table 1. Members of adapted family.
EstimatorValues of Constants
a 1 b 1
Y ^ G 1 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R m i d 1 R m i d
Y ^ G 2 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R i d 1 R i d
Y ^ G 3 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R i q 1 R i q
Y ^ G 4 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R s q 1 R s q
Y ^ G 5 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp R m i d μ ˜ M μ ˜ m R m i d μ ˜ M + μ ˜ m + 2 R m i d 1
Y ^ G 6 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp R i d μ ˜ M μ ˜ m R i d μ ˜ M + μ ˜ m + 2 R i d 1
Y ^ G 7 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp R i q μ ˜ M μ ˜ m R i q μ ˜ M + μ ˜ m + 2 R i q 1
Y ^ G 8 = w 1 μ ^ y + w 2 μ ˜ M μ ˜ m exp R s q μ ˜ M μ ˜ m R s q μ ˜ M + μ ˜ m + 2 R s q 1
Table 2. Members of proposed family.
Table 2. Members of proposed family.
EstimatorValues of Constants
a 1 b 1
Y ^ p 1 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R m i d 1 R m i d
Y ^ p 2 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R i d 1 R i d
Y ^ p 3 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R i q 1 R i q
Y ^ p 4 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp μ ˜ M μ ˜ m μ ˜ M + μ ˜ m + 2 R s q 1 R s q
Y ^ p 5 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp R m i d μ ˜ M μ ˜ m R m i d μ ˜ M + μ ˜ m + 2 R m i d 1
Y ^ p 6 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp R i d μ ˜ M μ ˜ m R i d μ ˜ M + μ ˜ m + 2 R i d 1
Y ^ p 7 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp R i q μ ˜ M μ ˜ m R i q μ ˜ M + μ ˜ m + 2 R i q 1
Y ^ p 8 = ϖ a μ ^ y + ϖ b μ ˜ M μ ˜ m + ϖ c μ ^ x μ x exp R s q μ ˜ M μ ˜ m R s q μ ˜ M + μ ˜ m + 2 R s q 1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alshanbari, H.M.; Anas, M.M.; Iftikhar, S. Statistical Formulation for Average Parameter Determination via Quantile-Linked Auxiliary Characteristics. Axioms 2025, 14, 857. https://doi.org/10.3390/axioms14120857

AMA Style

Alshanbari HM, Anas MM, Iftikhar S. Statistical Formulation for Average Parameter Determination via Quantile-Linked Auxiliary Characteristics. Axioms. 2025; 14(12):857. https://doi.org/10.3390/axioms14120857

Chicago/Turabian Style

Alshanbari, Huda M., Malik Muhammad Anas, and Soofia Iftikhar. 2025. "Statistical Formulation for Average Parameter Determination via Quantile-Linked Auxiliary Characteristics" Axioms 14, no. 12: 857. https://doi.org/10.3390/axioms14120857

APA Style

Alshanbari, H. M., Anas, M. M., & Iftikhar, S. (2025). Statistical Formulation for Average Parameter Determination via Quantile-Linked Auxiliary Characteristics. Axioms, 14(12), 857. https://doi.org/10.3390/axioms14120857

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop