Optimal Transformation-Based Median Estimation Under Stratified Double Sampling with Limited Auxiliary Information
Abstract
1. Introduction
Practical Implementation and Applicability of the Proposed Estimators
2. Notation and Symbols
3. Review of Conventional Estimators
4. Order-Statistic-Based Estimators
- whereandrepresent the decile mean, quartile deviation, quartile range, quartile average, and mid-range.
5. Comparative Evaluation of Estimator Properties
- (i):
- (ii):
- (iii):
- (iv):
- (v):
- (vi):
- (vii):
- (viii):
6. Simulation Design and Numerical Assessment
6.1. Simulation Study
- Population 1 (Exponential Distribution A). The auxiliary variable follows , showing strong right skewness. The relationship is defined by , where . This construction produces an approximate correlation of between X and Y.
- Population 2 (Exponential Distribution B). The variable , showing a more moderate degree of skewness. The corresponding study variable is , where resulting in a correlation of .
- Population 3 (Gamma Distribution). The Gamma distribution with parameters and , showing moderate right skewness. The Y is generated using the relationship where producing an approximate correlation of .
- Population 4 (Cauchy Distribution). The auxiliary variable follows a heavy-tailed distribution, . The study variable is modeled as , where indicating a negative linear association of .
- Population 5 (Lognormal Distribution). The auxiliary variable X follows a lognormal distribution with parameters and , indicating mild skewness. The study variable Y is defined as where This specification produces an approximate correlation of between X and Y.
- 1.
- Population generation under stratification. Partition the finite data of size N into L nonoverlapping strata with sizes and weights (, ). In each stratum h, draw the auxiliary variable from a specified distribution (with stratum-specific parameters), and generate the study variable fromallowing the X and Y association and noise to vary by stratum.
- 2.
- Phase-I (first-phase sampling on X only). Within each stratum h, select an SRSWOR of size with . Record only the auxiliary measurements X.
- 3.
- Phase-II (second-phase subsampling on X and Y). From the phase-I sample in each stratum h, select a subsample of size (SRSWOR; ). For these units, observe both X and Y.
- 4.
- Estimator construction (median-based, stratified). Combine phase-I information on X with the paired from phase-II to compute median-based estimators under stratification (e.g., simple, ratio, difference, exponential, Rao–Gupta, and the proposed transformation class). The finite-population target is defined at the stratum level and aggregated using stratum weights, e.g., the stratified median target .
- 5.
- Monte Carlo evaluation. Repeat the two-phase stratified design (Steps 1–4) for a large number of replications (e.g., 25,000) to study sampling behavior.
- 6.
- Performance summaries. The sampling process is repeated 25,000 times in order to achieve uniform results. On the estimator and in combination with m and n, MSE is calculated as the average of all replicas. The percent relative efficiency (PRE) and empirical MSE of each estimator are then computed below:andwhere The results obtained from the simulation are given in Table 2.
6.2. Empirical Performance Evaluation
6.3. Discussion and Interpretation of Findings
- Performance on simulated populations: All transformation-based estimators perform better than the traditional ratio, regression, exponential, and difference-type estimators across the five simulated populations, as shown in Table 2 and Figure 1. The efficiency values show a constant improvement in efficiency, as they increase gradually from to . The largest PREs are recorded by estimators like , , and ; these estimators frequently cross 300 percent, whilst conventional estimators stay below 180 percent. This outcome highlights the significant advantages of using phase-wise stratification and auxiliary transformations, which improve stability even in distributions that are heavily tailed or strongly skewed.
- The PRE patterns observed for Populations 3 and 5 are relatively similar because both populations exhibit moderate positive correlation structures and comparatively stable distributional behavior despite originating from different distributions. In both cases, the transformation-based estimators effectively utilize auxiliary information under positively skewed population settings, resulting in closely related efficiency trends. This similarity further indicates that the proposed estimator family maintains stable performance across different distributional forms when the underlying correlation structure between the study and auxiliary variables is reasonably comparable.
- Validation on real populations: The same efficiency trends are confirmed to apply to empirical data in Table 6 and Figure 2. In all three real-life populations, the transformation-based estimators continue to produce better PRE values; the greatest overall performance is given by . The suggested approaches show their dependability outside of controlled simulation conditions in each dataset by showing resistance to distributional asymmetry and moderate correlations between the variables Y and X.
- It is important to note that the PRE values obtained from the simulation study are comparatively larger than those observed for the real population datasets. This behavior is expected because the simulated populations were generated under controlled distributional environments specifically designed to evaluate estimator performance under skewed and heavy-tailed conditions. Under such settings, the transformation structure can more effectively utilize auxiliary information, leading to larger relative efficiency improvements.
- In contrast, real population data naturally contain additional variability arising from heterogeneous population structures, irregular correlations, measurement fluctuations, and practical survey complexities. These factors generally reduce the magnitude of achievable efficiency improvements. Nevertheless, the proposed estimators continue to demonstrate clear superiority over the existing estimators in both simulated and empirical analyses, indicating that the observed efficiency improvements are not simulation-driven but reflect the overall robustness and practical applicability of the proposed methodology.
- The derivation of the proposed estimators is based on first-order Taylor series approximation techniques commonly adopted in survey sampling theory. These approximations are developed under the assumption that the sampling errors are sufficiently small so that higher-order terms may be neglected for analytical tractability. It is acknowledged that, under extremely heavy-tailed distributions such as the Cauchy distribution, higher-order moments may not exist and the approximation accuracy may therefore be affected. However, the inclusion of the Cauchy population in the simulation study was intended to examine the practical robustness and stability of the proposed estimators under challenging distributional settings. The simulation and empirical findings indicate that the proposed estimators continue to exhibit stable and improved performance even in such heavy-tailed situations.
7. Conclusions and Future Perspectives
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Cochran, W.B. Sampling Techniques; John Wiley and Sons: Hoboken, NJ, USA, 1963. [Google Scholar]
- Särndal, C.E. Sample survey theory vs. general statistical theory: Estimation of the population mean. Int. Stat. Rev./Rev. Int. Stat. 1972, 40, 1–12. [Google Scholar] [CrossRef]
- Stigler, S.M. Linear functions of order statistics. Ann. Math. Stat. 1969, 40, 770–788. [Google Scholar] [CrossRef]
- Gross, S. Median estimation in sample surveys. In Proceedings of the Section on Survey Research Methods; American Statistical Association Ithaca: Alexandria, VA, USA, 1980. [Google Scholar]
- Sedransk, J.; Meyer, J. Confidence intervals for the quantiles of a finite population: Simple random and stratified simple random sampling. J. R. Stat. Soc. Ser. B (Methodol.) 1978, 40, 239–252. [Google Scholar] [CrossRef]
- Philip, S.; Sedransk, J. Lower bounds for confidence coefficients for confidence intervals for finite population quantiles. Commun. Stat.-Theory Methods 1983, 12, 1329–1344. [Google Scholar] [CrossRef]
- Kuk, Y.C.A.; Mak, T.K. Median estimation in the presence of auxiliary information. J. R. Stat. Soc. Ser. B 1989, 51, 261–269. [Google Scholar] [CrossRef]
- Rao, T.J. On certail methods of improving ration and regression estimators. Commun. Stat.-Theory Methods 1991, 20, 3325–3340. [Google Scholar] [CrossRef]
- Singh, S.; Joarder, A.H.; Tracy, D.S. Median estimation using double sampling. Aust. N. Z. J. Stat. 2001, 43, 33–46. [Google Scholar] [CrossRef]
- Khoshnevisan, M.; Singh, H.P.; Singh, S.; Smarandache, F. A General Class of Estimators of Population Median Using Two Auxiliary Variables in Double Sampling; Virginia Polytechnic Institute and State University: Blacksburg, VA, USA, 2002. [Google Scholar]
- Singh, S. Advanced Sampling Theory with Applications: How Michael Selected Amy; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2003; Volume 2. [Google Scholar]
- Gupta, S.; Shabbir, J.; Ahmad, S. Estimation of median in two-phase sampling using two auxiliary variables. Commun. Stat.-Theory Methods 2008, 37, 1815–1822. [Google Scholar] [CrossRef]
- Aladag, S.; Cingi, H. Improvement in estimating the population median in simple random sampling and stratified random sampling using auxiliary information. Commun. Stat.-Theory Methods 2015, 44, 1013–1032. [Google Scholar] [CrossRef]
- Solanki, R.S.; Singh, H.P. Some classes of estimators for median estimation in survey sampling. Commun. Stat.-Theory Methods 2015, 44, 1450–1465. [Google Scholar] [CrossRef]
- Daraz, U.; Almulhim, F.A.; Alomair, M.A.; Alomair, A.M. Population median estimation using auxiliary variables: A simulation study with real data across sample sizes and parameters. Mathematics 2025, 13, 1660. [Google Scholar] [CrossRef]
- Sharma, P.; Lata, A.; Yadav, S.K.; Noor-ul-Amin, M. Family of estimators for estimating population median using auxiliary information in survey sampling. J. Reliab. Stat. Stud. 2025, 18, 343–370. [Google Scholar] [CrossRef]
- Baig, A.; Masood, S.; Ahmed Tarray, T. Improved class of difference-type estimators for population median in survey sampling. Commun. Stat.-Theory Methods 2019, 49, 5778–5793. [Google Scholar] [CrossRef]
- Sharma, P.; Singh, R. Generalized class of estimators for population median using auxiliary information. Hacet. J. Math. Stat. 2015, 44, 443–453. [Google Scholar] [CrossRef]
- Masood, S.; Ibrar, B.; Shabbir, J.; Movaheedil, Z. Estimating neutrosophic finite median employing robust measures of the auxiliary variable. Sci. Rep. 2024, 14, 10255. [Google Scholar] [CrossRef] [PubMed]
- Sharma, P.; Pusa, N.; Kumari, M.; Singh, P. Balancing accuracy and cost: A new estimator for stratified random sampling. Commun. Stat.-Simul. Comput. 2025, 1–17. [Google Scholar] [CrossRef]
- Shabbir, J.; Gupta, S. A generalized class of difference type estimators for population median in survey sampling. Hacet. J. Math. Stat. 2017, 46, 1015–1028. [Google Scholar] [CrossRef]
- Irfan, M.; Maria, J.; Shongwe, S.C.; Zohaib, M.; Bhatti, S.H. Estimation of population median under robust measures of an auxiliary variable. Math. Probl. Eng. 2021, 2021, 4839077. [Google Scholar] [CrossRef]
- Shabbir, J.; Gupta, S.; Narjis, G. On improved class of difference type estimators for population median in survey sampling. Commun. Stat.-Theory Methods 2022, 51, 3334–3354. [Google Scholar] [CrossRef]
- Subzar, M.; Lone, S.A.; Ekpenyong, E.J.; Salam, A.; Aslam, M.; Raja, T.A.; Almutlak, S.A. Efficient class of ratio cum median estimators for estimating the population median. PLoS ONE 2023, 18, e0274690. [Google Scholar] [CrossRef] [PubMed]
- Iseh, M.J. Model formulation on efficiency for median estimation under a fixed cost in survey sampling. Model Assist. Stat. Appl. 2023, 18, 373–385. [Google Scholar] [CrossRef]
- Alghamdi, A.S.; Almulhim, F.A. Stratified median estimation using auxiliary transformations: A robust and efficient approach in asymmetric populations. Axioms 2025, 17, 1127. [Google Scholar] [CrossRef]
- Singh, H.P.; Vishwakarma, G.K. Modified exponential ratio and product estimators for finite population mean in double sampling. Austrian J. Stat. 2007, 36, 217–225. [Google Scholar] [CrossRef]
- Bureau of Statistics. Punjab Development Statistics Government of the Punjab, Lahore, Pakistan; Bureau of Statistics: Islamabad, Pakistan, 2013. [Google Scholar]
- Bureau of Statistics. Punjab Development Statistics Government of the Punjab, Lahore, Pakistan; Bureau of Statistics: Islamabad, Pakistan, 2014. [Google Scholar]


| Different Estimators of | ||||
|---|---|---|---|---|
| 1 | ||||
| 1 | 0 | |||
| 0 | 1 | |||
| 0 | ||||
| 1 | 1 | |||
| 1 | ||||
| 0 |
| Estimator | PRE-1 | PRE-2 | PRE-3 | PRE-4 | PRE-5 |
|---|---|---|---|---|---|
| 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | |
| 146.72 | 136.45 | 128.93 | 131.52 | 134.08 | |
| 158.46 | 150.21 | 142.80 | 147.62 | 152.09 | |
| 164.18 | 157.94 | 150.31 | 154.08 | 159.73 | |
| 110.92 | 111.87 | 114.76 | 118.32 | 114.57 | |
| 172.65 | 165.32 | 159.29 | 162.46 | 168.22 | |
| 177.84 | 169.04 | 163.44 | 166.87 | 171.95 | |
| 181.27 | 172.39 | 166.92 | 170.42 | 175.11 | |
| 262.83 | 278.65 | 286.17 | 295.54 | 281.63 | |
| 273.19 | 289.72 | 297.34 | 307.29 | 292.77 | |
| 281.48 | 298.46 | 306.88 | 316.58 | 302.11 | |
| 288.95 | 306.11 | 314.62 | 324.97 | 310.38 | |
| 294.63 | 312.72 | 321.39 | 332.81 | 318.12 | |
| 301.52 | 318.93 | 328.22 | 340.95 | 325.64 | |
| 307.74 | 325.48 | 334.76 | 347.92 | 333.71 | |
| 313.92 | 331.87 | 341.52 | 354.86 | 340.24 |
| Parameter | Stratum-1 | Stratum-2 |
|---|---|---|
| 36 | 36 | |
| 18 | 18 | |
| 9 | 9 | |
| 24 | 24 | |
| 1986 | 2055 | |
| 168.500 | 171.500 | |
| 10,484.500 | 10,494.500 | |
| 438.519 | 452.713 | |
| 0.002463666 | 0.002315051 | |
| 0.00004033736 | 0.00004086913 | |
| 0.912 | 0.519 | |
| 193.438 | 195.750 | |
| 432.500 | 431.500 | |
| 2.106 | 2.345 | |
| 218.375 | 220 | |
| 127.125 | 132.500 | |
| 193.438 | 99 | |
| 252.25 | 265 | |
| 1005 | 1039.500 |
| Parameter | Stratum-1 | Stratum-2 |
|---|---|---|
| 36 | 36 | |
| 18 | 18 | |
| 9 | 9 | |
| 388 | 84 | |
| 1534 | 478 | |
| 1016.500 | 206 | |
| 116,230 | 49,661 | |
| 402.609 | 424.937 | |
| 0.000951993 | 0.004094403 | |
| 0.00000835 | 0.0000143374 | |
| 0.784 | 0.875 | |
| 891.188 | 210.688 | |
| 982.650 | 231 | |
| 1.008 | 1.023 | |
| 891.875 | 215.375 | |
| 982.650 | 62.875 | |
| 289 | 267 | |
| 378.250 | 125.750 | |
| 961 | 281 |
| Parameter | Stratum-1 | Stratum-2 |
|---|---|---|
| 18 | 18 | |
| 9 | 9 | |
| 5 | 5 | |
| 28 | 15 | |
| 95 | 75 | |
| 60.664 | 44.205 | |
| 52.728 | 46.339 | |
| 10.130 | 12.750 | |
| 0.0004729 | 0.0003791 | |
| 0.0000219 | 0.0000481 | |
| 0.337 | 0.496 | |
| 61.623 | 45.287 | |
| 61.600 | 45.242 | |
| 0.300 | 0.500 | |
| 54.287 | 48.940 | |
| 6.840 | 8.614 | |
| 8.100 | 10.200 | |
| 13.721 | 17.201 | |
| 61.500 | 45 |
| Estimator | PRE-1 | PRE-2 | PRE-3 |
|---|---|---|---|
| 100.00 | 100.00 | 100.00 | |
| 148.51 | 157.36 | 116.52 | |
| 160.24 | 169.78 | 122.08 | |
| 167.91 | 176.53 | 124.05 | |
| 163.65 | 172.35 | 123.11 | |
| 169.13 | 180.24 | 124.01 | |
| 170.29 | 180.83 | 125.49 | |
| 170.78 | 181.37 | 125.63 | |
| 184.56 | 195.54 | 155.16 | |
| 185.62 | 196.90 | 155.59 | |
| 186.74 | 197.68 | 157.65 | |
| 185.95 | 199.03 | 156.85 | |
| 187.21 | 198.14 | 153.18 | |
| 188.27 | 197.77 | 153.83 | |
| 188.13 | 198.15 | 154.77 | |
| 189.67 | 197.92 | 157.64 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Daraz, U.; M. Aljohani, H.; Almulhim, F.A. Optimal Transformation-Based Median Estimation Under Stratified Double Sampling with Limited Auxiliary Information. Symmetry 2026, 18, 933. https://doi.org/10.3390/sym18060933
Daraz U, M. Aljohani H, Almulhim FA. Optimal Transformation-Based Median Estimation Under Stratified Double Sampling with Limited Auxiliary Information. Symmetry. 2026; 18(6):933. https://doi.org/10.3390/sym18060933
Chicago/Turabian StyleDaraz, Umer, Hassan M. Aljohani, and Fatimah A. Almulhim. 2026. "Optimal Transformation-Based Median Estimation Under Stratified Double Sampling with Limited Auxiliary Information" Symmetry 18, no. 6: 933. https://doi.org/10.3390/sym18060933
APA StyleDaraz, U., M. Aljohani, H., & Almulhim, F. A. (2026). Optimal Transformation-Based Median Estimation Under Stratified Double Sampling with Limited Auxiliary Information. Symmetry, 18(6), 933. https://doi.org/10.3390/sym18060933

