Abstract
The objective of this work is to propose the iteratively reweighted least squares concept to form a fiducial generalized pivotal quantity of the between-group variance component for the unbalanced variance components model. The fiducial generalized pivotal quantity is a subclass of the generalized pivotal quantity which is useful technique to deal with problem of nuisance parameters for finding interval estimator. This research provides the probability distribution and the properties of the statistics to lead the constructing of the confidence interval. The authors also prove the construction of the fiducial generalized pivotal quantity through iteratively reweighted least squares. The performance comparison for the new proposed method with other competing methods in the literature is studied through a simulation study. The results of the simulation study demonstrate that the proposed method is very satisfactory in terms of both the coverage probability and the average width of the confidence interval. Furthermore, the analysis of real data for patients of sickle cell disease also illustrates that the proposed method gives the smallest average width of the confidence interval. All these results confirm that the iteratively reweighted least squares fiducial generalized pivotal quantity confidence interval is recommended.
Keywords:
iteratively reweighted least squares; between-group variance component; coverage probability; variance components model; fiducial generalized pivotal quantity MSC:
62F12; 62F25; 62F86
1. Introduction
The variance components model plays a vital role in several fields, such as industrial process management, agricultural genetics, medical treatment, animal breeding studies, etc. The variance components model is also called the random effects model, the conclusions of which can be applied to the entire population level of factors. Consider the variance components model given by
where is the jth random observation associated with the ith group of the random factor, and is the grand mean. The ith group of the random factor, , is distributed as . The jth random error associated with the ith group of the random factor, , is distributed as . Both and are mutually independent. It follows that is distributed as . Here, and are known as variance components. Generally, and are called the between-group variance and within-group variance, respectively. The number of observations is denoted as , and the overall sample size is denoted as . The model (1) is called a balanced model when of each group is equal. Otherwise, it is called an unbalanced model.
The minimal sufficient statistic is an important property for an estimator, which is used to form the confidence interval about the parameter of interest when the probability distribution for the statistic can be obtained [1]. For the balanced random effects model, the minimal sufficient statistics are available in closed-form expression. Nevertheless, this expression is unavailable for the unbalanced random effects model. Moreover, resolving the closed-form expression for the minimal sufficient statistic is rather complicated in the unbalanced case [2]. In this research, we are concerned with the inference of the between-group variance component for the unbalanced variance components model. There are numerous works from the literature that provide a method to make an inference about the variance component for the unbalanced random model, such as those of Ting et al. [3] in 1990 and Hartung and Knapp [4] in 2000, which are based on an asymptotic frequentist method. The favorite method for making an inference about the parameter of interest is a pivotal quantity approach applied by Wald [5] in 1940, Thomas and Hultquist [6] in 1978 and Park and Burdick [7] in 2003. Alternatively, Li and Li [8] in 2005 utilized the method via a fiducial inference using the concept provided by Fisher [9] in 1935. Later, the connection between the pivotal quantity approach and the fiducial inference was presented by Hannig et al. [10] in 2006. According to this connection, the construction for the series of fiducial generalized confidence interval for variance components was provided by Lidong et al. [11] in 2008, and the fiducial generalized pivotal quantity through the least squares concept for finding confidence interval for the parameters of interest was proposed by Liu et al. [12] in 2015.
There are several works in the literature that construct the confidence intervals of in an unbalanced random effects model. For instance, Ting et al. [3] in 1990 and Hartung and Knapp [4] in 2000 applied the classical exact method to construct confidence intervals of . In 2003, Park and Burdick [7] presented the closed-form expression for the minimal sufficient statistic in the unbalanced case and used the idea of the generalized pivotal quantity to construct the confidence interval about the parameter of interest. In 2005, Li and Li [8] applied the fiducial approach to derive the confidence interval of the between-group variance component in unbalanced random effects model. Later, Liu et al. [12] in 2015 utilized the concept of the generalized pivotal quantity and the fiducial approach via the least square idea for obtaining confidence interval of variance components in unbalanced two-component normal mixed linear model. The point of this study is to propose a new iteratively reweighted least squares fiducial generalized pivotal quantity to form the confidence intervals of the between-group variance component for the unbalanced variance components model. The methods for comparing the performance with the proposed method are as follows: the TG method [3], the HK method [4], the PB method [7], the LL method [8], and the LXH method [12].
The organization for the remaining of this research is the following: the idea of the fiducial generalized pivotal quantity for finding confidence intervals is introduced in Section 2. The new iterative reweighted least squares fiducial generalized pivotal quantity to construct the confidence interval of is proposed in Section 3. We perform simulation studies to investigate its performance in Section 4. Section 5 illustrates the application utilizing the new proposed method. A conclusion is presented in Section 6.
2. Review of Fiducial Generalized Pivotal Quantity
The idea of a generalized pivotal quantity is a useful technique to deal with the problem of nuisance parameters for finding interval estimation. The fiducial generalized pivotal quantity (FGPQ) is a subclass of the generalized pivotal quantity, which is based on an extension of the fiducial argument proposed by Fisher [9] in 1935. The FGPQ are extensively used to obtain confidence intervals. For instance, Lidong et al. [11] in 2008 focused on developing fiducial intervals for estimating variance components and intraclass correlation in scenarios with unbalanced data structures. The method of constructing the fiducial intervals was based on an extension of the fiducial approach presented by Fisher [9] in 1935. Moreover, the results of the simulation study showed satisfactory performance in terms of coverage probability and the average width confidence interval. In 2011, Burch [13] investigated confidence intervals for variance components in the unbalanced one-way random effects model under non-normal distribution assumptions. The procedure of developing confidence intervals was constructed by the FGPQ. Later, Liu and Xu [14] in 2015 proposed a new kind of confidence interval for variance components in a one-way random effects model by deriving the combined asymptotic confidence distribution and property of confidence distribution. In addition, the related measures of variance components such as their ratio and intraclass correlation were presented. Herein, the method of Liu and Xu [14] was based on the FGPQ. In 2015, Liu et al. [12] developed the use of least squares for the FGPQ to construct confidence intervals for variance components in a two-component normal mixed linear model. Based on the aforementioned works in the literature, Li and Li [15] in 2007 and Jiratampradab et al. [16] in 2022 compared those methods via the FGPQ for constructing confidence intervals for variance components in an unbalanced one-way random effects model. In this regard, the motivation of this study is to use the FGPQ to obtain the inference about variance components. The definition of the FGPQ derived from Hannig et al. [10] in 2006 is stated below.
Definition 1.
Assume that is a vector of observable random such that the distribution of is indexed through vector of parameter ξ, where and ξ are elements of . Suppose that is a parameter to make inferences, where θ is element of . An independent copy of is represented by . The realized values of and are denoted by and , respectively. A fiducial generalized pivotal quantity for θ is denoted by . The properties of are given as follows:
1. The conditional distribution of is free of ξ, where .
2. For all , which is an element of , .
- The FGPQ for θ is derived from the percentiles of .
3. Confidence Interval for
3.1. Matrix Formulation of Model
In order to obtain the matrix formulation, let an vector represent the observations, and thus the model (1) can be shown as
where is the grand mean, is an vector of ones, is an incidence matrix, is the vector of the random factor with , and is the vector of the random error with . Note that is a vector of zero, and is a identity matrix. Independence among and is assumed. A horizontal concatenation for matrices and is denoted as . Denote and . The matrix of dimension , namely, is satisfied , where is, herein, a Householder matrix. The distribution of in model (2) is given by , and it follows that
The distinct eigenvalues of are such that having multiplicities . The matrix is orthogonal such that , where , is an corresponding to . The independently quadratic forms, defined by
are minimal sufficient statistics for . Let , be mutually independent. The distribution of can be written in terms of and as
where represents the central chi-squared distribution with degrees of freedom.
3.2. Construction of Iteratively Reweighted Least Squares FGPQ
At first, we construct FGPQ for using each , and . Notice that (5) is a pivotal quantity for . Regarding the sum of squares due to within groups, follows the chi-squared distribution with degrees of freedom. According to (4), the minimal sufficient statistics are , . There is an invertible relationship between and . Thus, it is straightforward to provide fiducial inference as described by Hannig et al. [10] in 2006. According to (5), it can be written as
Note that (6) represents the structure of the vector of the observable random in term of a random vector . The FGPQ for can be derived by solving a set of equations with the method of the iteratively reweighted least squares. The sum of squares of the differences between and in (6) with the weights can be expressed as
where , . The weighted least squares solution of , represented by , is formed as in Algorithm 1 with the sequence of weights , , . For a complete iterative procedure, we start with the initial value of as the ordinary least squares solution. The iterations of the reweighted least squares solution continue until the stopping criterion is satisfied, which is specified at 0.001 [17].
⋮
| Algorithm 1 The weighted least squares solution of |
Output: |
1: Initial: . |
2: Repeat: |
3: Update: , . |
4: Update: by minimizing respect to . |
5: Until Convergence . |
6: Return . |
Next, we define the FGPQ for given by
where , is an eigenvalue of in (3), is observable, and is the independent copies of in (5), . Regarding (8), it can be constructed through the iteratively reweighted least squares as proven in Theorem 1.
Theorem 1.
The FGPQ for is in (8).
Proof.
The value of that minimizes D can be obtained by the derivative of (7) with respect to and equating to zero so that the weighted least squares solution of is given by
Let be the independent copies of . Notice that , , and we obtain
The distribution of is independent of . When for , it is proved that , so the requirements of Definition 1 are satisfied. □
3.3. Iteratively Reweighted Least Squares Fiducial Generalized Confidence Intervals for
According to Theorem 1, the FGPQ is denoted by as the solution of . Then, the iteratively reweighted least squares fiducial generalized confidence intervals of can be obtained by . The confidence interval of the between-group variance component () is given by
where and are the and quantiles for the distribution of in (8), respectively. Denote the solutions of and , which are based on the pivotal quantity.
4. Simulation Study
The comparison of the performance for the new proposed method with the previously existing methods is studied through the simulated data. Without loss of generality, the value of in model (2) is set to zero. The values selected for are and . That is, it is defined in the way that varies from small to large in the interval . The nominal level is set at . This process is generated 5000 times for each setting of the values of and sample size. For the efficiency criterion, the coverage probability and the average width of the confidence interval are considered. Generally, we first consider the coverage probability that maintains at the nominal level, and then the average widths of the confidence interval are compared. The best method gives the smallest average width of the confidence interval. The function of subclass frequencies which is based on the design in the experiment, denoted by , is called the measurements of the imbalanced, where , where g is the number of groups and is the number of observations of each group [18]. Note that is the ratio between the harmonic mean and the mean of ; it follows that [19]. Moreover, when is equal to 1, then the model is balanced, and when is close to 0, then the model is very unbalanced. Table 1 presents eight different cases of unbalanced design for simulation.
Table 1.
Cases of unbalanced design for simulation.
The simulation study results are illustrated in Figure 1 and Figure 2. The results in Figure 1 present the coverage probability of the confidence interval for . The results in Figure 2 demonstrate the relative difference of the average width of the confidence interval for . The relative width is used to compare the average widths of the confidence interval of the competing method. This relative width is denoted by , where is the average width of the confidence interval of competing methods and is the average width of the confidence interval of the IRLF method [20]. If the value of the relative width is positive, then it indicates that is smaller than . On the other hand, if the value of the relative width is negative, then it indicates that is smaller than . Furthermore, the relative width equal to zero indicates that is equal to .
Figure 1.
The coverage probability of the 95% confidence interval for .
Figure 2.
Relative difference of the average width of the 95% confidence interval for .
Figure 1 displays the coverage probability for all competing methods with and . The TG, PB, and IRLF methods maintain the nominal level, where , and they obtain higher than the nominal level where . The HK method obtains lower than the nominal level where , and it obtains higher than the nominal level where . The LL and LXH methods obtain higher than the nominal level for all situations.
The comparison of the average width of the confidence interval is shown in Figure 2, it implies that the average width of the IRLF interval is the smallest. The average width of the LXH interval is smaller than that of the other four methods. The average widths of the TG, HK, PB and LL intervals behave similarly.
5. Application
The data from Anionwu et al. [21] in 1981 are of a study of the steady-state hemoglobin levels for patients with various types of sickle cell disease. An interesting question is whether steady-state hemoglobin levels differ significantly between patients with the three different types. The three different types are Hemoglobin SS (HbSS), Hemoglobin S-beta-thalassaemia (HbSβ-thal), and Hemoglobin SC (HbSC). The data are shown in Table 2 [22].
Table 2.
The steady-state hemoglobin levels in each type of sickle cell disease.
Model (1) is used to analyze the experiment of the steady-state hemoglobin levels, that is,
where , , and the measurement of the imbalanced is . Here, denotes the random factor of types of sickle cell disease, and we assume that is normally distributed with a mean of 0 and variance of . represents the jth random error associated with the ith type of sickle cell disease, and we assume that is normally distributed with a mean of 0 and variance of . The independence of and is also assumed. According to Model (2), the vector of the random factor and the vector of the random error are denoted by and , respectively. The distributions of and are given by and , respectively. Moreover, and are independent. Next, we provide the confidence intervals of followed by the procedure of the proposed method and the previous methods in the literature. The six confidence intervals of based on the TG, HK, PB, LL, LXH and IRLF methods are demonstrated in Table 3. It is notably seen that the proposed method provides the shortest confidence interval for between-group variance, .
Table 3.
The 95% confidence interval of for the steady-state hemoglobin levels data.
6. Conclusions
This paper proposes a new method to form the confidence interval of between-group variance in the unbalanced variance components model applying the iteratively reweighted least squares concept combining with fiducial generalized pivotal quantity. The simulation study is performed to collate the newly proposed method with five other methods. The results show that the TG, PB, and IRLF methods mostly maintain the nominal level where is small. The HK method is liberal where is small. Conversely, where is large, the HK method is conservative. The LL method is conservative for all situations. The LXH method is mostly conservative for all . Clearly, the average width of the IRLF interval is the smallest. Other intervals behave similarly in term of the average width. In summary, these results confirms that the iteratively reweighted least squares fiducial generalized pivotal quantity to form the confidence intervals can be applied instead of the previous methods in the literature. Future research could incorporate the enhanced Laplace approximation [23] to construct the confidence interval for the between-group variance in unbalanced variance components model. Also, it can be used to implement hypothesis tests about the parameters of interest. Additionally, the analysis of the dataset further highlights the satisfactory performance of the IRLF method in terms of the smallest confidence interval. This finding confirms that the confidence interval based on the IRLF method is recommended for practical use in applications in various fields, such as industrial process management [24], animal breeding studies [25], education [26], and so on.
Author Contributions
Conceptualization, J.S. and T.S.; Methodology, A.J., J.S. and T.S.; Validation, A.J.; Writing – original draft, A.J. and T.S.; Writing—review & editing, A.J., T.S. and J.S.; Supervision, J.S. and T.S. All authors have read and agreed to the published version of the manuscript.
Funding
This work was financially supported by King Mongkut’s Institute of Technology Ladkrabang Research Fund: KREF186707 and International SciKU Branding (ISB), Faculty of Science, Kasetsart University.
Data Availability Statement
Data derived from public domain resources [British Medical Journal] [https://pubmed.ncbi.nlm.nih.gov/6779988/, accessed on 1 November 2024].
Acknowledgments
The authors thank the reviewers for their valuable comments and suggestions which substantially improved the quality of the article. Additionally, this research was supported by King Mongkut’s Institute of Technology Ladkrabang and Faculty of Science, Kasetsart University.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations and symbols are used in this manuscript:
| Abbreviation | Meaning |
| TG | Ting and others method |
| HK | Hartung–Knapp method |
| PB | Park–Burdick method |
| LL | Li–Li method |
| LXH | Liu–Xu–Hannig method |
| IRLF | iteratively reweighted least squares fiducial generalized pivotal quantity |
| FGPQ | fiducial generalized pivotal quantity |
| Symbol | Meaning |
| the jth random observation associated with the ith group of the random factor | |
| grand mean | |
| the ith group of the random factor | |
| the jth random error associated with the ith group of the random factor | |
| between-group variance | |
| within-group variance | |
| g | the number of groups |
| the number of observations of the ith group | |
| n | the number of the total observations |
| vector of observations | |
| vector of ones | |
| incidence matrix | |
| vector of the random factor | |
| vector of the random error | |
| identity matrix | |
| Householder matrix | |
| measurements of imbalance | |
| fiducial generalized pivotal quantity for | |
| average widths of the confidence interval of the IRLF method | |
| average widths of the confidence interval of competing methods |
References
- Wackerly, D.; Mendenhall, W.; Scheaffer, R.B. Mathematical Statistics with Applications, 7th ed.; Thomson Learning: Chicago, IL, USA, 2008; p. 468. [Google Scholar]
- Searle, S.R.; Casella, G.; McCulloch, C.E.B. Variance Components; John Wiley and Sons: Hoboken, NJ, USA, 2006; p. 38. [Google Scholar]
- Ting, N.; Burdick, R.K.; Graybill, F.A.; Jeyaratnam, S.; Lu, T.F.C. Confidence intervals on linear combinations of variance components that are unrestricted in sign. J. Stat. Comput. Simul. 1990, 35, 135–143. [Google Scholar] [CrossRef]
- Hartung, J.; Knapp, G. Confidence intervals for the between group variance in the unbalanced one-way random effects model of analysis of variance. J. Stat. Comput. Simul. 2000, 65, 311–323. [Google Scholar] [CrossRef]
- Wald, A. A note on the analysis of variance with unequal class frequencies. Ann. Math. Stat. 1940, 11, 96–100. [Google Scholar] [CrossRef]
- Thomas, J.D.; Hultquist, R.A. Interval estimation for the unbalanced case of the one-way random effects model. Ann. Stat. 1978, 6, 582–587. [Google Scholar] [CrossRef]
- Park, D.J.; Burdick, R.K. Performance of confidence intervals in regression models with unbalanced one-fold nested error structures. Commun. Stat.-Simul. Comput. 2003, 32, 717–732. [Google Scholar] [CrossRef]
- Li, X.; Li, G. Confidence intervals on sum of variance components with unbalanced designs. Commun. Stat.-Theory Methods 2005, 34, 833–845. [Google Scholar] [CrossRef]
- Fisher, R.A. The fiducial argument in statistical inference. Ann. Eugen. 1935, 6, 391–398. [Google Scholar] [CrossRef]
- Hannig, J.; Iyer, H.; Patterson, P. Fiducial generalized confidence intervals. J. Am. Stat. Assoc. 2006, 101, 254–269. [Google Scholar] [CrossRef]
- Lidong, E.; Hannig, J.; Iyer, H. Fiducial intervals for variance components in an unbalanced two-component normal mixed linear model. J. Am. Stat. Assoc. 2008, 103, 854–865. [Google Scholar]
- Liu, X.; Xu, X.; Hannig, J. Least squares generalized inferences in unbalanced two component normal mixed linear model. Comput. Stat. 2015, 31, 973–988. [Google Scholar] [CrossRef]
- Burch, B.D. Confidence intervals for variance components in unbalanced one-way random effects model using non-normal distributions. J. Stat. Plan. Inference 2011, 141, 3793–3807. [Google Scholar] [CrossRef]
- Liu, X.; Xu, X. Confidence distribution inferences in one-way random effects model. Test 2015, 25, 59–74. [Google Scholar] [CrossRef]
- Li, X.; Li, G. Comparison of confidence intervals on the among group variance in the unbalanced variance component model. J. Stat. Comput. Simul. 2007, 77, 477–486. [Google Scholar] [CrossRef]
- Jiratampradab, A.; Supapakorn, T.; Suntornchost, J. Comparison of confidence intervals for variance components in an unbalanced one-way random effects model. Stat. Transit. New Ser. 2022, 23, 149–160. [Google Scholar] [CrossRef]
- Seber, G.A.; Wild, C.J.B. Nonlinear Regression; John Wiley and Sons: Hoboken, NJ, USA, 2003; pp. 582–587. [Google Scholar]
- Ahrens, H.; Pincus, R. On two measures of unbalancedness in a one-way model and their relation to efficiency. Biom. J. 1981, 23, 227–235. [Google Scholar] [CrossRef]
- Khuri, A.I. Measures of imbalance for unbalanced models. Biom. J. 1987, 29, 383–396. [Google Scholar] [CrossRef]
- Tornqvist, L.; Vartia, P.; Vartia, Y.O. How should relative changes be measured? Am. Stat. 1985, 39, 43–46. [Google Scholar]
- Anionwu, E.; Walford, D.; Brozovic, M.; Kirkwood, B. Sickle-cell disease in a British urban community. Br. Med J. 1981, 282, 283–286. [Google Scholar] [CrossRef]
- Sahai, H.; Ageel, M.I. The Analysis of Variance: Fixed, Random and Mixed Models; Springer Science and Business Media: New York, NY, USA, 2000; p. 121. [Google Scholar]
- Han, J.; Lee, Y. Enhanced laplace approximation. J. Multivar. Anal. 2024, 202, 105321. [Google Scholar] [CrossRef]
- Montgomery, D.C. Design and Analysis of Experiments; John Wiley and Sons: Hoboken, NJ, USA, 2017; pp. 133–135. [Google Scholar]
- Sahai, H.; Ojeda, M.M. Analysis of Variance for Random Models, Volume 2: Unbalanced Data: Theory, Methods, Applications, and Data Analysis; Springer Science and Business Media: Boston, UK, 2004; pp. 154–155. [Google Scholar]
- Heiberger, R.M.; Holland, B. Statistical Analysis and Data Display an Intermediate Course with Examples in R; Springer: New York, NY, USA, 2015; p. 215. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).