Pretest Estimation for the Common Mean of Several Normal Distributions: In Meta-Analysis Context

Mphekgwana, Peter M.; Kifle, Yehenew G.; Marange, Chioneso S.

doi:10.3390/axioms13090648


by Peter M. Mphekgwana ¹,²,*, Yehenew G. Kifle ³ and Chioneso S. Marange ¹

¹ Department of Statistics, Faculty of Science and Agriculture, Fort Hare University, Alice 5700, South Africa

² Department of Research Administration and Development, University of Limpopo, Polokwane 0727, South Africa

³ Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, MD 21250, USA

* Author to whom correspondence should be addressed.

Axioms 2024, 13(9), 648; https://doi.org/10.3390/axioms13090648

Submission received: 1 July 2024 / Revised: 11 September 2024 / Accepted: 20 September 2024 / Published: 22 September 2024

(This article belongs to the Special Issue Probability, Statistics and Estimation)


Abstract

The estimation of unknown quantities from multiple independent yet non-homogeneous samples has garnered increasing attention in various fields over the past decade. This interest is evidenced by the wide range of applications discussed in recent literature. In this study, we propose a preliminary test estimator for the common mean $(μ)$ of several normal populations with unknown and unequal variances. When prior information suggests that $μ$ might be equal to a reference value $μ_{0}$ for the population mean, a hypothesis test can be conducted: $H_{0} : μ = μ_{0}$ versus $H_{1} : μ \neq μ_{0}$. The initial sample is used to test $H_{0}$, and if $H_{0}$ is not rejected, we become more confident in using our prior information (after the test) to estimate $μ$. However, if $H_{0}$ is rejected, the prior information is discarded. Our simulations indicate that the proposed preliminary test estimator significantly decreases the mean squared error (MSE) compared to unbiased estimators such as the Graybill-Deal (GD) estimator, particularly when $μ$ is close to the hypothesized mean ($μ_{0}$). Furthermore, our analysis indicates that the proposed test estimator outperforms the existing method, particularly for small sample sizes. We advocate for its adoption to improve the accuracy of common mean estimation. Our findings suggest that, through careful application to real meta-analyses, the proposed test estimator shows promising potential.

Keywords:

common mean; pretest; shrinkage; meta-analysis

MSC:

62H15; 62E15; 62F03; 62H10

1. Introduction

As medical knowledge continues to expand rapidly, healthcare providers face significant challenges in thoroughly evaluating and analyzing the data needed to make well-informed decisions [1,2,3]. These challenges are compounded by the variety of findings presented in different studies, which are sometimes conflicting. Meta-analysis, along with research synthesis or integration, has become an effective tool for addressing these issues. It applies rigorous statistical techniques to aggregate the results from multiple individual studies, thereby combining their findings [2,4]. Meta-analysis has also gained widespread attention across numerous scientific fields, such as education, social sciences, and medicine. For example, in education it has been used to consolidate research on the effectiveness of coaching in improving Scholastic Aptitude Test (SAT) scores in both verbal and mathematical sections [5]. In social sciences, it has been used to synthesize studies on gender differences in quantitative, verbal, and visual-spatial abilities [6]. In healthcare, meta-analysis has been particularly valuable during the COVID-19 pandemic, enhancing our understanding of the virus and informing public health strategies [7,8].

The challenge of combining two or more unbiased estimators is a common issue in applied statistics, with significant implications across various fields. A notable example of this problem occurred when Meier [9] was tasked with making inferences about the mean albumin level in plasma protein in human subjects using data from four separate experiments. Similarly, Eberhardt et al. [10] faced a scenario where they needed to draw conclusions about the mean selenium content in non-fat milk powder by integrating results from four different methods across four experiments.

Most of the early research on inference about the common mean $(μ)$ focuses on point estimation and decision-theoretic rules for $μ$. Graybill and Deal [11] were among the first to study the estimation of $μ$. Since then, numerous works have built upon and extended their results [12,13,14,15,16,17,18]; see also the references therein. Meier [9], in turn, developed a method for constructing a confidence interval for $μ$. In addition, refs. [19,20] devised approximate confidence intervals. The properties of such estimators have attracted substantial attention in the literature. Sinha et al. [2] derived an unbiased estimator of the variance of the Graybill-Deal estimator, and Krishnamoorthy and Moore [21] considered it in the prediction problem of linear regression.

In some cases, prior information $(θ_{0})$ on the population mean is available, whether from pre-test information or from historical data. Pretest (preliminary test) and shrinkage estimators leverage such preliminary information to improve the accuracy of parameter estimation. These estimators borrow strength from both the sample data and the prior information, which can result in higher efficiency and reliability than traditional estimators. Bancroft [22] and Stein [23] introduced and extensively examined the preliminary test shrinkage estimator. Their method has influenced numerous advancements and applications in statistics and has established a basis for the use of shrinkage estimators in contemporary statistical practice [24,25,26]. Thompson [27] proposed a shrinkage technique given as

$$ω = q θ_{0} + (1 - q) \hat{θ}, \tag{1}$$

where $\hat{θ}$ is the usual estimator of the parameter $θ$, $θ_{0}$ is the prior guess, and, in the preliminary test setting, $q = 1$ when $H_{0}$ is accepted and $q = 0$ when $H_{0}$ is rejected. The aim was to improve on the usual estimator of the mean by reducing the mean square error (MSE) of the uniformly minimum-variance unbiased estimator (UMVUE) of the population mean. The shrinkage estimator has been observed to perform better than the conventional estimator when the true value of $θ$ is close to the prior guess. Consequently, instead of treating $q$ as a constant in the shrinkage estimator, it is advisable to regard it as a weight ranging between 0 and 1 [27]. In this context, $q$ can be viewed as a continuous function of pertinent statistics, whose value is expected to decrease steadily as the deviation $(\hat{θ} - θ_{0})$ from the reference value increases.

The preliminary test approach has been widely used in statistics [24,25,26]. Khan et al. [24] deployed a preliminary test for estimating the mean of a univariate normal population with an unknown variance. Shih et al. [25] proposed a class of general pretest estimators for the univariate normal mean, which includes numerous existing estimators, such as pretest, shrinkage, Bayes, and empirical Bayes estimators. In the context of meta-analysis, Taketomi et al. [26] proposed simultaneous estimation of individual means using James–Stein shrinkage estimators, which improved upon the individual studies’ estimators. The literature shows that, when prior information is available, shrinkage estimators for parameters of various distributions tend to outperform standard estimators in terms of MSE, especially when the prior guess is close to the true value [22,27,28].

The use of prior information in estimating the common mean has several significant advantages. First, it allows researchers to leverage past knowledge, which may come from historical data, expert opinion, or preliminary investigations, to improve the accuracy of the estimation process. Second, estimators that use prior information tend to strike a compromise between bias and variance, resulting in estimates that, although possibly biased, are more efficient than traditional estimators, particularly with small sample sizes. However, few studies on point estimation of $μ$ have proposed preliminary test-based estimators. Therefore, in this study we propose a preliminary test estimator for the common mean with unknown and unequal variances. The properties of the proposed preliminary test estimator are examined, including its theoretical basis and performance criteria such as bias and MSE.

2. Background

To define the current problem, we assume there are $k$ independent normal populations with a common mean $(μ)$, but with unknown and potentially unequal variances $σ_{1}^{2}, \dots, σ_{k}^{2} > 0$. We assume we have independent and identically distributed (i.i.d.) observations $X_{i 1}, \dots, X_{i n_{i}}$ from $N (μ, σ_{i}^{2})$, $i = 1, 2, \dots, k$, and we define the sample mean ${\bar{X}}_{i}$ and the sample variance $S_{i}^{2}$ as

$${\bar{X}}_{i} = \frac{1}{n_{i}} \sum_{j = 1}^{n_{i}} X_{i j}, \tag{2}$$

$$S_{i}^{2} = \frac{1}{n_{i} - 1} \sum_{j = 1}^{n_{i}} {(X_{i j} - {\bar{X}}_{i})}^{2}, \tag{3}$$

where ${\bar{X}}_{i} \sim N (μ, σ_{i}^{2} / n_{i})$ and $(n_{i} - 1) S_{i}^{2} \sim σ_{i}^{2} χ_{n_{i} - 1}^{2}$. Note that the statistics $\{{\bar{X}}_{i}, S_{i}^{2}, i = 1, \dots, k\}$ are all mutually independent. Moreover, $\{{\bar{X}}_{1}, S_{1}^{2}, {\bar{X}}_{2}, S_{2}^{2}, \dots, {\bar{X}}_{k}, S_{k}^{2}\}$ are minimal sufficient statistics for $(μ, σ_{1}^{2}, σ_{2}^{2}, \dots, σ_{k}^{2})$ but are not complete [29]. As a result, one cannot obtain the uniformly minimum variance unbiased estimator (UMVUE) of $μ$, if it exists, by applying the standard Rao-Blackwell theorem to an unbiased estimator. When the $k$ population variances are fully known, $μ$ can be readily estimated as

$$\hat{μ} = \left(\sum_{i = 1}^{k} \frac{n_{i}}{σ_{i}^{2}} {\bar{X}}_{i}\right) \Big/ \left(\sum_{i = 1}^{k} \frac{n_{i}}{σ_{i}^{2}}\right), \tag{4}$$

with variance

$$\mathrm{Var} (\hat{μ}) = \frac{1}{\sum_{i = 1}^{k} (n_{i} / σ_{i}^{2})} . \tag{5}$$

This estimator, $\hat{μ}$, is the UMVUE, the best linear unbiased estimator (BLUE), and the maximum likelihood estimator (MLE). In the context of our current problem, where the population variances are unknown and possibly unequal, the most appealing unbiased estimator for $μ$ is the Graybill-Deal (GD) estimator [11], which is

$${\hat{μ}}_{GD} = \left(\sum_{i = 1}^{k} \frac{n_{i}}{S_{i}^{2}} {\bar{X}}_{i}\right) \Big/ \left(\sum_{i = 1}^{k} \frac{n_{i}}{S_{i}^{2}}\right), \tag{6}$$

with variance

$$\mathrm{Var} ({\hat{μ}}_{GD}) = E \left[\sum_{i = 1}^{k} \left(\frac{n_{i} σ_{i}^{2}}{S_{i}^{4}}\right) \Big/ {\left(\sum_{i = 1}^{k} \frac{n_{i}}{S_{i}^{2}}\right)}^{2}\right] . \tag{7}$$

In the case of two samples, Graybill and Deal [11] first demonstrated that the unbiased estimator ${\hat{μ}}_{GD}$ in Equation (6) has a lower variance than either sample mean, provided that both sample sizes exceed 10.
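As an illustration of Equation (6), the following is a minimal sketch in Python (the paper's own computations were done in R). The function name `graybill_deal` and the toy samples are hypothetical and chosen only so the arithmetic can be checked by hand.

```python
def graybill_deal(samples):
    """Graybill-Deal estimate of the common mean, following Equation (6).

    `samples` is a list of lists; each inner list holds the observations
    of one study and must contain at least two values.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for x in samples:
        n_i = len(x)
        xbar_i = sum(x) / n_i
        # Unbiased sample variance S_i^2 with divisor n_i - 1, Equation (3).
        s2_i = sum((v - xbar_i) ** 2 for v in x) / (n_i - 1)
        weight = n_i / s2_i          # n_i / S_i^2
        weighted_sum += weight * xbar_i
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical toy data: means 2 and 4, sample variances 1 and 4.
toy = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0]]
print(graybill_deal(toy))  # weights 3 and 0.75 give (6 + 3) / 3.75
```

The more precise study (smaller sample variance) receives the larger weight, so the estimate is pulled toward its sample mean.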

Khatri and Shah [30] proposed an exact variance formula for ${\hat{μ}}_{GD}$, which is complex and not easily applied. To tackle this inference issue, Meier [9] derived a first-order approximation of the variance of ${\hat{μ}}_{GD}$, given by

$$\mathrm{Var} ({\hat{μ}}_{GD}) = {\left[\sum_{i = 1}^{k} \frac{n_{i}}{σ_{i}^{2}}\right]}^{- 1} \left[1 + 2 \sum_{i = 1}^{k} \frac{1}{n_{i} - 1} c_{i} (1 - c_{i}) + O \left(\sum_{i = 1}^{k} \frac{1}{{(n_{i} - 1)}^{2}}\right)\right],$$

where $c_{i} = \frac{n_{i} / σ_{i}^{2}}{\sum_{j = 1}^{k} n_{j} / σ_{j}^{2}}$.

A few years later, Sinha [31] developed an unbiased estimator for the variance of ${\hat{μ}}_{GD}$ that takes the form of a convergent series. A first-order approximation of this estimator is

$${\widehat{\mathrm{Var}}}_{(1)} ({\hat{μ}}_{GD}) = \frac{1}{\sum_{i = 1}^{k} n_{i} / s_{i}^{2}} \left[1 + \sum_{i = 1}^{k} \frac{4}{n_{i} + 1} \left(\frac{n_{i} / s_{i}^{2}}{\sum_{j = 1}^{k} n_{j} / s_{j}^{2}} - \frac{n_{i}^{2} / s_{i}^{4}}{{(\sum_{j = 1}^{k} n_{j} / s_{j}^{2})}^{2}}\right)\right] .$$

The above estimator is comparable to Meier’s [9] approximate estimator, defined as

$${\widehat{\mathrm{Var}}}_{(2)} ({\hat{μ}}_{GD}) = \frac{1}{\sum_{i = 1}^{k} n_{i} / s_{i}^{2}} \left[1 + \sum_{i = 1}^{k} \frac{4}{n_{i} - 1} \left(\frac{n_{i} / s_{i}^{2}}{\sum_{j = 1}^{k} n_{j} / s_{j}^{2}} - \frac{n_{i}^{2} / s_{i}^{4}}{{(\sum_{j = 1}^{k} n_{j} / s_{j}^{2})}^{2}}\right)\right] .$$

The “classical” meta-analysis variance estimator, ${\widehat{\mathrm{Var}}}_{(3)}$, is given as

$${\widehat{\mathrm{Var}}}_{(3)} ({\hat{μ}}_{GD}) = \frac{1}{\sum_{i = 1}^{k} n_{i} / s_{i}^{2}} .$$

The approximate variance estimator proposed by Hartung [32], ${\widehat{\mathrm{Var}}}_{(4)}$, is given as

$${\widehat{\mathrm{Var}}}_{(4)} ({\hat{μ}}_{GD}) = \frac{1}{k - 1} \sum_{i = 1}^{k} \frac{n_{i} / s_{i}^{2}}{\sum_{j = 1}^{k} n_{j} / s_{j}^{2}} {({\bar{X}}_{i} - {\hat{μ}}_{GD})}^{2} .$$
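The four variance estimators above differ only in a correction factor, which a short sketch can make concrete. The code below is an illustrative Python sketch, not the authors' implementation; the function name, the dictionary keys, and the input values are assumptions. It uses the identity that the bracketed difference in the first two estimators equals $\hat{c}_{i} - \hat{c}_{i}^{2}$ with $\hat{c}_{i} = (n_{i} / s_{i}^{2}) / \sum_{j} n_{j} / s_{j}^{2}$.

```python
def gd_variance_estimates(n, s2, xbar):
    """Four estimates of Var(mu_hat_GD) from summary statistics.

    n, s2, xbar: equal-length lists of sample sizes, sample variances
    and sample means (one entry per study, at least two studies).
    """
    k = len(n)
    w = [n_i / s2_i for n_i, s2_i in zip(n, s2)]      # n_i / s_i^2
    total = sum(w)
    c = [w_i / total for w_i in w]                    # estimated weights c_i
    mu_gd = sum(c_i * x_i for c_i, x_i in zip(c, xbar))
    classical = 1.0 / total                           # Var_(3)
    sinha = classical * (1 + sum(4 / (n_i + 1) * (c_i - c_i ** 2)
                                 for n_i, c_i in zip(n, c)))   # Var_(1)
    meier = classical * (1 + sum(4 / (n_i - 1) * (c_i - c_i ** 2)
                                 for n_i, c_i in zip(n, c)))   # Var_(2)
    hartung = sum(c_i * (x_i - mu_gd) ** 2
                  for c_i, x_i in zip(c, xbar)) / (k - 1)      # Var_(4)
    return {"sinha": sinha, "meier": meier,
            "classical": classical, "hartung": hartung}


# Hypothetical summary statistics for two small studies.
est = gd_variance_estimates(n=[3, 3], s2=[1.0, 4.0], xbar=[2.0, 4.0])
print(est)
```

Because $4 / (n_{i} - 1) > 4 / (n_{i} + 1)$, the Meier-type estimate is never smaller than the Sinha-type one, and both are at least as large as the classical estimate.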

3. Proposed Preliminary Test Estimator

It is reasonable to test a null hypothesis when uncertain non-sample prior information is available. A preliminary test estimator is a two-step procedure that estimates a key parameter using the outcome of a preliminary test. To estimate $μ$, we consider the hypotheses

$$H_{0} : μ = μ_{0} \quad \text{vs.} \quad H_{1} : μ \neq μ_{0} . \tag{8}$$

Our proposed preliminary test estimator for $μ$ is as follows:

$${\hat{μ}}_{PT} = \begin{cases} μ_{0}, & \text{if } H_{0} \text{ is accepted}, \\ {\hat{μ}}_{GD} = \left(\sum_{i = 1}^{k} \frac{n_{i}}{s_{i}^{2}} {\bar{X}}_{i}\right) \Big/ \left(\sum_{i = 1}^{k} \frac{n_{i}}{s_{i}^{2}}\right), & \text{if } H_{0} \text{ is rejected}, \end{cases} \tag{9}$$

where ${\hat{μ}}_{GD}$ is an unbiased estimator of $μ$. Shih et al. [25] defined a test function $a : R \mapsto [0, 1]$ with $a = 0$ (accept $H_{0}$), $a = 1$ (reject $H_{0}$), and $0 < a < 1$ (reject $H_{0}$ with probability $a$). For $0 \leq α_{1} \leq α_{2} \leq 1$, the randomized test is defined as

$$a (X) = \begin{cases} 1, & \text{if } | t_{obs} | > t_{α_{1} / 2, n - 1}, \\ q (X), & \text{if } t_{α_{2} / 2, n - 1} < | t_{obs} | \leq t_{α_{1} / 2, n - 1}, \\ 0, & \text{if } | t_{obs} | \leq t_{α_{2} / 2, n - 1} . \end{cases}$$

The decision to reject or not reject $H_{0}$ is based on the t statistic. The standard notation for a t statistic based on a sample of size $n$ is $t_{obs} = \sqrt{n} (\bar{x} - μ_{0}) / s$. We refer to this value, computed from a specific set of data, as the observed value of the test statistic, and reject $H_{0}$ when $| t_{obs} | > t_{α / 2, ν}$, where $ν = n - 1$ is the degrees of freedom and $α$ is the Type-I error level. A test of $H_{0}$ based on a p-value, on the other hand, uses $P_{obs} = P [| t_{ν} | > | t_{obs} |]$, and we reject $H_{0}$ at level $α$ if $P_{obs} < α$. Here $t_{ν}$ stands for the central t variable with $ν$ degrees of freedom, $t_{α / 2, ν}$ stands for the upper $α / 2$ percentile of $t_{ν}$, and we let $t_{0, ν} \equiv \infty$ as usual. The general preliminary test estimator [33] can be defined as

$${\hat{μ}}_{GPT} = a (X) {\hat{μ}}_{GD} + [1 - a (X)] μ_{0} .$$

Substituting the definition of $a (X)$, the estimator can also be written as

$${\hat{μ}}_{GPT} = μ_{0} + ({\hat{μ}}_{GD} - μ_{0}) I (| t_{obs} | > t_{α_{1} / 2, n - 1}) + q (X) ({\hat{μ}}_{GD} - μ_{0}) I (t_{α_{2} / 2, n - 1} < | t_{obs} | \leq t_{α_{1} / 2, n - 1}) . \tag{10}$$

In this study, we focus only on the case where $α_{1} = α_{2} = α$. We define our proposed preliminary test estimator with unknown variance as

$${\hat{μ}}_{PT} = μ_{0} + ({\hat{μ}}_{GD} - μ_{0}) I (| t_{ran} | > t_{α / 2, n - 1}), \tag{11}$$

where $I (\cdot)$ is the indicator function, defined as $I (A) = 1$ if $A$ is true and $I (A) = 0$ if $A$ is false. A random p-value, which has a $Uniform (0, 1)$ distribution under $H_{0}$, is defined as $P_{ran} = P [| t_{ν} | > | t_{ran} |]$, where $t_{ran} = \sqrt{n} (\bar{X} - μ_{0}) / S$. Most suggested tests for $H_{0}$ are based on the $P_{obs}$ and $t_{obs}$ values. To simplify the notation, we denote $P_{obs}$ by lowercase $p$ and $P_{ran}$ by uppercase $P$. In our context, we have independent t statistics, $t_{1}, \dots, t_{k}$, and independent p-values, $P_{1}, \dots, P_{k}$, one from each of the $k$ samples. In the following, we describe several procedures for testing $μ = μ_{0}$ based on suitable combinations of the $t_{i}$'s and $P_{i}$'s [34]. The rejection set $A$ is determined by the chosen test procedure and is used to compute the bias and MSE of the preliminary test estimator of the common mean $μ$.

3.1. P-Value Based Exact Tests

Suppose $P_{1}, \dots, P_{k}$ are independent p-values obtained from $k$ test statistics with continuous distributions. When the individual hypothesis $H_{0 i}$ is true, $P_{i}$ is uniformly distributed over the interval $[0, 1]$. We test the joint null hypothesis $H_{0} : μ = μ_{0}$ versus $H_{1} : μ \neq μ_{0}$. Five exact p-value-based tests from the literature, built on the $t_{obs}$ values and p-values from the $k$ independent studies, are listed below.

3.1.1. Tippett’s Test

Let $P_{(1)} \leq \dots \leq P_{(k)}$ be the ordered values of the independent p-values. Then $H_{0}$ is rejected if $P_{(1)} < α^{'}$. If the overall significance level is $α$, then $α^{'} = 1 - {(1 - α)}^{\frac{1}{k}}$. Interestingly, this test is equivalent to the test based on $M_{T} = \max_{1 \leq i \leq k} | t_{i} |$ suggested by Cohen and Sackrowitz [35]. The test, based on $S_{T} = \min (P_{1}, \dots, P_{k}) = P_{(1)}$, was proposed by Tippett [36] and is also called the union-intersection test.
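Tippett's rule needs only the smallest p-value and the adjusted level $α^{'}$. A minimal Python sketch follows; the function names and the example p-values are hypothetical.

```python
def tippett_threshold(k, alpha=0.05):
    """Per-study level alpha' = 1 - (1 - alpha)^(1/k) for k independent p-values."""
    return 1.0 - (1.0 - alpha) ** (1.0 / k)


def tippett_reject(p_values, alpha=0.05):
    """Reject H0 when the smallest p-value falls below alpha'."""
    return min(p_values) < tippett_threshold(len(p_values), alpha)


# With k = 4 and alpha = 0.05 the threshold is roughly 0.0127.
print(tippett_threshold(4))
print(tippett_reject([0.010, 0.50, 0.60, 0.70]))  # smallest p-value is below the threshold
print(tippett_reject([0.020, 0.50, 0.60, 0.70]))  # smallest p-value is above the threshold
```

The threshold is chosen so that, under $H_{0}$, the probability that at least one of $k$ independent uniform p-values falls below $α^{'}$ is exactly $α$.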

3.1.2. Wilkinson’s Test

Wilkinson [37] provided a generalization of Tippett’s test that uses the $r$-th smallest p-value, $P_{(r)}$, among the ordered p-values $P_{(1)} \leq P_{(2)} \leq \dots \leq P_{(k)}$ as the test statistic. The common mean null hypothesis $H_{0} : μ = μ_{0}$ will be rejected if $P_{(r)} < d_{r, α}$, where $P_{(r)}$ follows a beta distribution with parameters $r$ and $k - r + 1$ under the null hypothesis and $d_{r, α}$ satisfies $\Pr (P_{(r)} < d_{r, α} \mid H_{0}) = α$. This generates a family of tests, one for each value of $r = 1, 2, \dots, k$.
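The critical value $d_{r, α}$ can be found numerically. The sketch below, in Python with hypothetical function names, relies on the standard fact that the Beta$(r, k - r + 1)$ distribution function of the $r$-th order statistic of $k$ uniforms equals a binomial tail sum, and solves for $d_{r, α}$ by bisection. It is one possible approach under those assumptions, not the procedure used in the paper.

```python
from math import comb


def order_stat_cdf(x, r, k):
    """P(P_(r) <= x) under H0: at least r of k uniform p-values are <= x."""
    return sum(comb(k, j) * x ** j * (1 - x) ** (k - j) for j in range(r, k + 1))


def wilkinson_critical_value(r, k, alpha=0.05, tol=1e-12):
    """Solve order_stat_cdf(d, r, k) = alpha for d by bisection on [0, 1]."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if order_stat_cdf(mid, r, k) < alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def wilkinson_reject(p_values, r, alpha=0.05):
    """Reject H0 when the r-th smallest p-value is below d_{r, alpha}."""
    p_r = sorted(p_values)[r - 1]
    return p_r < wilkinson_critical_value(r, len(p_values), alpha)


print(wilkinson_critical_value(1, 4))   # r = 1 reproduces Tippett's alpha'
print(wilkinson_reject([0.01, 0.02, 0.03, 0.90], r=2))
```

For $r = 1$ the critical value coincides with Tippett's $α^{'}$, and for $r = k$ it equals $α^{1 / k}$, which gives two easy checks of the numerical solution.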

3.1.3. Inverse Normal Test

The inverse normal test procedure of Stouffer et al. [38] transforms the p-values into the corresponding standard normal quantiles. The test statistic is defined as $Z = \frac{1}{\sqrt{k}} \sum_{i = 1}^{k} Φ^{- 1} (P_{i})$, where $Φ$ is the standard normal cumulative distribution function (CDF). The common mean null hypothesis $H_{0} : μ = μ_{0}$ will be rejected if $Z < - z_{α}$, where $z_{α}$ denotes the upper $α$ level cut-off point of the standard normal distribution.
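The inverse normal statistic is straightforward to compute with the standard normal quantile function. A minimal Python sketch, using `statistics.NormalDist` from the standard library; the function name and example p-values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def inverse_normal_test(p_values, alpha=0.05):
    """Return (Z, reject) for the inverse normal combination of p-values.

    Each p-value must lie strictly between 0 and 1, otherwise
    NormalDist.inv_cdf raises an error.
    """
    std_normal = NormalDist()                       # mean 0, standard deviation 1
    k = len(p_values)
    z = sum(std_normal.inv_cdf(p) for p in p_values) / sqrt(k)
    z_alpha = std_normal.inv_cdf(1 - alpha)         # upper alpha cut-off point
    return z, z < -z_alpha


print(inverse_normal_test([0.01, 0.02, 0.03]))  # small p-values give a large negative Z
print(inverse_normal_test([0.50, 0.50]))        # Z is zero, so H0 is not rejected
```

Small p-values map to large negative quantiles, which is why the rejection region is the lower tail of $Z$.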

3.1.4. Fisher’s Inverse $χ^{2}$-Test

Fisher [39] noted that the test statistic $t_{F} = - 2 \sum_{i = 1}^{k} \ln (P_{i}) = - 2 \ln \prod_{i = 1}^{k} P_{i}$ has a $χ^{2}$ distribution with $2 k$ degrees of freedom when $H_{0}$ is true. This procedure uses the product $\prod_{i = 1}^{k} P_{i}$ to combine the $k$ independent p-values. The common mean null hypothesis $H_{0} : μ = μ_{0}$ will be rejected if $t_{F} > χ_{2 k, α}^{2}$, where $χ_{2 k, α}^{2}$ denotes the upper $α$ critical value of the $χ^{2}$-distribution with $2 k$ degrees of freedom.
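Because the degrees of freedom $2 k$ are even, the upper tail of the $χ^{2}$ distribution has the closed form $P (χ_{2 k}^{2} > x) = e^{- x / 2} \sum_{j = 0}^{k - 1} (x / 2)^{j} / j!$, so Fisher's test can be sketched without a statistics library. The Python code below is illustrative; the function names are hypothetical, and rejecting when the combined p-value is below $α$ is equivalent to $t_{F} > χ_{2 k, α}^{2}$.

```python
from math import exp, factorial, log


def chi2_sf_even(x, k):
    """P(chi-square with 2k degrees of freedom > x), closed form for even df."""
    half = x / 2.0
    return exp(-half) * sum(half ** j / factorial(j) for j in range(k))


def fisher_combined(p_values, alpha=0.05):
    """Return (t_F, combined p-value, reject) for Fisher's inverse chi-square test."""
    k = len(p_values)
    t_f = -2.0 * sum(log(p) for p in p_values)    # p-values must be positive
    p_combined = chi2_sf_even(t_f, k)
    return t_f, p_combined, p_combined < alpha


print(fisher_combined([0.03]))         # a single p-value is returned unchanged
print(fisher_combined([0.04, 0.10]))   # two moderately small p-values combined
```

With a single study the combined p-value equals the original p-value, which is a convenient sanity check of the tail formula.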

3.1.5. The Logit Test

This exact test procedure, which transforms each p-value into a logit, was proposed by Mudholkar and George [40]. The test statistic is defined as $G = - \sum_{i = 1}^{k} \ln (P_{i} / [1 - P_{i}]) {(3 / (k π^{2}))}^{1 / 2}$, where $G$ approximately follows Student’s t-distribution with $5 k + 4$ degrees of freedom. The common mean null hypothesis $H_{0} : μ = μ_{0}$ is rejected if $G > z_{1 - α}$.

3.2. Exact Tests

3.2.1. Modified t

Fairweather [41] suggested using a weighted linear combination of the $| t_{i} |$'s, namely $T_{1} = \sum_{i = 1}^{k} w_{1 i} | t_{i} |$, where $w_{1 i} = \frac{{(\mathrm{Var} (| t_{i} |))}^{- 1}}{\sum_{j = 1}^{k} {(\mathrm{Var} (| t_{j} |))}^{- 1}}$, with $\mathrm{Var} (| t_{i} |) = \frac{ν_{i}}{ν_{i} - 2} - {\left[\frac{Γ (\frac{ν_{i} - 1}{2}) \sqrt{ν_{i}}}{Γ (\frac{ν_{i}}{2}) \sqrt{π}}\right]}^{2}$ and $ν_{i} = n_{i} - 1$. The null hypothesis $H_{0} : μ = μ_{0}$ is rejected if $T_{1} > d_{1 α}$, where $\Pr [T_{1} > d_{1 α} \mid H_{0}] = α$, with $d_{1 α}$ computed by simulation.
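The weights and the simulated critical value $d_{1 α}$ can be sketched as follows. This is a hedged Python illustration, not the authors' R code: the function names, the number of replications, and the seed are assumptions, and each $t_{ν}$ variate is generated as a standard normal divided by the square root of an independent $χ_{ν}^{2} / ν$.

```python
import math
import random


def var_abs_t(nu):
    """Var(|t_nu|) = nu / (nu - 2) - (E|t_nu|)^2, valid for nu > 2."""
    mean_abs = (math.gamma((nu - 1) / 2) * math.sqrt(nu)
                / (math.gamma(nu / 2) * math.sqrt(math.pi)))
    return nu / (nu - 2) - mean_abs ** 2


def modified_t_weights(dfs):
    """Inverse-variance weights w_1i, normalised to sum to one."""
    inv = [1.0 / var_abs_t(nu) for nu in dfs]
    total = sum(inv)
    return [v / total for v in inv]


def modified_t_statistic(t_values, dfs):
    """T_1 = sum_i w_1i |t_i|."""
    return sum(w * abs(t) for w, t in zip(modified_t_weights(dfs), t_values))


def simulate_critical_value(dfs, alpha=0.05, reps=20000, seed=0):
    """Monte Carlo estimate of d_1alpha, the upper alpha quantile of T_1 under H0."""
    rng = random.Random(seed)
    weights = modified_t_weights(dfs)
    draws = []
    for _ in range(reps):
        t1 = 0.0
        for w, nu in zip(weights, dfs):
            z = rng.gauss(0.0, 1.0)
            chi2 = rng.gammavariate(nu / 2, 2.0)   # chi-square with nu df
            t1 += w * abs(z / math.sqrt(chi2 / nu))
        draws.append(t1)
    draws.sort()
    return draws[int((1 - alpha) * reps)]


dfs = [9, 14]                       # hypothetical degrees of freedom n_i - 1
print(modified_t_weights(dfs))
print(simulate_critical_value(dfs))
```

With a single study the statistic reduces to $| t_{ν} |$, so the simulated critical value should be close to the usual two-sided t cut-off, which gives a rough check of the simulation.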

3.2.2. Modified F

Jordan and Krishnamoorthy [42] suggested using a weighted linear combination of the $F_{i}$’s, with $F_{i} = t_{i}^{2}$, namely $T_{2} = \sum_{i = 1}^{k} w_{2 i} F_{i}$, where $w_{2 i} = \frac{{(\mathrm{Var} (F_{i}))}^{- 1}}{\sum_{j = 1}^{k} {(\mathrm{Var} (F_{j}))}^{- 1}}$, with $\mathrm{Var} (F_{i}) = \frac{2 ν_{i}^{2} (ν_{i} - 1)}{{(ν_{i} - 2)}^{2} (ν_{i} - 4)}$ for $ν_{i} > 4$. The null hypothesis $H_{0} : μ = μ_{0}$ will be rejected if $T_{2} > d_{2 α}$, where $\Pr [T_{2} > d_{2 α} \mid H_{0}] = α$, with $d_{2 α}$ computed by simulation.

3.3. Properties of the Proposed Preliminary Test Estimator

3.3.1. Bias

The bias of the proposed preliminary test estimator is equal to $E [{\hat{μ}}_{PT} - μ]$, where

$$\begin{aligned} \mathrm{Bias} ({\hat{μ}}_{PT}) &= E [{\hat{μ}}_{PT}] - μ \\ &= E [μ_{0} + ({\hat{μ}}_{GD} - μ_{0}) I (A)] - μ \\ &= μ_{0} E [1 - I (A)] + E [{\hat{μ}}_{GD} I (A)] - μ . \end{aligned} \tag{12}$$

Because both the decision to reject $H_{0}$ and ${\hat{μ}}_{GD}$ depend on the sample means ${\bar{X}}_{i}$ and the sample variances $S_{i}^{2}$, ${\hat{μ}}_{GD}$ and $I (\cdot)$ are not mutually independent, so $E [{\hat{μ}}_{GD} I (A)]$ cannot be factored into a product of expectations.

3.3.2. Mean Square Error

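The MSE of ${\hat{μ}}_{PT}$ can be expressed as

$$\begin{aligned} \mathrm{MSE} ({\hat{μ}}_{PT}) &= E [{({\hat{μ}}_{PT} - μ)}^{2}] \\ &= \mathrm{Var} ({\hat{μ}}_{PT} - μ) + {(E [{\hat{μ}}_{PT} - μ])}^{2} \\ &= \mathrm{Var} ({\hat{μ}}_{PT}) + {(μ_{0} E [1 - I (A)] + E [{\hat{μ}}_{GD} I (A)] - μ)}^{2} \\ &= μ_{0}^{2} \mathrm{Var} [I (A^{c})] + \mathrm{Var} [{\hat{μ}}_{GD} I (A)] + 2 μ_{0} \mathrm{Cov} [I (A^{c}), {\hat{μ}}_{GD} I (A)] + {(μ_{0} E [I (A^{c})] + E [{\hat{μ}}_{GD} I (A)] - μ)}^{2} . \end{aligned} \tag{13}$$

The covariance term is retained because $I (A^{c})$ and ${\hat{μ}}_{GD} I (A)$ are dependent; it vanishes when $μ_{0} = 0$.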

4. Simulation Study

Bias and Mean Squared Error

We now assess how the proposed preliminary test estimator $({\hat{μ}}_{PT})$ performs in terms of bias and MSE. To achieve a high level of accuracy, each simulated bias and MSE value was calculated using $Q = 10^{5}$ replications, which makes for a very large simulation. Note that the MSE and relative efficiency (RE) of the proposed preliminary test estimator are functions of $n_{1}, n_{2}$ and $δ = σ_{1}^{2} / σ_{2}^{2}$, where $n_{1}$ and $n_{2}$ are the sample sizes and $δ$ is the ratio of the two population variances. These extensive computations were carried out using the statistical software R [43]. The simulation procedure for our proposed preliminary test estimator of the common mean is as follows:

Step 1: Select two positive integers $n_{1}$ and $n_{2}$.
Step 2: Generate independent random observations $X_{1 i}, i = 1, \dots, n_{1}$ from $N (μ, σ_{1}^{2})$ and $X_{2 i}, i = 1, \dots, n_{2}$ from $N (μ, σ_{2}^{2})$.
Step 3: Test $H_{0} : μ = μ_{0}$ versus $H_{1} : μ \neq μ_{0}$ at significance level $α$ using the p-value-based and exact tests in Section 3.
Step 4: If we fail to reject $H_{0}$, take ${\hat{μ}}_{PT} = μ_{0}$. If $H_{0}$ is rejected, take ${\hat{μ}}_{PT} = {\hat{μ}}_{GD}$.
Step 5: Assess the proposed estimator using the simulated bias $Q^{- 1} \sum_{q = 1}^{Q} ({\hat{μ}}_{q} - μ)$ and the simulated MSE $Q^{- 1} \sum_{q = 1}^{Q} {({\hat{μ}}_{q} - μ)}^{2}$, where ${\hat{μ}}_{q}$ is the value of ${\hat{μ}}_{PT}$ in the $q$-th replication.
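The procedure above can be sketched as a small Monte Carlo program. The Python code below is an illustration under stated assumptions, not the authors' R code: it uses Fisher's inverse $χ^{2}$ test as the preliminary test, far fewer replications than $Q = 10^{5}$, and a numerically integrated t distribution. All function names, the seed, and the default parameter values are hypothetical.

```python
import math
import random


def t_two_sided_p(t, nu, steps=200):
    """P(|t_nu| > |t|) by Simpson's rule after substituting x = sqrt(nu) * tan(theta)."""
    theta = math.atan(abs(t) / math.sqrt(nu))
    const = math.exp(math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)) / math.sqrt(math.pi)
    h = theta / steps
    total = 1.0 + math.cos(theta) ** (nu - 1)          # endpoints of the integrand
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * math.cos(i * h) ** (nu - 1)
    return min(1.0, max(0.0, 1.0 - 2.0 * const * total * h / 3.0))


def chi2_sf_even(x, k):
    """P(chi-square with 2k degrees of freedom > x), closed form for even df."""
    half = x / 2.0
    return math.exp(-half) * sum(half ** j / math.factorial(j) for j in range(k))


def simulate(mu, mu0=0.0, n=(10, 10), sigma2=(1.0, 1.0), alpha=0.05,
             reps=2000, seed=1):
    """Simulated bias and MSE of the pretest and GD estimators (Steps 1 to 5)."""
    rng = random.Random(seed)
    k = len(n)
    bias_pt = bias_gd = mse_pt = mse_gd = 0.0
    rejections = 0
    for _ in range(reps):
        w_sum = wx_sum = log_p_sum = 0.0
        for n_i, var_i in zip(n, sigma2):
            x = [rng.gauss(mu, math.sqrt(var_i)) for _ in range(n_i)]   # Step 2
            xbar = sum(x) / n_i
            s2 = sum((v - xbar) ** 2 for v in x) / (n_i - 1)
            w_sum += n_i / s2
            wx_sum += n_i / s2 * xbar
            t_i = math.sqrt(n_i) * (xbar - mu0) / math.sqrt(s2)
            # Guard against log(0) when a p-value underflows.
            log_p_sum += math.log(max(t_two_sided_p(t_i, n_i - 1), 1e-300))
        mu_gd = wx_sum / w_sum
        reject = chi2_sf_even(-2.0 * log_p_sum, k) < alpha               # Step 3
        mu_pt = mu_gd if reject else mu0                                 # Step 4
        rejections += reject
        bias_pt += mu_pt - mu
        bias_gd += mu_gd - mu
        mse_pt += (mu_pt - mu) ** 2
        mse_gd += (mu_gd - mu) ** 2
    return {"bias_pt": bias_pt / reps, "bias_gd": bias_gd / reps,       # Step 5
            "mse_pt": mse_pt / reps, "mse_gd": mse_gd / reps,
            "reject_rate": rejections / reps}


# delta = sigma_1^2 / sigma_2^2 = 1.2 with sigma_2^2 = 1, and mu equal to mu0.
print(simulate(mu=0.0, sigma2=(1.2, 1.0)))
```

When $μ = μ_{0}$, the pretest estimator either returns $μ_{0}$ exactly or coincides with ${\hat{μ}}_{GD}$, so its simulated MSE in this sketch cannot exceed that of ${\hat{μ}}_{GD}$; for $μ$ far from $μ_{0}$ the test rejects almost always and the two estimators nearly coincide.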

The bias in Equation (12) can be computed for various values of $δ \in \{0.6, 1.0, 1.2, 2.0\}$, $μ \in \{- 1.0, 0.0, 1.0\}$, and $n_{1}, n_{2} \in \{10, 15, 20, 25, 50, 60, 100\}$. Without loss of generality, in our computed simulated bias and MSE, we set $α = 0.05$, $μ_{0} = 0$ and $σ_{2}^{2} = 1$.

Remark 1.

Table 1, Table 2 and Table 3 provide some illustrative values. Generally, as $δ$ increases, the bias increases in magnitude for unequal sample sizes, whereas for values of $δ$ close to 1 the bias approaches zero. The comparison of the tables also reveals that the magnitude of the bias decreases as the sample size increases. Furthermore, as $μ$ deviates further from $μ_{0}$, the bias becomes larger and depends on $n_{2}$. In particular, when $n > 25$ and $δ < 2$, the bias of our proposed test estimator appears to approach zero. Similarly, as $μ$ deviates further from $μ_{0}$, the MSE becomes larger, while it is independent of $n_{2}$ when $μ = μ_{0}$.

Remark 2.

Table 1, Table 2 and Table 3 illustrate the changes in MSE with respect to both $δ$ and sample size. Specifically, an increase in $δ$ leads to a corresponding increase in MSE. Furthermore, the comparison across the tables shows that the MSE decreases as the sample size grows. The minimum MSE is consistently observed when the hypothesized value coincides with the true value, $μ = μ_{0}$, regardless of the test performed. It is also noteworthy that the MSE values are nearly identical across all p-value-based tests, except for the inverse normal test. On the other hand, the modified exact tests tend to produce higher MSE values than their p-value-based counterparts.

Remark 3.

To evaluate the performance of the proposed preliminary test estimator (${\hat{μ}}_{PT}$) in comparison to the conventional single-stage estimator (${\hat{μ}}_{GD}$), we use equal sample sizes ($n_{1} = n_{2} = n$) and a fixed significance level ($α = 0.05$). As the sample size ($n$) increases, the MSE generally decreases. Notably, when $μ$ is close to the hypothesized mean ($μ_{0}$), the preliminary test estimator outperforms the unbiased estimator across various values of $δ$. The range of values of $μ$ over which the preliminary test estimator excels can be referred to as its effective interval. After reaching a minimum at $μ = 0$, the MSE rises slightly as $μ$ deviates further from $μ_{0}$. This trend is evident in Figure 1a,b, which indicate that for $δ = 1.2$ and $δ = 0.6$ the proposed estimator performs better than the unbiased estimator when $- 0.2 \leq μ \leq 0.2$. For $n = 30$ (Figure 1c,e), the proposed estimator outperforms the unbiased estimator when $- 0.12 \leq μ \leq 0.12$ for $δ = 1.2$ and when $- 0.08 \leq μ \leq 0.08$ for $δ = 0.6$. For $n = 60$ (Figure 1d,f), it outperforms the unbiased estimator when $- 0.12 \leq μ \leq 0.12$ for $δ = 1.2$ and when $- 0.06 \leq μ \leq 0.06$ for $δ = 0.6$. The preliminary test estimator employing the Tippett, Wilkinson ($r = 2$), Fisher’s inverse $χ^{2}$, logit, and modified t tests performs satisfactorily within its effective interval, as indicated by the MSE values. These findings are consistent with the conclusions drawn by Kifle et al. (2021) regarding the efficacy of Fisher’s inverse $χ^{2}$ and modified t tests across various sample sizes and significance levels [33].

5. Application in Biological Research

To demonstrate the practical applicability of the proposed preliminary test estimator, we analyzed data from four experiments used to estimate the percentage of albumin in the plasma protein of normal human subjects. This dataset is reported in Meier [9] and appears in Table 4. For this dataset, previous studies focusing on the testing problem [44,45] have compared various procedures for testing $H_{0} : μ = 59.50$ versus $H_{1} : μ \neq 59.50$.

In our scenario, we can take 59.50 as our non-sample prior information and apply our proposed preliminary test estimator. According to the findings presented in Table 5, the estimated mean (${\hat{μ}}_{PT}$) derived from the p-value-based tests (Tippett’s, Wilkinson’s ($r = 3$ and $r = 4$), inverse normal, and Fisher’s tests) notably incorporates the non-sample prior information.

In our second application of the proposed preliminary test estimator, we analyzed the data from four experiments on the selenium content of non-fat milk powder. This dataset is reported by Eberhardt et al. [10] and appears in Table 6. We compute values of ${\hat{μ}}_{PT}$ for different values of $μ_{0}$, with the sample values held fixed, based on the p-value-based and modified exact tests. The resulting values are shown in Table 7.

The findings presented in Table 7 suggest that when $μ_{0}$ is below 110.00, ${\hat{μ}}_{PT} = {\hat{μ}}_{GD}$. However, when $μ_{0}$ falls within the range of 110.00 to 110.50, Tippett’s, Wilkinson’s ($r = 2$, $r = 3$, and $r = 4$), Fisher’s, and the logit tests do not reject the null hypothesis ($H_{0}$), giving an estimated common mean $(μ)$ equal to 110.00, whereas the other tests reject $H_{0}$, giving an estimated common mean $μ$ equal to 109.60. For $μ_{0} = 111.00$, the tests based on Wilkinson’s ($r = 2$ and $r = 3$) and the modified F tests also fail to reject $H_{0}$, with an estimated $μ$ equal to 110.00. Both the inverse normal test and the modified t test did not reject the null hypothesis for various values of $μ_{0}$. This may be because the inverse normal test transforms p-values into z-scores and combines them, whereas the modified t test adjusts the traditional t test procedure to address specific issues such as heteroscedasticity or small sample sizes.

We do not intend to draw any broad conclusions from the above results, but our simulation results suggest that the proposed preliminary test estimator based on Tippett’s, Wilkinson’s ($r = 2$, $r = 3$, and $r = 4$), Fisher’s, and the logit tests is feasible and could be applied to this specific case if prior information about the population mean is available.

6. Conclusions

The past decade has witnessed increased interest in estimating unknown quantities using data from multiple independent yet non-homogeneous samples. This approach finds application across various domains, as evidenced by the diverse range of applications discussed in the recent book by Sinha et al. [2]. In this study, we introduce a preliminary test estimator that integrates non-sample prior information. Our simulations indicate that the proposed estimator has distinct advantages in certain scenarios, particularly with very small sample sizes and when $σ_{2}^{2}$ exceeds $σ_{1}^{2}$. Notably, the proposed estimator significantly reduces the MSE compared to traditional unbiased estimators, especially when $μ$ is close to $μ_{0}$. Moreover, the proposed estimator based on Tippett’s, Wilkinson’s ($r = 2$, $r = 3$, and $r = 4$), Fisher’s, and logit tests outperforms ${\hat{μ}}_{GD}$, particularly for very small sample sizes. For large sample sizes, the proposed estimator based on the inverse normal and modified F tests showed consistent and dependable performance, with an MSE discrepancy of less than $0.02$ relative to the MSE of the unbiased estimator. We therefore advocate the adoption of the proposed estimator to enhance the accuracy of $μ$ estimation. Nevertheless, no estimator performs best across all scenarios, so it is crucial to select an estimator suited to each specific scenario. The choice of estimator depends on the objectives of the research, which makes it challenging to devise a purely statistical selection strategy. Our findings in this article suggest that, through careful application to real meta-analyses, the proposed estimator exhibits promising potential.

This article primarily considered the general preliminary test estimator in the scenario where $α_{1} = α_{2} = α$. Extensions of this work could explore cases where $α_{1} = 0$ and $α_{2} = 1$ through the introduction of a randomized test, where the probability function $q (\cdot)$ is treated as a shrinkage parameter. Consequently, the proposed estimator would transition to a non-randomized form [25]. Additionally, it is pertinent to highlight that this study focuses on the univariate common mean of multiple normal populations. Future extensions could broaden the scope to encompass multiple responses, such as a bivariate common mean.

Author Contributions

Conceptualization, P.M.M. and Y.G.K.; methodology, P.M.M. and Y.G.K.; software, P.M.M.; validation, P.M.M. and Y.G.K.; formal analysis, P.M.M.; investigation, P.M.M. and Y.G.K.; resources, P.M.M.; data curation, P.M.M.; writing—original draft preparation, P.M.M.; writing—review and editing, P.M.M., Y.G.K. and C.S.M.; visualization, P.M.M.; supervision, Y.G.K. and C.S.M.; project administration, Y.G.K. and C.S.M.; funding acquisition, Y.G.K. and C.S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the University Staff Doctoral Programme (USDP) hosted by the University of Limpopo in collaboration with the University of Maryland Baltimore County. In addition, the first author acknowledges the financial support from the Research and Innovation Department of the University of Fort Hare.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The authors extend their gratitude to Bimal Sinha of the University of Maryland Baltimore County, USA, for his insightful guidance and support. Our heartfelt thanks go to the three reviewers for their excellent comments, which helped us clarify several key points and enhance the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Wang, X.M.; Zhang, X.R.; Li, Z.H.; Zhong, W.F.; Yang, P.; Mao, C. A brief introduction of meta-analyses in clinical practice and research. J. Gene Med. 2021, 23, e3312.
2. Sinha, B.K.; Hartung, J.; Knapp, G. Statistical Meta-Analysis with Applications; John Wiley & Sons: Hoboken, NJ, USA, 2011.
3. Haidich, A.B. Meta-analysis in medical research. Hippokratia 2010, 14, 29.
4. Glass, G.V. Primary, secondary, and meta-analysis of research. Educ. Res. 1976, 5, 3–8.
5. DerSimonian, R.; Laird, N. Evaluating the effect of coaching on SAT scores: A meta-analysis. Harv. Educ. Rev. 1983, 53, 1–15.
6. Hedges, L.V. Advances in statistical methods for meta-analysis. New Dir. Program Eval. 1984, 24, 25–42.
7. Liu, Q.; Qin, C.; Liu, M.; Liu, J. Effectiveness and safety of SARS-CoV-2 vaccine in real-world studies: A systematic review and meta-analysis. Infect. Dis. Poverty 2021, 10, 132.
8. Watanabe, A.; Kani, R.; Iwagami, M.; Takagi, H.; Yasuhara, J.; Kuno, T. Assessment of efficacy and safety of mRNA COVID-19 vaccines in children aged 5 to 11 years: A systematic review and meta-analysis. JAMA Pediatr. 2023, 177, 384–394.
9. Meier, P. Variance of a weighted mean. Biometrics 1953, 9, 59–73.
10. Eberhardt, K.R.; Reeve, C.P.; Spiegelman, C.H. A minimax approach to combining means, with practical examples. Chemom. Intell. Lab. Syst. 1989, 5, 129–148.
11. Graybill, F.A.; Deal, R. Combining unbiased estimators. Biometrics 1959, 15, 543–550.
12. Kubokawa, T. Admissible minimax estimation of a common mean of two normal populations. Ann. Stat. 1987, 1245–1256.
13. Brown, L.D.; Cohen, A. Point and confidence estimation of a common mean and recovery of interblock information. Ann. Stat. 1974, 2, 963–976.
14. Cohen, A.; Sackrowitz, H.B. On estimating the common mean of two normal distributions. Ann. Stat. 1974, 1274–1282.
15. Moore, B.; Krishnamoorthy, K. Combining independent normal sample means by weighting with their standard errors. J. Stat. Comput. Simul. 1997, 58, 145–153.
16. Huang, H. Combining estimators in interlaboratory studies and meta-analyses. Res. Synth. Methods 2023, 14, 526–543.
17. Dong, Y.F.; Chen, W.X.; Xie, M.Y. Best linear unbiased estimators of location and scale ranked set parameters under moving extremes sampling design. Acta Math. Appl. Sin. Engl. Ser. 2023, 39, 222–231.
18. Khatun, H.; Tripathy, M.R.; Pal, N. Hypothesis testing and interval estimation for quantiles of two normal populations with a common mean. Commun. Stat.-Theory Methods 2022, 51, 5692–5713.
19. Marić, N.; Graybill, F.A. Evaluation of a method for setting confidence intervals on the common mean of two normal populations. Commun. Stat.-Simul. Comput. 1979, 8, 53–60.
20. Pagurova, V.I.; Gurskii, V. A confidence interval for the common mean of several normal distributions. Theory Probab. Appl. 1980, 24, 882–888.
21. Krishnamoorthy, K.; Moore, B.C. Combining information for prediction in linear regression. Metrika 2002, 56, 73–81.
22. Bancroft, T.A. On biases in estimation due to the use of preliminary tests of significance. Ann. Math. Stat. 1944, 15, 190–204.
23. Stein, C. Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics, 26–31 December 1954; University of California Press: Berkeley, CA, USA, 1956; Volume 3, pp. 197–207.
24. Khan, S.; Saleh, A.M.E. On the comparison of the pre-test and shrinkage estimators for the univariate normal mean. Stat. Pap. 2001, 42, 451–473.
25. Shih, J.H.; Konno, Y.; Chang, Y.T.; Emura, T. A class of general pretest estimators for the univariate normal mean. Commun. Stat.-Theory Methods 2023, 52, 2538–2561.
26. Taketomi, N.; Konno, Y.; Chang, Y.T.; Emura, T. A meta-analysis for simultaneously estimating individual means with shrinkage, isotonic regression and pretests. Axioms 2021, 10, 267.
27. Thompson, J.R. Some shrinkage techniques for estimating the mean. J. Am. Stat. Assoc. 1968, 63, 113–122.
28. Mphekgwana, P.M.; Kifle, Y.G.; Marange, C.S. Shrinkage Testimator for the Common Mean of Several Univariate Normal Populations. Mathematics 2024, 12, 1095.
29. Pal, N.; Lin, J.J.; Chang, C.H.; Kumar, S. A revisit to the common mean problem: Comparing the maximum likelihood estimator with the Graybill–Deal estimator. Comput. Stat. Data Anal. 2007, 51, 5673–5681.
30. Khatri, C.; Shah, K. Estimation of location parameters from two linear models under normality. Commun. Stat.-Theory Methods 1974, 3, 647–663.
31. Sinha, B.K. Unbiased estimation of the variance of the Graybill-Deal estimator of the common mean of several normal populations. Can. J. Stat. 1985, 13, 243–247.
32. Hartung, J. An alternative method for meta-analysis. Biom. J. J. Math. Methods Biosci. 1999, 41, 901–916.
33. Kifle, Y.G.; Moluh, A.M.; Sinha, B.K. Comparison of local powers of some exact tests for a common normal mean with unequal variances. In Methodology and Applications of Statistics; Springer: Berlin/Heidelberg, Germany, 2021; pp. 77–101.
34. Kifle, Y.G.; Moluh, A.M.; Sinha, B.K. Inference about a Common Mean Vector from Several Independent Multinormal Populations with Unequal and Unknown Dispersion Matrices. Mathematics 2024, 12, 2723.
35. Cohen, A.; Sackrowitz, H. Exact tests that recover interblock information in balanced incomplete blocks designs. J. Am. Stat. Assoc. 1989, 84, 556–559.
36. Tippett, L.H.C. The Methods of Statistics: An Introduction Mainly for Workers in the Biological Sciences; Williams & Norgate: London, UK, 1931.
37. Wilkinson, B. A statistical consideration in psychological research. Psychol. Bull. 1951, 48, 156.
38. Stouffer, S.A.; Suchman, E.A.; DeVinney, L.C.; Star, S.A.; Williams, R.M., Jr. The American Soldier: Adjustment during Army Life. (Studies in Social Psychology in World War II); Princeton University Press: Princeton, NJ, USA, 1949; Volume 1.
39. Fisher, R. Statistical Methods for Research Workers, 4th ed.; Oliver and Boyd: Edinburgh, Scotland; London, UK, 1932.
40. George, E.O.; Mudholkar, G.S. The Logit Method for Combining Tests; Technical Report; Department of Statistics, Rochester University: Rochester, NY, USA, 1979.
41. Fairweather, W.R. A method of obtaining an exact confidence interval for the common mean of several normal populations. J. R. Stat. Soc. Ser. C (Appl. Stat.) 1972, 21, 229–233.
42. Jordan, S.M.; Krishnamoorthy, K. Exact confidence intervals for the common mean of several normal populations. Biometrics 1996, 52, 77–86.
43. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021.
44. Chang, C.H.; Pal, N. Testing on the common mean of several normal distributions. Comput. Stat. Data Anal. 2008, 53, 321–333.
45. Li, X.; Williamson, P.P. Testing on the common mean of normal distributions using Bayesian methods. J. Stat. Comput. Simul. 2014, 84, 1363–1380.

Figure 1. Efficiency of estimator ${\hat{μ}}_{PT}$ based on p-value and modified exact tests with respect to ${\hat{μ}}_{GD}$.

Table 1. Simulated bias (MSE) of the proposed estimator ${\hat{μ}}_{PT}$ for different choices of $δ$ ($n < 25$).

$δ$	$μ$	$(n_{1}, n_{2})$	Tippett	Wilkinson $r = 2$	Inverse Normal	Fisher	Logit	Modified t	Modified F
0.6	−1.0	(10, 10)	−0.0013	−0.0325	−0.0010	−0.0036	−0.0027	0.0006	0.0028
			(0.0105)	(0.0414)	(0.0067)	(0.0078)	(0.0079)	(0.0079)	(0.0066)
		(15, 15)	0.0003	−0.0386	−0.0014	−0.0002	−0.0013	0.0032	0.0004
			(0.0111)	(0.0431)	(0.0067)	(0.0074)	(0.0076)	(0.0080)	(0.0067)
		(20, 20)	−0.1368	−0.0344	−0.0028	−0.0018	0.0012	0.0017	0.0011
			(0.0989)	(0.0431)	(0.0067)	(0.0075)	(0.0073)	(0.0078)	(0.0067)
	0.0	(10, 10)	0.0004	0.0013	−0.0003	−0.0018	−0.0014	−0.0018	−0.0021
			(0.0003)	(0.0003)	(0.0063)	(0.0003)	(0.0003)	(0.0005)	(0.0058)
		(15, 15)	0.0001	0.0007	0.0001	0.0015	0.0000	0.0003	0.0021
			(0.0003)	(0.0003)	(0.0063)	(0.0003)	(0.0003)	(0.0005)	(0.0058)
		(20, 20)	0.0219	0.0004	0.0009	−0.0001	0.0001	−0.0004	0.0004
			(0.0330)	(0.0003)	(0.0063)	(0.0003)	(0.0003)	(0.0006)	(0.0058)
	1.0	(10, 10)	−0.0002	0.0011	−0.0012	−0.0004	−0.0029	0.0002	0.0003
			(0.0067)	(0.0075)	(0.0067)	(0.0067)	(0.0067)	(0.0067)	(0.0067)
		(15, 15)	−0.0028	0.0019	−0.0002	0.0008	−0.0038	0.0000	−0.0003
			(0.0066)	(0.0076)	(0.0067)	(0.0067)	(0.0067)	(0.0066)	(0.0067)
		(20, 20)	−0.0174	0.0017	0.0018	−0.0015	−0.0001	−0.0011	0.0011
			(0.0416)	(0.0085)	(0.0067)	(0.0067)	(0.0067)	(0.0067)	(0.0066)
1.0	−1.0	(10, 10)	−0.0116	−0.0362	−0.0001	−0.0068	−0.0051	−0.0103	−0.0016
			(0.0429)	(0.0540)	(0.0110)	(0.0195)	(0.0186)	(0.0330)	(0.0114)
		(15, 15)	−0.0206	−0.0346	−0.0044	−0.0069	−0.0002	−0.0072	−0.0011
			(0.0397)	(0.0532)	(0.0111)	(0.0195)	(0.0181)	(0.0295)	(0.0112)
		(20, 20)	−0.2090	−0.0375	0.0020	−0.0059	−0.0008	−0.0075	0.0007
			(0.1394)	(0.0536)	(0.0111)	(0.0209)	(0.0187)	(0.0296)	(0.0111)
	0.0	(10, 10)	0.0016	−0.0030	−0.0009	−0.0007	0.0007	−0.0026	−0.0021
			(0.0005)	(0.0005)	(0.0106)	(0.0004)	(0.0004)	(0.0009)	(0.0092)
		(15, 15)	0.0003	−0.0002	0.0024	0.0003	−0.0009	0.0035	−0.0014
			(0.0005)	(0.0005)	(0.0105)	(0.0004)	(0.0004)	(0.0009)	(0.0092)
		(20, 20)	0.0746	0.0026	−0.0001	0.0006	−0.0006	0.0006	−0.0010
			(0.1027)	(0.0005)	(0.0105)	(0.0005)	(0.0004)	(0.0009)	(0.0092)
	1.0	(10, 10)	−0.0021	−0.0006	0.0000	0.0016	0.0004	0.0019	−0.0031
			(0.0116)	(0.0121)	(0.0111)	(0.0111)	(0.0111)	(0.0114)	(0.0111)
		(15, 15)	−0.0055	−0.0016	−0.0010	0.0027	0.0004	0.0034	−0.0029
			(0.0111)	(0.0121)	(0.0111)	(0.0116)	(0.0111)	(0.0114)	(0.0112)
		(20, 20)	−0.0868	0.0023	0.0038	−0.0019	0.0004	−0.0020	−0.0033
			(0.1464)	(0.0122)	(0.0111)	(0.0111)	(0.0111)	(0.0113)	(0.0111)
2.0	−1.0	(10, 10)	−0.0770	−0.0643	0.0012	−0.0297	−0.0292	−0.0798	−0.0024
			(0.1472)	(0.1301)	(0.0222)	(0.0805)	(0.0738)	(0.1763)	(0.0289)
		(15, 15)	−0.0835	−0.0624	0.0021	−0.0295	−0.0216	−0.0727	−0.0026
			(0.1475)	(0.1311)	(0.0221)	(0.0712)	(0.0763)	(0.1786)	(0.0271)
		(20, 20)	−0.2676	−0.0461	0.0008	−0.0324	−0.0307	−0.0726	−0.0007
			(0.1764)	(0.1277)	(0.0221)	(0.0720)	(0.0730)	(0.1830)	(0.0273)
	0.0	(10, 10)	−0.0028	0.0005	0.0044	0.0000	0.0011	−0.0045	0.0003
			(0.0009)	(0.0009)	(0.0210)	(0.0009)	(0.0008)	(0.0019)	(0.0169)
		(15, 15)	−0.0001	0.0020	0.0033	−0.0014	−0.0012	0.0032	0.0038
			(0.0009)	(0.0010)	(0.0210)	(0.0009)	(0.0009)	(0.0018)	(0.0170)
		(20, 20)	0.1862	0.0027	−0.0068	0.0002	−0.0002	0.0032	−0.0012
			(0.2115)	(0.0009)	(0.0210)	(0.0008)	(0.0008)	(0.0018)	(0.0170)
	1.0	(10, 10)	0.0015	−0.0005	−0.0027	0.0038	−0.0033	0.0058	−0.0029
			(0.0279)	(0.0304)	(0.0222)	(0.0226)	(0.0225)	(0.0537)	(0.0221)
		(15, 15)	−0.0035	−0.0003	−0.0006	0.0007	0.0026	0.0049	−0.0003
			(0.0301)	(0.0299)	(0.0222)	(0.0226)	(0.0227)	(0.0484)	(0.0222)
		(20, 20)	−0.1904	0.0075	0.0026	−0.0035	−0.0053	0.0096	0.0000
			(0.2784)	(0.0291)	(0.0222)	(0.0228)	(0.0229)	(0.0470)	(0.0227)

Table 2. Simulated bias (MSE) of the proposed estimator ${\hat{μ}}_{PT}$ for different choices of $δ$ ($n \geq 25$).

$δ$	$μ$	$(n_{1}, n_{2})$	Tippett	Wilkinson $r = 2$	Inverse Normal	Fisher	Logit	Modified t	Modified F
0.6	−1.0	(25, 25)	0.0007	−0.0015	−0.0018	−0.0030	−0.0005	−0.0002	−0.0002
			(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
		(50, 50)	−0.0012	0.0012	−0.0009	0.0003	0.0006	−0.0006	−0.0014
			(0.0017)	(0.0018)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
		(100, 100)	0.0024	0.0018	0.0010	0.0011	−0.0016	−0.0010	−0.0008
			(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
	0.0	(25, 25)	−0.0002	−0.0001	−0.0005	−0.0007	0.0001	0.0011	0.0009
			(0.0001)	(0.0001)	(0.0016)	(0.0001)	(0.0001)	(0.0001)	(0.0016)
		(50, 50)	−0.0004	0.0008	−0.0003	−0.0008	0.0004	−0.0001	−0.0001
			(0.0001)	(0.0001)	(0.0016)	(0.0001)	(0.0001)	(0.0001)	(0.0016)
		(100, 100)	0.0001	−0.0004	0.0012	0.0001	−0.0002	−0.0001	0.0015
			(0.0001)	(0.0001)	(0.0016)	(0.0001)	(0.0001)	(0.0001)	(0.0016)
	1.0	(25, 25)	−0.0010	−0.0002	0.0000	0.0009	0.0001	0.0006	−0.0013
			(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
		(50, 50)	−0.0012	0.0004	−0.0004	0.0000	−0.0004	−0.0003	0.0019
			(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
		(100, 100)	0.0002	0.0016	−0.0018	0.0013	−0.0008	0.0015	0.0007
			(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)	(0.0017)
1.0	−1.0	(25, 25)	0.0001	−0.0011	−0.0019	0.0007	0.0023	0.0022	0.0010
			(0.0029)	(0.0031)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)
		(50, 50)	−0.0010	−0.0036	−0.0024	−0.0005	0.0007	−0.0002	−0.0027
			(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)
		(100, 100)	−0.0014	−0.0015	0.0003	0.0026	0.0003	0.0001	−0.0005
			(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)
	0.0	(25, 25)	0.0002	−0.0005	0.0002	−0.0001	−0.0006	0.0005	0.0006
			(0.0001)	(0.0001)	(0.0027)	(0.0001)	(0.0001)	(0.0002)	(0.0027)
		(50, 50)	0.0002	0.0007	0.0013	−0.0005	−0.0010	0.0015	−0.0050
			(0.0001)	(0.0001)	(0.0027)	(0.0001)	(0.0001)	(0.0002)	(0.0027)
		(100, 100)	−0.0010	0.0000	−0.0012	0.0005	0.0013	0.0002	−0.0013
			(0.0001)	(0.0001)	(0.0027)	(0.0001)	(0.0001)	(0.0002)	(0.0027)
	1.0	(25, 25)	−0.0020	0.0004	−0.0008	0.0011	0.0005	−0.0012	−0.0001
			(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)	(0.0029)
		(50, 50)	0.0001	−0.0025	−0.0007	−0.0001	0.0009	0.0003	−0.0016
			(0.0028)	(0.0029)	(0.0029)	(0.0029)	(0.0028)	(0.0029)	(0.0029)
		(100, 100)	0.0003	−0.0018	0.0017	−0.0014	−0.0006	−0.0007	0.0017
			(0.0029)	(0.0029)	(0.0029)	(0.0028)	(0.0028)	(0.0029)	(0.0029)
2.0	−1.0	(25, 25)	−0.0028	−0.0015	−0.0002	0.0018	−0.0014	−0.0002	0.0017
			(0.0057)	(0.0061)	(0.0057)	(0.0057)	(0.0057)	(0.0065)	(0.0057)
		(50, 50)	−0.0029	0.0022	0.0006	0.0006	−0.0026	0.0011	0.0011
			(0.0057)	(0.0059)	(0.0057)	(0.0057)	(0.0057)	(0.0064)	(0.0057)
		(100, 100)	0.0007	−0.0007	−0.0009	0.0022	0.0037	−0.0027	−0.0029
			(0.0057)	(0.0060)	(0.0057)	(0.0057)	(0.0057)	(0.0064)	(0.0057)
	0.0	(25, 25)	0.0010	0.0012	0.0016	0.0009	−0.0004	−0.0014	0.0003
			(0.0003)	(0.0003)	(0.0054)	(0.0003)	(0.0003)	(0.0004)	(0.0052)
		(50, 50)	0.0008	−0.0002	−0.0017	−0.0010	0.0010	−0.0025	−0.0019
			(0.0003)	(0.0003)	(0.0054)	(0.0003)	(0.0003)	(0.0004)	(0.0051)
		(100, 100)	−0.0011	−0.0010	0.0006	0.0000	0.0008	−0.0001	−0.0031
			(0.0003)	(0.0003)	(0.0054)	(0.0003)	(0.0003)	(0.0004)	(0.0051)
	1.0	(25, 25)	−0.0021	0.0015	−0.0017	−0.0005	0.0022	−0.0040	0.0005
			(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)
		(50, 50)	0.0018	0.0001	0.0002	−0.0018	0.0034	−0.0012	0.0006
			(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)
		(100, 100)	−0.0028	0.0023	−0.0012	0.0002	0.0000	−0.0024	−0.0003
			(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)	(0.0057)

Table 3. Simulated bias (MSE) of the proposed estimator ${\hat{μ}}_{PT}$ for different choices of $δ$ ($n_{1} \neq n_{2}$).

$δ$	$μ$	$(n_{1}, n_{2})$	Tippett	Wilkinson $r = 2$	Inverse Normal	Fisher	Logit	Modified t	Modified F
0.6	−1.0	(10, 25)	0.0018	−0.0380	−0.0004	−0.0007	0.0005	0.0016	0.0043
			(0.0070)	(0.0422)	(0.0071)	(0.0071)	(0.0071)	(0.01)	(0.0135)
		(10, 50)	0.0003	−0.0373	0.0022	0.0004	0.0007	0.0018	0.0039
			(0.0070)	(0.0410)	(0.0071)	(0.0070)	(0.0071)	(0.0100)	(0.0133)
		(25, 10)	−0.0035	−0.0016	0.0009	0.0057	−0.0003	−0.0068	0.0013
			(0.0143)	(0.0173)	(0.0133)	(0.0133)	(0.0134)	(0.0358)	(0.0172)
	0.0	(10, 25)	0.0014	0.0010	−0.0015	0.0005	0.0013	0.0015	0.0030
			(0.0003)	(0.0003)	(0.0067)	(0.0003)	(0.0003)	(0.0007)	(0.0104)
		(10, 50)	−0.0005	0.0010	0.0003	0.0003	−0.0003	−0.0006	−0.0069
			(0.0003)	(0.0003)	(0.0067)	(0.0003)	(0.0003)	(0.0007)	(0.0105)
		(25, 10)	−0.0012	−0.0004	0.0030	0.0006	0.0002	−0.0004	−0.0034
			(0.0005)	(0.0005)	(0.0127)	(0.0005)	(0.0005)	(0.0012)	(0.0132)
	1.0	(10, 25)	0.0010	−0.0006	0.0010	0.0004	−0.0033	0.0004	−0.0001
			(0.0070)	(0.0082)	(0.0071)	(0.0071)	(0.0070)	(0.0100)	(0.0133)
		(10, 50)	−0.0003	0.0037	−0.0019	0.0006	0.0025	0.0012	0.0005
			(0.0071)	(0.0079)	(0.0071)	(0.007)	(0.0070)	(0.0100)	(0.0134)
		(25, 10)	−0.0006	0.0005	0.0006	0.0009	−0.0003	−0.0020	−0.0016
			(0.0133)	(0.0134)	(0.0133)	(0.0133)	(0.0133)	(0.0172)	(0.0171)
1.0	−1.0	(10, 25)	−0.0012	−0.0403	0.0033	−0.0007	−0.0980	−0.0044	−0.0078
			(0.0142)	(0.0476)	(0.0117)	(0.0121)	(0.0123)	(0.0179)	(0.0234)
		(10, 50)	0.0026	−0.0343	−0.0002	−0.0005	−0.0964	0.0013	0.0039
			(0.0143)	(0.0466)	(0.0118)	(0.0122)	(0.0122)	(0.0176)	(0.0235)
		(25, 10)	−0.0012	−0.0094	−0.0032	−0.0031	−0.0010	−0.0499	0.0015
			(0.0240)	(0.0525)	(0.0221)	(0.0226)	(0.0227)	(0.1254)	(0.0309)
	0.0	(10, 25)	0.0009	−0.0006	−0.0003	−0.0001	0.0093	0.0032	−0.0017
			(0.0005)	(0.0006)	(0.0112)	(0.0005)	(0.0005)	(0.0011)	(0.0164)
		(10, 50)	0.0004	−0.0005	0.0003	0.0009	0.0038	−0.0018	0.0001
			(0.0005)	(0.0005)	(0.0112)	(0.0005)	(0.0005)	(0.0011)	(0.0162)
		(25, 10)	0.0009	0.0014	−0.0026	0.0012	0.0015	−0.0033	0.0010
			(0.0009)	(0.0009)	(0.0209)	(0.0008)	(0.0008)	(0.0018)	(0.0203)
	1.0	(10, 25)	−0.0030	0.0003	−0.0015	−0.0018	−0.0033	−0.0002	0.0084
			(0.0117)	(0.0122)	(0.0117)	(0.0118)	(0.0117)	(0.0167)	(0.0222)
		(10, 50)	−0.0002	0.0041	−0.0018	−0.0010	0.0025	−0.0012	0.0010
			(0.0118)	(0.0126)	(0.0117)	(0.0118)	(0.0117)	(0.0166)	(0.0223)
		(25, 10)	0.0055	−0.0030	−0.0011	0.0035	0.0046	−0.0004	−0.0011
			(0.0224)	(0.0239)	(0.0222)	(0.0221)	(0.0223)	(0.0367)	(0.0286)
2.0	−1.0	(10, 25)	−0.0204	−0.0374	−0.0004	−0.0157	−0.0092	−0.0181	0.0036
			(0.0641)	(0.0706)	(0.0235)	(0.0375)	(0.0346)	(0.0797)	(0.0705)
		(10, 50)	−0.0251	−0.0396	−0.0012	−0.0062	−0.0130	−0.0190	−0.0013
			(0.0638)	(0.0671)	(0.0236)	(0.0372)	(0.0352)	(0.0805)	(0.0713)
		(25, 10)	−0.0008	−0.0688	0.0003	0.0142	−0.0092	−0.1897	−0.0054
			(0.0477)	(0.2115)	(0.0444)	(0.0467)	(0.0476)	(0.3748)	(0.0889)
	0.0	(10, 25)	0.0060	0.0011	0.0019	0.0007	−0.0009	0.0000	−0.0001
			(0.0010)	(0.0011)	(0.0224)	(0.0010)	(0.0011)	(0.0022)	(0.0282)
		(10, 50)	−0.0016	0.0016	−0.0018	−0.0007	0.0019	−0.0035	−0.0054
			(0.0010)	(0.0010)	(0.0225)	(0.0011)	(0.0010)	(0.0024)	(0.0285)
		(25, 10)	0.0001	−0.0002	0.0006	−0.0004	0.0019	0.0046	0.0018
			(0.0016)	(0.0018)	(0.0422)	(0.0017)	(0.0017)	(0.0037)	(0.0341)
	1.0	(10, 25)	−0.0031	0.0060	−0.0025	−0.0013	−0.0059	−0.0029	0.0047
			(0.0238)	(0.0243)	(0.0234)	(0.0235)	(0.0235)	(0.0349)	(0.0489)
		(10, 50)	0.0012	−0.0016	0.0019	0.0031	−0.0013	−0.0020	0.0045
			(0.0235)	(0.0254)	(0.0235)	(0.0236)	(0.0235)	(0.0353)	(0.0498)
		(25, 10)	0.0011	0.0069	0.0027	0.0047	−0.0013	0.0642	0.0008
			(0.0445)	(0.0982)	(0.0444)	(0.0447)	(0.0446)	(0.2225)	(0.0619)

Table 4. Albumin in plasma protein.

Experiment	$n_{i}$	Mean	Variance
A	12	62.30	12.99
B	15	60.30	7.84
C	7	59.50	33.43
D	16	61.50	18.51

Table 5. The proposed test estimator for albumin in plasma protein with $μ_{0} = 59.50$.

${\hat{μ}}_{GD}$	${\hat{μ}}_{PT}^{T}$	${\hat{μ}}_{PT}^{W (r = 2)}$	${\hat{μ}}_{PT}^{W (r = 3)}$	${\hat{μ}}_{PT}^{W (r = 4)}$	${\hat{μ}}_{PT}^{IN}$	${\hat{μ}}_{PT}^{F}$	${\hat{μ}}_{PT}^{L}$	${\hat{μ}}_{PT}^{Mt}$	${\hat{μ}}_{PT}^{MF}$
60.99	59.50	60.99	59.50	59.50	59.50	59.50	60.99	60.99	60.99

T: Tippett’s test, W: Wilkinson’s test, $IN$: Inverse normal test, F: Fisher’s inverse $χ^{2}$-test, L: The logit test, $Mt$: Modified t test, $MF$: Modified F test.
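As a rough check on the first column of Table 5, the Graybill-Deal estimate can be recomputed from the summary statistics in Table 4. The sketch below assumes that the "Variance" column holds the sample variances $s_{i}^{2}$ and that the estimator weights each sample mean by $n_{i}/s_{i}^{2}$; the function name `graybill_deal` is illustrative and not taken from the article.

```python
# Graybill-Deal estimate from the summary statistics in Table 4.
# Assumption: the "Variance" column holds the sample variances s_i^2.

def graybill_deal(ns, means, variances):
    """Weighted mean of the sample means with weights n_i / s_i^2."""
    weights = [n / v for n, v in zip(ns, variances)]
    return sum(w * m for w, m in zip(weights, means)) / sum(weights)

# Experiments A, B, C, D from Table 4 (albumin in plasma protein).
ns = [12, 15, 7, 16]
means = [62.30, 60.30, 59.50, 61.50]
variances = [12.99, 7.84, 33.43, 18.51]

mu_gd = graybill_deal(ns, means, variances)
print(round(mu_gd, 2))  # 60.99, the value reported in Table 5
```

Under the same assumptions, applying the function to the selenium data in Table 6 gives approximately 109.60, in line with the ${\hat{μ}}_{GD}$ column of Table 7.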

Table 6. Selenium in non-fat milk powder.

Methods	$n_{i}$	Mean	Variance
Atomic absorption spectrometry	8	105.00	85.71
Neutron activation:
(1.) Instrumental	12	109.75	20.75
(2.) Radiochemical	14	109.50	2.73
Isotope dilution mass spectrometry	8	113.25	33.64

Table 7. The proposed test estimator for selenium in non-fat milk powder for various values of $μ_{0}$.

$μ_{0}$	${\hat{μ}}_{GD}$	${\hat{μ}}_{PT}^{T}$	${\hat{μ}}_{PT}^{W (r = 2)}$	${\hat{μ}}_{PT}^{W (r = 3)}$	${\hat{μ}}_{PT}^{W (r = 4)}$	${\hat{μ}}_{PT}^{IN}$	${\hat{μ}}_{PT}^{F}$	${\hat{μ}}_{PT}^{L}$	${\hat{μ}}_{PT}^{Mt}$	${\hat{μ}}_{PT}^{MF}$
90.00	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60
100.00	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60	109.60
110.00	109.60	110.00	110.00	110.00	110.00	109.60	110.00	110.00	109.60	109.60
110.50	109.60	110.50	110.50	110.50	110.50	109.60	110.50	110.50	109.60	109.60
111.00	109.60	109.60	111.00	111.00	109.60	109.60	109.60	109.60	109.60	111.00

T: Tippett’s test, W: Wilkinson’s test, $IN$: Inverse normal test, F: Fisher’s inverse $χ^{2}$-test, L: The logit test, $Mt$: Modified t test, $MF$: Modified F test.
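Every entry in Table 7 equals either $μ_{0}$ or ${\hat{μ}}_{GD}$, which reflects the pretest rule: keep $μ_{0}$ when $H_{0}$ is not rejected and fall back to the sample-based estimate otherwise. The following is a minimal sketch of that rule for Fisher's combination test, not the authors' implementation. It takes the per-study p-values as given and assumes they are independent with $0 < p_{i} \leq 1$. The function names and the p-values in the usage lines are hypothetical and are not results from the article.

```python
import math

def fisher_combined_pvalue(p_values):
    """Combined p-value P(chi2 with 2k df >= -2 * sum(log p_i)) for k p-values."""
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # Fisher statistic divided by 2
    # For even degrees of freedom 2k the chi-square survival function has the
    # closed form exp(-x/2) * sum_{j=0}^{k-1} (x/2)^j / j!.
    tail_sum = sum(half_stat ** j / math.factorial(j) for j in range(k))
    return math.exp(-half_stat) * tail_sum

def pretest_estimate(p_values, mu0, mu_gd, alpha=0.05):
    """Return mu0 if H0: mu = mu0 is not rejected at level alpha, else mu_gd."""
    rejected = fisher_combined_pvalue(p_values) < alpha
    return mu_gd if rejected else mu0

# Hypothetical p-values, for illustration only.
weak_evidence = [0.5, 0.5, 0.5, 0.5]        # H0 not rejected, prior value kept
strong_evidence = [0.001, 0.01, 0.02, 0.03]  # H0 rejected, prior value discarded

print(pretest_estimate(weak_evidence, mu0=110.0, mu_gd=109.60))
print(pretest_estimate(strong_evidence, mu0=90.0, mu_gd=109.60))
```

Taking p-values as input keeps the sketch independent of how each study-level test is carried out. Swapping in another combination rule, such as Tippett's minimum p-value, would only change the function that produces the combined decision.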

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
