An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem

Hong, Serim; Coelho, Carlos A.; Park, Junyong

doi:10.3390/math10162953

Open AccessArticle

An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem

by

Serim Hong

¹,

Carlos A. Coelho

^2,*

and

Junyong Park

^3,*

¹

College of Liberal Studies, Seoul National University, Seoul 08826, Korea

²

NOVA Math (CMA-FCT/UNL) and Mathematics Department, NOVA School of Science and Technology, NOVA University of Lisbon (FCT/UNL), 2829-516 Caparica, Portugal

³

Department of Statistics, Seoul National University, Seoul 08826, Korea

^*

Authors to whom correspondence should be addressed.

Mathematics 2022, 10(16), 2953; https://doi.org/10.3390/math10162953

Submission received: 4 July 2022 / Revised: 10 August 2022 / Accepted: 13 August 2022 / Published: 16 August 2022

(This article belongs to the Special Issue Mathematical and Computational Statistics and Their Applications)

Download

Browse Figures

Versions Notes

Abstract

:

The Behrens–Fisher problem occurs when testing the equality of means of two normal distributions without the assumption that the two variances are equal. This paper presents approaches based on the exact and near-exact distributions for the test statistic of the Behrens–Fisher problem, depending on different combinations of even or odd sample sizes. We present the exact distribution when both sample sizes are odd and the near-exact distribution when one or both sample sizes are even. The near-exact distributions are based on a finite mixture of generalized integer gamma (GIG) distributions, used as an approximation to the exact distribution, which consists of an infinite series. The proposed tests, based on the exact and the near-exact distributions, are compared with Welch’s t-test through Monte Carlo simulations, in particular for small and unbalanced sample sizes. The results show that the proposed approaches are competent solutions to the Behrens–Fisher problem, exhibiting precise sizes and better powers than Welch’s approach for those cases. Numerical studies show that the Welch’s t-test tends to be a bit more conservative than the test statistics based on the exact or near-exact distribution, in particular when sample sizes are small and unbalanced, situations in which the proposed exact or near-exact distributions obtain higher powers than Welch’s t-test.

Keywords:

Behrens–Fisher problem; near-exact distribution; Welch’s t-test; generalized integer gamma distribution

MSC:

62F03; 62E15; 62E20

1. Introduction

Let us suppose we have two independent random samples

X_{1 i}, i = 1, \dots, n_{1}

and

X_{2 i}, i = 1, \dots, n_{2}

, which are drawn from two normal distributions,

N (μ_{1}, σ_{1}^{2})

and

N (μ_{2}, σ_{2}^{2})

, respectively. The Behrens–Fisher problem occurs when testing the equality of the two means

μ_{1}

and

μ_{2}

based on random samples like these without the assumption that the two variances,

σ_{1}^{2}

and

σ_{2}^{2}

, are equal. [1] showed that a uniformly most powerful test does not exist in this case, and the Behrens–Fisher problem remains one of the unsolved problems of statistics. Many different approaches have been tried to solve this problem. Among those approaches are the fiducial approach proposed by [2,3], which somehow opened the way to the Bayesian approach proposed by [4,5], based on setting independent and locally uniform prior distributions for

μ_{1}, μ_{2}, log σ_{1}, log σ_{2}

. The frequentist approach proposed by [6,7] uses Student’s t distribution with

{(\frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}})}^{2} / (\frac{S_{1}^{4}}{(n_{1} - 1) n_{1}^{2}} + \frac{S_{2}^{4}}{(n_{2} - 1) n_{2}^{2}})

degrees of freedom as the approximate distribution of the Behrens–Fisher statistic.

In this paper, we will obtain the exact and near-exact distributions, both the probability density function and cumulative distribution function, of the Behrens–Fisher statistic,

T^{*} = \frac{{\bar{X}}_{1} - {\bar{X}}_{2}}{\sqrt{\frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}}}},

(1)

under

H_{0} : μ_{1} = μ_{2}

, where

{\bar{X}}_{j} = \frac{1}{n_{j}} \sum_{i = 0}^{n_{j}} X_{j i}

and

S_{j}^{2} = \frac{1}{n_{j} - 1} \sum_{i = 0}^{n_{j}} {(X_{j i} - {\bar{X}}_{j})}^{2}

for

j = 1, 2

, in the form of mixtures of Student’s t distributions multiplied by constants. Particularly for the case when both sample sizes are odd, the exact distribution will be derived in a finite closed form without any unsolved integrals or infinite sums by using the GIG (generalized integer gamma) distribution in [8], which is the distribution of the sum of independent gamma variables with integer shape parameters and nonequal rate parameters. For the other cases, that is, when both sample sizes are even or one of them is even and the other one is odd, the near-exact distribution will be obtained by approximating the exact distribution using a finite mixture of GIG distributions to obtain a more manageable cumulative distribution function. Such exact and near-exact distributions include

σ_{1}^{2}

and

σ_{2}^{2}

as unknown parameters, which have to be estimated, based on the observed samples, being then the p-values obtained from these exact or near-exact distributions with estimated parameters. The results will be compared with Welch’s t-test, one of the most widely used solutions to the problem, through Monte-Carlo simulations for relatively small sample sizes. We will see that the tests based on the exact or near-exact distribution show some advantage in terms of being able to obtain higher power than Welch’s t-test, especially when sample sizes are small and unbalanced and variances are also unbalanced.

This paper is organized as follows: Section 2 presents the exact distribution when both sample sizes are odd; Section 3 provides the near-exact distribution when one sample size is even and the other one odd, and Section 4 presents the exact near-exact distribution of the test statistic when both sample sizes are even. Numerical studies are provided in Section 5 to compare the exact and the near-exact distribution approaches with Welch’s t-test, and concluding remarks are presented in Section 6.

2. The Exact Distribution of the Behrens–Fisher Statistic for Odd-Numbered Sample Sizes

In this section, we present the exact distribution of the Behrens–Fisher statistic in (1), when both

n_{1}

and

n_{2}

are odd. Since

\frac{S_{j}^{2}}{n_{j}} = \frac{σ_{j}^{2}}{n_{j} (n_{j} - 1)} \frac{(n_{j} - 1) s_{j}^{2}}{σ_{j}^{2}}

where

\frac{(n_{j} - 1) s_{j}^{2}}{σ_{j}^{2}} \sim χ_{n_{j} - 1}^{2} (j = 1, 2), W = \frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}}

is the sum of two independent gamma variables, each of which follows the distributions

Γ (\frac{n_{1} - 1}{2}, \frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}})

and

Γ (\frac{n_{2} - 1}{2}, \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}})

, where

Γ (r, λ)

indicates a gamma distribution with shape parameter

r

and rate parameter

λ

.

Now, we can divide the problem into two cases: one is the case of

\frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} \neq \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}}

, and the other is the case

\frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} = \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}}

. When

\frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} \neq \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}}, W

follows a GIG distribution of depth 2 with shape parameters

r_{j} = \frac{n_{j} - 1}{2}

(j = 1, 2)

and rate parameters

λ_{j} = \frac{n_{j} (n_{j} - 1)}{2 σ_{j}^{2}}

(j = 1, 2)

, which are different. Notice that

\frac{n_{1} - 1}{2}

and

\frac{n_{2} - 1}{2}

are both integers because of odd-numbered sample sizes. The probability density function for this distribution is

f_{W} (w) = \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} (\prod_{j = 1}^{2} λ_{j}^{r_{j}}) c_{j, k} w^{k - 1} e^{- λ_{j} w} I (w > 0)

where

c_{j, k}

are given by (11)–(13) in [8]. In this case,

c_{i, r_{i}} = \frac{1}{(r_{i} - 1)!} \prod_{j = 1, j \neq i}^{2} {(λ_{j} - λ_{i})}^{- r_{j}}

and

c_{i, r_{i} - k} = \frac{1}{k} \sum_{j = 1}^{k} \frac{(r_{i} - k + j - 1)!}{(r_{i} - k - 1)!} (\sum_{s = 1, s \neq i}^{2} r_{s} {(λ_{i} - λ_{s})}^{- j}) c_{i, r_{i} - k + j}

for

k = 1, \dots, r_{i} - 1, i = 1, 2

.

As a matter of fact, this probability density function can be rewritten as

f_{W} (w) = \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} \underset{p_{j, k}}{\underset{︸}{(\prod_{j = 1}^{2} λ_{j}^{r_{j}}) c_{j, k} \frac{Γ (k)}{λ_{j}^{k}}}} \underset{probability density function of Γ (k, λ_{j})}{\underset{︸}{\frac{λ_{j}^{k}}{Γ (k)} w^{k - 1} e^{- λ_{j} w}}} I (w > 0)

which is a finite mixture of integer gamma distributions in [9]. Now that we have obtained the exact distribution of W, we can easily get the joint distribution of W and

Y = {\bar{X}}_{1} - {\bar{X}}_{2}

, under

H_{0} : μ_{1} = μ_{2}

. Y follows a normal distribution

N (0, \frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}})

under

H_{0}

, and so, given the independence of W and Y, the joint probability density function of these two random variables is

f_{W, Y} (w, y) = \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} p_{j, k} \frac{λ_{j}^{k}}{Γ (k)} w^{k - 1} e^{- λ_{j} w} \frac{1}{σ} Φ (\frac{y}{σ}) (w > 0, y \in (- \infty, \infty))

where

σ = \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}}

and

Φ

is the probability density function of a standard normal distribution.

Since we want to derive the distribution of

T^{*} = \frac{Y}{\sqrt{W}}

, we need to go further and obtain the joint distribution of

T^{*}

and

V = \sqrt{W}

from

f_{W, Y} (w, y)

by a simple change of variables. From this process, we obtain

\begin{matrix} f_{T^{*}, V} (t, v) & = & \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} p_{j, k} \frac{2 λ_{j}^{k}}{Γ (k)} v^{2 k} e^{- λ_{j} v^{2}} \frac{1}{σ} Φ (\frac{t v}{σ}), \end{matrix}

for

v > 0

and

t \in (- \infty, \infty)

and

\begin{matrix} f_{T^{*}} (t) & = & \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} p_{j, k} \int_{0}^{\infty} 2 \frac{λ_{j}^{k}}{Γ (k)} v^{2 k} e^{- λ_{j} v^{2}} \frac{1}{σ} Φ (\frac{t v}{σ}) d v, \end{matrix}

where

\int_{0}^{\infty} 2 \frac{λ_{j}^{k}}{Γ (k)} v^{2 k} e^{- λ_{j} v^{2}} \frac{1}{σ} Φ (\frac{t v}{σ}) d v

yields the probability density function of a

σ \sqrt{\frac{λ_{j}}{k}} T_{2 k}

random variable, with

T_{2 k}

denoting a Student’s t distribution with

2 k

degrees of freedom.

Hence,

f_{T^{*}} (t)

is a mixture of probability density functions of

σ \sqrt{\frac{λ_{j}}{k}} T_{2 k}

random variables

(k = 1, \dots, r_{j} j = 1, 2)

, with weights

p_{j, k}

. Given that the probability density function of Student’s t variable with n degrees of freedom is given by

f_{T_{n}} (t) = \frac{1}{B (\frac{n}{2}, \frac{1}{2})} \frac{1}{\sqrt{n}} {(1 + \frac{t^{2}}{n})}^{- (n + 1) / 2}

for

t \in (- \infty, \infty)

, the probability density function of

T^{*}

, under

H_{0}

, can be rewritten as

\begin{matrix} f_{T^{*}} (t) & = & \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} p_{j, k} f_{σ \sqrt{λ_{j} / k} T_{2 k}} (t) \\ = & (\prod_{j = 1}^{2} λ_{j}^{r_{j}}) \frac{1}{σ \sqrt{2 π}} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} c_{j, k} Γ (k + \frac{1}{2}) {(\frac{t^{2}}{2 σ^{2}} + λ_{j})}^{- (k + \frac{1}{2})} \end{matrix}

for

t \in (- \infty, \infty)

.

Then, the cumulative distribution function of

T^{*}

would also be a mixture of cumulative distribution functions of

σ \sqrt{\frac{λ_{j}}{k}} T_{2 k} (k = 1, \dots, r_{j} j = 1, 2)

with weights

p_{j, k}

. For Student’s t distribution with even degrees of freedom, the cumulative distribution function is given by

\begin{matrix} F_{T_{2 k}} (t) & = & \frac{1}{2} + \frac{t}{B (k, \frac{1}{2})} \frac{1}{\sqrt{2 k}}_{2} F_{1} (k + \frac{1}{2}, \frac{1}{2}; \frac{3}{2}; - \frac{t^{2}}{2 k}) \\ = & \frac{1}{2} + \frac{t}{B (k, \frac{1}{2})} \frac{1}{\sqrt{2 k}}_{2} F_{1} (1 - k, 1; \frac{3}{2}; - \frac{t^{2}}{2 k}) \\ = & \frac{1}{2} + \frac{Γ (k + \frac{1}{2})}{\sqrt{2 k}} \frac{t}{2} {(1 + \frac{t^{2}}{2 k})}^{\frac{1}{2} - k} \sum_{i = 0}^{k - 1} \frac{{(\frac{t^{2}}{2 k})}^{i}}{Γ (\frac{3}{2} + i) Γ (k - i)} \end{matrix}

for

t \in (- \infty, \infty)

, which is obtained by applying

15.3.3

and

15.4.1

from [10], where

_{2} F_{1}

denotes a Gaussian hypergeometric function.

Thus, the cumulative distribution function of

T^{*}

, under

H_{0}

, can be expressed as

\begin{matrix} F_{T^{*}} (t) = \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} p_{j, k} F_{T_{2 k}} (t / σ \sqrt{\frac{λ_{j}}{k}}) \\ = \frac{1}{2} + \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j}} \frac{(\prod_{j = 1}^{2} λ_{j}^{r_{j}})}{\sqrt{2 σ^{2}}} c_{j, k} \frac{Γ (k) Γ (k + \frac{1}{2})}{λ_{j}} \frac{t}{2} {(\frac{t^{2}}{2 σ^{2}} + λ_{j})}^{\frac{1}{2} - k} \sum_{i = 0}^{k - 1} \frac{{(\frac{t^{2}}{2 σ^{2} λ_{j}})}^{i}}{Γ (\frac{3}{2} + i) Γ (k - i)} \end{matrix}

for

t \in (- \infty, \infty)

.

The distribution of

T^{*}

is much simpler when

\frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} = \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}} .

Since the rate parameters of the two independent gamma variables,

\frac{S_{1}^{2}}{n_{1}}

and

\frac{S_{2}^{2}}{n_{2}}

, are, in this case, equal, W simply follows the gamma distribution

Γ (\frac{n_{1} + n_{2} - 2}{2}, \frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} = \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}})

. Therefore, in this case,

T^{*} \equiv T_{n_{1} + n_{2} - 2}

. This means that the probability density function and cumulative distribution function of

T^{*}

are the same as those of Student’s

t

distribution with

n_{1} + n_{2} - 2

degrees of freedom. The probability density function for this distribution is written as

f_{T^{*}} (t) = \frac{1}{B (\frac{n_{1} + n_{2} - 2}{2}, \frac{1}{2})} \frac{1}{\sqrt{n_{1} + n_{2} - 2}} {(1 + \frac{t^{2}}{n_{1} + n_{2} - 2})}^{- (n_{1} + n_{2} - 1) / 2}

for

t \in (- \infty, \infty)

.

Additionally, as

n_{1} + n_{2} - 2

is an even number when sample sizes are odd-numbered, the cumulative distribution function can be written as

F_{T^{*}} (t) = \frac{1}{2} + \frac{Γ (\frac{n_{1} + n_{2} - 1}{2})}{\sqrt{n_{1} + n_{2} - 2}} \frac{t}{2} {(1 + \frac{t^{2}}{n_{1} + n_{2} - 2})}^{\frac{3 - n_{1} - n_{2}}{2}} \sum_{i = 0}^{\frac{n_{1} + n_{2} - 4}{2}} \frac{{(\frac{t^{2}}{n_{1} + n_{2} - 2})}^{i}}{Γ (\frac{3}{2} + i) Γ (\frac{n_{1} + n_{2} - 2}{2} - i)}

for

t \in (- \infty, \infty)

.

3. The Exact and Near-Exact Distribution of the Behrens–Fisher Statistic for Even-Numbered Sample Sizes

In this section, we present the exact distribution of

T^{*}

when sample sizes are both even. The exact distribution consists of an infinite series, but we provide a near-exact distribution based on a finite mixture of GIG distributions, which yields an approximation to the exact distribution.

3.1. The Exact Distribution

Unlike the case where both sample sizes are odd,

W = \frac{S_{1}^{2}}{n_{1}} + \frac{S_{2}^{2}}{n_{2}}

does not follow a GIG distribution when both sample sizes are even. This is so because the shape parameters for

Γ (\frac{n_{1} - 1}{2}, \frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}})

and

Γ (\frac{n_{2} - 1}{2}, \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}})

are not integers when sample sizes are even. However, we can use the Kummer confluent hypergeometric function to obtain the exact distribution of W for this case.

Given the integral definition of the Kummer confluent hypergeometric function, the probability density function of W, which is the sum of two independent gamma variables with shape parameters

r_{j} = \frac{n_{j} - 1}{2}, j = 1, 2,

(2)

and rate parameters

λ_{j} = \frac{n_{j} (n_{j} - 1)}{2 σ_{j}^{2}}, j = 1, 2,

(3)

can be expressed as

\begin{matrix} f_{W} (w) & = & \frac{λ_{1}^{r_{1}}}{Γ (r_{1})} \frac{λ_{2}^{r_{2}}}{Γ (r_{2})} e^{- λ_{2} w} \int_{0}^{w} e^{(λ_{2} - λ_{1}) s} s^{r_{1} - 1} {(w - s)}^{r_{2} - 1} d s \\ = & \frac{λ_{1}^{r_{1}} λ_{2}^{r_{2}}}{Γ (r_{1} + r_{2})} e^{- λ_{2} w} w^{r_{1} + r_{2} - 1}_{1} F_{1} (r_{1}, r_{1} + r_{2}, (λ_{2} - λ_{1}) w) \end{matrix}

for

w > 0

.

Since

_{1} F_{1} (r_{1}, r_{1} + r_{2}, (λ_{2} - λ_{1}) w) = \sum_{i = 0}^{\infty} \frac{Γ (r_{1} + i)}{Γ (r_{1})} \frac{Γ (r_{1} + r_{2})}{Γ (r_{1} + r_{2} + i)} \frac{{(λ_{2} - λ_{1})}^{i}}{i!} w^{i},

the probability density function of W can be further written as

\begin{matrix} f_{W} (w) & = & \sum_{i = 0}^{\infty} \frac{Γ (r_{1} + i)}{Γ (r_{1})} \frac{λ_{1}^{r_{1}} λ_{2}^{r_{2}}}{Γ (r_{1} + r_{2} + i)} \frac{{(λ_{2} - λ_{1})}^{i}}{i!} w^{r_{1} + r_{2} + i - 1} e^{- λ_{2} w} \\ = & \sum_{i = 0}^{\infty} \underset{p_{i}}{\underset{︸}{\frac{Γ (r_{1} + i)}{Γ (r_{1}) i!} {(\frac{λ_{1}}{λ_{2}})}^{r_{1}} {(1 - \frac{λ_{1}}{λ_{2}})}^{i}}} \underset{p . d . f of Γ (r_{1} + r_{2} + i, λ_{2})}{\underset{︸}{\frac{λ_{2}^{r_{1} + r_{2} + i}}{Γ (r_{1} + r_{2} + i)} w^{r_{1} + r_{2} + i - 1} e^{- λ_{2} w}}}, \end{matrix}

for

w > 0

, which is the probability density function of an infinite mixture of gamma distributions with weights

p_{i}

(i = 0, 1, \dots)

. Now, using a similar approach to the one used in Section 2 for the case of odd-numbered sample sizes, we can obtain the exact probability density function of

T^{*}

under

H_{0}

in the form of an infinite mixture of probability density functions of

σ \sqrt{\frac{λ_{2}}{r_{1} + r_{2} + i}} T_{2 (r_{1} + r_{2} + i)}

distributions, with weights

p_{i}

, for

i = 0, 1, \dots

. The probability density function of

T^{*}

may be then stated as follows:

\begin{matrix} f_{T^{*}} (t) & = & \sum_{i = 0}^{\infty} p_{i} f_{σ \sqrt{\frac{λ_{2}}{r_{1} + r_{2} + i}} T_{2 (r_{1} + r_{2} + i)}} (t) \\ = & \sum_{i = 0}^{\infty} p_{i} \frac{1}{B (r_{1} + r_{2} + i, \frac{1}{2}) \sqrt{2 σ^{2} λ_{2}}} {(1 + \frac{t^{2}}{2 σ^{2} λ_{2}})}^{- (r_{1} + r_{2} + i + \frac{1}{2})} \end{matrix}

for

t \in (- \infty, \infty)

.

Regarding the exact cumulative distribution function of

T^{*}

, under

H_{0}

, this cumulative distribution function would also be an infinite mixture of cumulative distribution functions of

σ \sqrt{\frac{λ_{2}}{r_{1} + r_{2} + i}} T_{2 (r_{1} + r_{2} + i)}

distributions, with weights

p_{i}

, for

i = 0, 1, \dots

. It can be written as

\begin{matrix} F_{T^{*}} (t) \\ = & \sum_{i = 0}^{\infty} p_{i} F_{T_{2 (r_{1} + r_{2} + i)}} (t / σ \sqrt{\frac{λ_{2}}{r_{1} + r_{2} + i}}) \\ = & \frac{1}{2} + \sum_{i = 0}^{\infty} p_{i} \frac{Γ (r_{1} + r_{2} + i + \frac{1}{2})}{\sqrt{2 σ^{2} λ_{2}}} \frac{t}{2} {(1 + \frac{t^{2}}{2 σ^{2} λ_{2}})}^{\frac{1}{2} - r_{i}} \sum_{k = 0}^{r_{i} - 1} \frac{{(\frac{t^{2}}{2 σ^{2} λ_{2}})}^{k}}{Γ (\frac{3}{2} + k) Γ (r_{1} + r_{2} + i - k)} \end{matrix}

(4)

for

t \in (- \infty, \infty)

, where

r_{i} = r_{1} + r_{2} + i

,

p_{i} = \frac{Γ (r_{1} + i)}{Γ (r_{1}) i!} {(\frac{λ_{1}}{λ_{2}})}^{r_{1}} {(1 - \frac{λ_{1}}{λ_{2}})}^{i}

, by applying the expression for

F_{T_{2 k}} (t)

obtained in Section 2.

As the exact cumulative distribution function of

T^{*}

is expressed as an infinite sum when

λ_{1} \neq λ_{2}

, it is not much of a manageable cumulative distribution function. In order to obtain numerical values of this cumulative distribution function in a reasonable amount of time, use of an integer upper bound for the summation in i is required. However, the number of terms required in order to obtain a small enough truncation error is often very large. Hence, a near-exact distribution of

T^{*}

with a manageable cumulative distribution function needs to be obtained for the case where

λ_{1} \neq λ_{2}

so that quantiles and p-values can be computed in a faster and more practical way.

3.2. Near-Exact Distribution

In order to obtain a near-exact distribution for

T^{*}

, based on a finite mixture, we will first obtain a near-exact distribution for W and then derive the distribution of

T^{*}

from this distribution of W.

Let

r_{j}^{*} = \frac{n_{j} - 2}{2}

(j = 1, 2)

, and let

λ_{1} \neq λ_{2}

. Then, the exact characteristic function of W is given by

\begin{matrix} Φ_{W} (t) & = & λ_{1}^{r_{1}} {(λ_{1} - i t)}^{- r_{1}} λ_{2}^{r_{2}} {(λ_{2} - i t)}^{- r_{2}} \\ = & λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}^{*}} {(λ_{2} - i t)}^{- r_{2}^{*}} λ_{1}^{0.5} {(λ_{1} - i t)}^{- 0.5} λ_{2}^{0.5} {(λ_{2} - i t)}^{- 0.5} \end{matrix}

where

i

is the imaginary number with

i^{2} = - 1

. Since

λ_{1}^{0.5} {(λ_{1} - i t)}^{- 0.5} λ_{2}^{0.5} {(λ_{2} - i t)}^{- 0.5}

is the characteristic function of the sum of two independent gamma variables

Γ (0.5, λ_{1})

and

Γ (0.5, λ_{2})

, we can make use of the probability density function of W expressed as an infinite mixture of probability density functions of gamma distributions obtained in Section 3.1 to write

λ_{1}^{0.5} {(λ_{1} - i t)}^{- 0.5} λ_{2}^{0.5} {(λ_{2} - i t)}^{- 0.5} = \sum_{j = 0}^{\infty} p_{j} λ_{2}^{j + 1} {(λ_{2} - i t)}^{- (j + 1)},

where

p_{j} = \frac{Γ (0.5 + j)}{Γ (0.5) j!} {(\frac{λ_{1}}{λ_{2}})}^{0.5} {(1 - \frac{λ_{1}}{λ_{2}})}^{j}

. This leads to

Φ_{W} (t) = λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}^{*}} {(λ_{2} - i t)}^{- r_{2}^{*}} \sum_{j = 0}^{\infty} p_{j} λ_{2}^{j + 1} {(λ_{2} - i t)}^{- (j + 1)} .

Now, for a given integer

m^{*}

, we propose to approximate

Φ_{W} (t)

by

\begin{matrix} Φ_{W}^{*} (t) & = λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}^{*}} {(λ_{2} - i t)}^{- r_{2}^{*}} \sum_{j = 0}^{m^{*}} π_{j} λ_{2}^{j + 1} {(λ_{2} - i t)}^{- (j + 1)} \\ = \sum_{j = 0}^{m^{*}} π_{j} λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}^{*} + j + 1} {(λ_{2} - i t)}^{- (r_{2}^{*} + j + 1)} \end{matrix}

where

π_{j} (j = 0, \dots, m^{*} - 1)

are determined in such a way that satisfies

{\frac{\partial^{h}}{\partial t^{h}} Φ_{W} (t)|}_{t = 0} = {\frac{\partial^{h}}{\partial t^{h}} Φ_{W}^{*} (t)|}_{t = 0} for h = 1, \dots, m^{*},

with

π_{m^{*}} = 1 - \sum_{i = 0}^{m^{*} - 1} π_{i}

. This approximate characteristic function

Φ_{W}^{*} (t)

is, in fact, the characteristic function of a finite mixture (with weights

π_{j}

) of

m^{*} + 1

GIG distributions of depth 2 with integer shape parameters

r_{1}^{*}, r_{2}^{*} + j + 1

(j = 0, \dots, m^{*})

and rate parameters

λ_{1}

and

λ_{2}

, the first

m^{*}

moments of which are the same as those of W.

Hence, a finite mixture of probability density functions of

m^{*} + 1

GIG distributions with weights

π_{j}

for

j = 0, \dots, m^{*}

will then be a near-exact probability density function of W. As the GIG distribution itself is a finite mixture of gamma distributions, as shown in Section 2, this probability density function can be written as

\begin{matrix} f_{W}^{*} (w) & = & \sum_{i = 0}^{m^{*}} π_{i} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j i}^{* *}} (\prod_{j = 1}^{2} λ_{j}^{r_{j i}^{* *}}) c_{j k, i}^{*} w^{k - 1} e^{- λ_{j} w} \\ = & \sum_{i = 0}^{m^{*}} π_{i} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j i}^{* *}} \underset{p_{j k, i}^{*}}{\underset{︸}{(\prod_{j = 1}^{2} λ_{j}^{r_{j i}^{* *}}) c^{*}_{j k, i} \frac{Γ (k)}{λ_{j}^{k}}}} \underset{p . d . f of Γ (k, λ_{j})}{\underset{︸}{\frac{λ_{j}^{k}}{Γ (k)} w^{k - 1} e^{- λ_{j} w}}} \end{matrix}

for

w > 0

, where

r_{j i}^{* *} = \{\begin{matrix} r_{1}^{*} & j = 1 \\ r_{2}^{*} + i + 1 & j = 2 \end{matrix}

for

i = 0, \dots, m^{*}

and

c_{j k, i}^{*}

are defined in the same way as

c_{j, k}

in Section 2, except here we use

r_{j i}^{* *}

instead of

r_{j}

.

From this near-exact probability density function of W, using the same logic as before just like in Section 2 and Section 3.1, we can obtain a near-exact probability density function of

T^{*}

in the form of a finite mixture of probability density function

s

of

σ \sqrt{\frac{λ_{j}}{k}} T_{2 k}

with weights

π_{i} p_{j k, i}^{*}

. Thus, the near-exact probability density function under

H_{0}

is given as follows:

\begin{matrix} f_{T^{*}}^{*} (t) & = & \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j i}^{* *}} π_{i} p_{j k, i}^{*} f_{σ \sqrt{\frac{λ_{j}}{k}} T_{2 k}} (t) \\ = & \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j i}^{* *}} π_{i} (\prod_{j = 1}^{2} λ_{j}^{r_{j i}^{* *}}) c_{j k, i}^{*} \frac{Γ (k + 0.5)}{\sqrt{2 π σ^{2}}} {(λ_{j} + \frac{t^{2}}{2 σ^{2}})}^{- (k + 0.5)} \end{matrix}

for

t \in (- \infty, \infty)

. Naturally, the corresponding near-exact cumulative distribution function of

T^{*}

would be a finite mixture of cumulative distribution function

s

of

σ \sqrt{\frac{λ_{j}}{k}} T_{2 k}

with weights

π_{i} p_{j k, i}^{*}

, which can be written as

\begin{matrix} \begin{matrix} F_{T^{*}}^{*} (t) & = & \frac{1}{2} + \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{2} \sum_{k = 1}^{r_{j i}^{* *}} {π_{i} (\prod_{j = 1}^{2} λ_{j}_{j i}^{* *}) c_{j k, i}^{*} \frac{Γ (k) Γ (k + 0.5)}{\sqrt{2 σ^{2}}} \frac{t}{2 λ_{j}} {(λ_{j} + \frac{t^{2}}{2 σ^{2}})}^{0.5 - k} \\ \times \sum_{s = 0}^{k - 1} \frac{{(\frac{t^{2}}{2 σ^{2} λ_{j}})}^{s}}{Γ (\frac{3}{2} + s) Γ (k - s)}} \end{matrix} \end{matrix}

(5)

for

t \in (- \infty, \infty)

, by applying

F_{T_{2 k}} (t)

obtained in Section 2.

4. One of the Sample Sizes Is Even and the Other Is Odd

The exact distribution of the Behrens–Fisher statistic

T^{*}

for this case is given by the same expressions used in Section 3.1. Just like the case of even-numbered sample sizes, the exact cumulative distribution function is not that manageable for usage when

λ_{1} \neq λ_{2}

in this case, too. Hence there is a need to obtain a near-exact distribution of

T^{*}

with a manageable cumulative distribution function for faster and more practical computation of quantiles and

p

-values.

As we did in Section 3.2, we will first obtain a near-exact distribution for W, and then we will derive the near-exact distribution for

T^{*}

from the distribution of W. Let, without any loss of generality,

n_{1}

be even and

n_{2}

be odd. Additionally, let

r_{1}^{*} = \frac{n_{1} - 2}{2}

, which is equivalent to

[r_{1}]

, and let

λ_{1} \neq λ_{2}

. Then, for

r_{j}

and

λ_{j}

as defined in (2) and (3), the exact characteristic function of W can be written as

\begin{matrix} Φ_{W} (t) & = & λ_{1}^{r_{1}} {(λ_{1} - i t)}^{- r_{1}} λ_{2}^{r_{2}} {(λ_{2} - i t)}^{- r_{2}} \\ = & \underset{Φ_{W_{1}} (t)}{\underset{︸}{λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} {λ_{2}}^{r_{2}} {(λ_{2} - i t)}^{- r_{2}}}} \underset{Φ_{W_{2}} (t)}{\underset{︸}{λ_{1}^{\frac{1}{2}} {(λ_{1} - i t)}^{- \frac{1}{2}}}}, \end{matrix}

where

Φ_{W_{1}} (t)

is the characteristic function of a GIG distribution of depth 2 with integer shape parameters

r_{1}^{*}

and

r_{2}

and rate parameters

λ_{1}

and

λ_{2}

, while

Φ_{W_{2}} (t)

is the characteristic function of a

Γ (0.5, λ_{1})

distribution. Now, for a given

m^{*} \in N

, we propose to approximate

Φ_{W_{2}} (t)

by

Φ_{W_{2}}^{*} (t) = \sum_{j = 0}^{m^{*}} π_{j} {(2 λ_{1})}^{(j + 1)} {(2 λ_{1} - i t)}^{- (j + 1)}

where

π_{j} (j = 0, \dots, m^{*} - 1)

are determined in such a way that

{\frac{\partial^{h}}{\partial t^{h}} Φ_{W_{2}} (t)|}_{t = 0} = {\frac{\partial^{h}}{\partial t^{h}} Φ_{W_{2}}^{*} (t)|}_{t = 0} for h = 1, \dots, m^{*}

with

π_{m^{*}} = 1 - \sum_{i = 0}^{m^{*} - 1} π_{i}

.

The characteristic function

Φ_{W_{2}}^{*} (t)

is the characteristic function of a finite mixture of

m^{*} + 1

distributions, which are

Γ (j + 1, 2 λ_{1})

distributions

(j = 0, \dots, m^{*})

. The first

m^{*}

moments of this mixture are the same as those of

Γ (0.5, λ_{1})

.

We should note that the weights

π_{j}

do not depend on

λ_{1}

. The h-th non-central moment of

Γ (0.5, λ_{1})

and of the mixture of

Γ (j + 1, 2 λ_{1})

distributions are, respectively, given by

\frac{Γ (h + \frac{1}{2})}{Γ (\frac{1}{2}) λ_{1}^{h}} and \sum_{j = 0}^{m^{*}} π_{j} \frac{Γ (h + j + 1)}{Γ (j + 1) {(2 λ_{1})}^{h}},

which means

π_{j}

are determined in such a way that satisfies

\frac{Γ (h + \frac{1}{2})}{Γ (\frac{1}{2})} = \sum_{j = 0}^{m^{*}} π_{j} \frac{Γ (h + j + 1)}{Γ (j + 1) 2^{h}},

for

h = 1, \dots, m^{*}

, definitely not depending on

λ_{1}

.

By using

Φ_{W_{2}}^{*} (t)

instead of

Φ_{W_{2}} (t)

, we can then approximate

Φ_{W} (t)

with

\begin{matrix} Φ_{W}^{*} (t) & = λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}} {(λ_{2} - i t)}^{- r_{2}} \sum_{j = 0}^{m^{*}} π_{j} {(2 λ_{1})}^{(j + 1)} {(2 λ_{1} - i t)}^{- (j + 1)} \\ = & \sum_{j = 0}^{m^{*}} π_{j} λ_{1}^{r_{1}^{*}} {(λ_{1} - i t)}^{- r_{1}^{*}} λ_{2}^{r_{2}} {(λ_{2} - i t)}^{- r_{2}} {(2 λ_{1})}^{(j + 1)} {(2 λ_{1} - i t)}^{- (j + 1)} \end{matrix}

which is the characteristic function of a finite mixture of

m^{*} + 1 GIG

distributions of depth 3 with shape parameters

r_{1}^{*}, r_{2}, j + 1 (j = 0, \dots, m^{*})

and rate parameters

λ_{1}, λ_{2}, 2 λ_{1}

.

Thus, a finite mixture of

m^{*} + 1

probability density functions s of

G I G

distributions of depth 3 with weights

π_{j}

can be a near-exact probability density function of W. As GIG distribution itself is a finite mixture of gamma distributions, this near-exact probability density function of W is written as

\begin{matrix} f_{W}^{*} (w) & = & \sum_{i = 0}^{m^{*}} π_{i} \sum_{j = 1}^{3} \sum_{k = 1}^{r_{j i}^{*}} (\prod_{j = 1}^{3} {(λ_{j}^{*})}^{r_{j i}^{*}}) c_{j k, i}^{*} w^{k - 1} e^{- λ_{j}^{*} w} \\ = & \sum_{i = 0}^{m^{*}} π_{i} \sum_{j = 1}^{3} \sum_{k = 1}^{r_{j i}^{*}} \underset{p_{j k, i}^{*}}{\underset{︸}{(\prod_{j = 1}^{3} {(λ_{j}^{*})}^{r_{j i}^{*}}) c_{j k, i}^{*} \frac{Γ (k)}{{(λ_{j}^{*})}^{k}}}} \underset{p . d . f of Γ (k, λ_{j}^{*})}{\underset{︸}{\frac{{(λ_{j}^{*})}^{k}}{Γ (k)} w^{k - 1} e^{- λ^{*} w}}} (w > 0) \end{matrix}

where

r_{j i}^{*} = \{\begin{matrix} r_{1}^{*} & j = 1 \\ r_{2} & j = 2 \\ i + 1 & j = 3 \end{matrix} (i = 0, \dots, m^{*}), λ_{j}^{*} = \{\begin{matrix} λ_{1} & j = 1 \\ λ_{2} & j = 2 \\ 2 λ_{1} & j = 3 \end{matrix}

and

c_{j k, i}^{*}

are given by (11)–(13) in [8] using

r_{j i}^{*}

and

λ_{j}^{*}

, which are

\begin{matrix} c_{j, r_{j i}^{*}, i} = \frac{1}{(r_{j i}^{*} - 1)!} \prod_{s = 1, s \neq j}^{3} {(λ_{s}^{*} - λ^{*}_{j})}^{- r_{s i}^{*}}, \\ c_{j, r_{j i}^{*} - k, i}^{*} = \frac{1}{k} \sum_{n = 1}^{k} \frac{(r_{j i}^{*} - k + n - 1)!}{(r_{j i}^{*} - k - 1)!} (\sum_{s = 1, s \neq j}^{3} r_{s} {(λ_{j}^{*} - λ_{s}^{*})}^{- n}) c_{j, r_{j i}^{*} - k + n, i} \end{matrix}

where

k = 1, \dots, r_{j i}^{*} - 1

and

j = 1, 2, 3, i = 0, \dots, m^{*}

. From this near-exact probability density function of W, using the same logic as before, we can once again obtain a near-exact probability density function of

T^{*}

in the form of a finite mixture of probability density functions of

σ \sqrt{\frac{λ_{j}^{*}}{k}} T_{2 k}

distributions, with weights

π_{i} p_{j k, i}^{*}

. The near-exact probability density function of

T^{*}

, under

H_{0}

, is thus given by

\begin{matrix} f_{T^{*}}^{*} (t) & = & \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{3} \sum_{k = 1}^{r_{j i}^{*}} π_{i} p_{j k, i}^{*} f_{σ \sqrt{\frac{λ_{j}^{*}}{k}} T_{2 k}} (t) \\ = & \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{3} \sum_{k = 1}^{r_{j i}^{*}} π_{i} (\prod_{j = 1}^{3} {(λ_{j}^{*})}^{r_{j i}^{*}}) c_{j k, i}^{*} \frac{Γ (k + \frac{1}{2})}{\sqrt{2 π σ^{2}}} {(λ_{j}^{*} + \frac{t^{2}}{2 σ^{2}})}^{- (k + \frac{1}{2})} \end{matrix}

for

t \in (- \infty, \infty)

.

Hence, the corresponding near-exact cumulative distribution function of

T^{*}

, under

H_{0}

, is also a finite mixture of cumulative distribution functions of

σ \sqrt{\frac{λ_{j}^{*}}{k}} T_{2 k}

, with weights

π_{i} p_{j k, i}^{*}

, which can be written as

\begin{matrix} \begin{matrix} F_{T^{*}}^{*} (t) & = & \frac{1}{2} + \sum_{i = 0}^{m^{*}} \sum_{j = 1}^{3} \sum_{k = 1}^{r_{j i}^{*}} {π_{i} (\prod_{j = 1}^{3} {(λ_{j}^{*})}^{r_{j i}^{*}}) c_{j k, i}^{*} \frac{Γ (k) Γ (k + 0.5)}{\sqrt{2 σ^{2}}} \frac{t}{2 λ_{j}^{*}} {(λ_{j}^{*} + \frac{t^{2}}{2 σ^{2}})}^{0.5 - k} \\ \times \sum_{s = 0}^{k - 1} \frac{{(\frac{t^{2}}{2 σ^{2} λ_{j}^{*}})}^{s}}{Γ (\frac{3}{2} + s) Γ (k - s)}} \end{matrix} \end{matrix}

(6)

for

t \in (- \infty, \infty)

by applying

F_{T_{2 k}} (t)

obtained in Section 2.

5. Comparison of the Exact or Near-Exact Distribution and Welch’s t Test

When it is plausible to assume that

λ_{1} \neq λ_{2}

, that is, that

\frac{n_{1} (n_{1} - 1)}{2 σ_{1}^{2}} \neq \frac{n_{2} (n_{2} - 1)}{2 σ_{2}^{2}}

, we can make use of the exact and near-exact distribution of

T^{*}

to solve the Behrens–Fisher problem. The exact cumulative distribution function of

T^{*}

will be used for computation of p-values when both sample sizes are odd, while the near-exact cumulative distribution functions obtained in Section 3 and Section 4 will be used for computing p-values when both sample sizes are even or when one of the sample sizes is even and the other is odd. Because the cumulative distribution functions of

T^{*}

include the unknown parameters

σ_{1}^{2}

and

σ_{2}^{2}

, these will be estimated by the sample variances

S_{i}^{2}

(i = 1, 2)

.

We will compare the exact or near-exact distributions and Welch’s t-test by their actual sizes and powers for testing

H_{0} : μ_{1} = μ_{2}

versus

H_{1} : μ_{1} > μ_{2}

. For

T^{*} = t

, since the hypothesis is for the right-tailed test, the corresponding p-value is computed from

1 - F_{T^{*}} (t)

in (4) or

1 - F_{T^{*}}^{*} (t)

in (5) or (6), depending on the parity of the sample sizes, according to the derivations in the previous sections, and with

σ

estimated by

\hat{σ} = \sqrt{S_{1}^{2} / n_{1} + S_{2}^{2} / n_{2}}

. We used different type I error rates

α \in {0.1, 0.05, 0.01}

and we conducted Monte Carlo experiments under a range of different sets of parameters.

Simulations were conducted for variance and sample size pairs that correspond to

θ = \frac{σ_{1}^{2}}{n_{1}} / (\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}) \in {0.1, 0.3, 0.5} .

For

μ_{1} - μ_{2}

, we covered the cases where

μ_{1} - μ_{2}

satisfies

δ = \frac{μ_{1} - μ_{2}}{\sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}}} \in {0, 1, 2},

with

δ = 0

corresponding to the null hypothesis

H_{0} : μ_{1} = μ_{2}

.

For each combination of parameters, the number of replications was 50,000, and in each subsection, we provide two scenarios: one in which sample sizes were balanced, and another one where they were unbalanced. For each generated sample, we computed

t_{i}

for

i = 1, \dots,

50,000 and then obtained the type I error under

δ = 0

and the power under

δ > 0

from

\begin{matrix} \frac{1}{50, 000} \sum_{i = 1}^{50, 000} I (p_{i} \leq α) \end{matrix}

(7)

where

I (\cdot)

is the indicator function and the p-value

p_{i} = 1 - F_{T^{*}} (t_{i})

is computed using

F_{T^{*}} (\cdot)

in (4) for the case of both odd sample sizes or

F_{T^{*}} (\cdot)

in (5) or (6) when at least one of the sample sizes is even.

All computations were done with the software R, version 4.1.0.

5.1. Odd n₁ and n₂

In Table 1 and Table 2, we present the power values for the exact distribution and Welch’s test, represented, respectively, by E and W. We considered for

θ

and

δ

the sets of values indicated above.

We may see that when sample sizes are unbalanced, as in Table 2, the exact distribution gives larger values of power than Welch’s t-test, namely for larger values of

δ

and smaller values of

α

, while they tend to give similar results when the two sample sizes are homogeneous, still with the exact distribution giving larger power values when the variances are unbalanced. Namely, for the case of

(n_{1}, n_{2}) = (15, 3)

and

(σ_{1}^{2}, σ_{2}^{2}) = (15, 27)

and

δ = 2

, the near-exact distribution shows a gain of over 30% in power in relation to Welch’s t-test for

α = 0.01

.

5.2. Even $n_{1}$ and $n_{2}$

Table 3 and Table 4 provide numerical results for type I error rates and powers for Welch’s t-test and near-exact distributions that use

m^{*} = 4

for cases where both sample sizes are even. In these tables, the near-exact distributions and Welch’s test are, respectively, denoted by NE and W.

Similar to the case of odd sample sizes, we see that the differences in power between the near-exact distribution and Welch’s t-test are quite slim when the sample sizes are equal, as shown in Table 3, although with a tendency for the near-exact distributions to exhibit larger powers when variances are unbalanced, while in the unbalanced case

(n_{1}, n_{2}) = (12, 4)

shown in Table 4 the power displayed by the near-exact distributions is quite larger, particularly if the variances are also unbalanced. Namely, for the case

(σ_{1}^{2}, σ_{2}^{2}) = (8, 72)

and

δ = 2

, the near-exact distribution shows a gain of over 20% in power.

5.3. $n_{1}$ Is Even and $n_{2}$ Is Odd

Table 5 displays the case of similar sample sizes, and it shows that once again, the near-exact distribution and Welch’s test are fairly similar to each other in terms of Type I error rates and values of power. On the other hand, Table 6 presents the results for the unbalanced sample size case, and it shows that the near-exact distribution and Welch’s test can control well the type I error rates, but that the near-exact distribution can obtain larger powers than Welch’s test. In particular, when

(σ_{1}^{2}, σ_{2}^{2}) = (12, 27)

and

δ = 2

, the near-exact distribution shows a gain of almost 30% in power in relation to Welch’s t-test for

α = 0.01

. The near-exact distributions and Welch’s test are, respectively, denoted by NE and W in these tables.

5.4. Brief Study of Power Evolution for Increasing Sample Sizes

With the aim of showing the evolution of the values of power for increasing sample sizes, the plots of power curves for increasing values of the sample sizes are shown in Figure 1, Figure 2 and Figure 3, for given

σ_{1}^{2}

,

σ_{2}^{2}

,

μ_{1}

and

μ_{2}

. As expected, it is clear that the power curves are increasing with increasing values of the sample sizes. Figure 1, Figure 2 and Figure 3 represent, respectively, cases of

(n_{1}, n_{2}) = (o d d, o d d)

,

(e v e n, e v e n)

and

(e v e n, o d d)

. It is also clear from these Figures that, also as expected, although the use of the exact or near-exact distributions lead to an increase in the values of power, the power values from Welch’s test approach asymptotically those obtained when using the exact or near-exact distributions for increasing sample sizes. In Figure 1, Figure 2 and Figure 3, we use

(μ_{1}, μ_{2}) = (\sqrt{40}, 0)

and present the values of

(σ_{1}^{2}, σ_{2}^{2})

and

(n_{1}, n_{2})

in each figure. In addition, the corresponding values of

δ

are also presented.

6. Conclusions

Over the years since the Behrens–Fisher problem was first introduced, many different solutions have been presented for the problem. In this paper, we propose another approach for the Behrens–Fisher problem that is based on a version of the exact distribution and near-exact distributions for its statistic, which are based on GIG (generalized integer gamma) distributions. Overall, the differences between the sizes of the near-exact distribution and Welch’s t-test are negligible, while the use of the exact or near-exact distributions provide powers that are larger than those provided by Welch’s t-test, mainly for the cases where the sample sizes and/or the variances are unbalanced, and namely when smaller sample sizes are associated with larger variances. The results thus show that, mainly for the cases of unbalanced sample sizes and/or unbalanced variances, the use of Welch’s t-test leads to some loss in power compared with the use of the exact or near-exact distributions developed, thus advising towards the use of these latter ones.

The computation of the exact or near-exact distributions poses absolutely no problems, even for large sample sizes, with the computation times remaining in the hundredths of a second for sample sizes in the order of a few hundreds.

In order to decide which distribution to use, the user may want to test the hypothesis

λ_{1} = λ_{2}

. We may note that, given the definition of

λ_{1}

and

λ_{2}

in (3), testing that hypothesis is indeed equivalent to testinf the hypothesis

\frac{σ_{2}}{σ_{1}} \frac{n_{1} (n_{1} - 1)}{n_{2} (n_{2} - 1)} = 1,

which may tested in as much the same way we run a test of equality of two variances based on two independent samples.

Author Contributions

Conceptualization, S.H., C.A.C. and J.P.; methodology, S.H., C.A.C. and J.P.; software, S.H.; validation, S.H., C.A.C. and J.P.; formal analysis, S.H.; investigation, C.A.C. and J.P.; writing—original draft preparation, S.H.; writing—review and editing, C.A.C. and J.P.; funding acquisition, C.A.C. and J.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1A2C1A01100526) and also funded by Portuguese national funds through FCT—Fundação para a Ciência e a Tecnologia, I.P., under the scope of the projects UIDB/00297/2020 and UIDP/00297/2020 (Center for Mathematics and Applications-NOVAMath).

Conflicts of Interest

The authors declare no conflict of interest. The founding institutions had no role in the design of the study, in the collection, analysis, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.

References

Linnik, J.V. Statistical Problems with Nuisance Parameters (Scripta Technica, Trans.); (Original Work Published 1966); American Mathematical Society: Providence, RI, USA, 1968. [Google Scholar]
Fisher, R.A. The fiducial argument in statistical inference. Ann. Eugen. 1935, 6, 391–398. [Google Scholar] [CrossRef]
Fisher, R.A. The comparison of samples with possibly unequal variances. Ann. Eugen. 1939, 9, 174–180. [Google Scholar] [CrossRef]
Jeffreys, H. Note on the Behrens-Fisher formula. Ann. Eugen. 1940, 10, 48–51. [Google Scholar] [CrossRef]
Jeffreys, H. Theory of Probability, 3rd ed.; Oxford University Press: London, UK, 1961. [Google Scholar]
Welch, B.L. The significance of the difference between two means when the population variances are unequal. Biometrika 1938, 29, 350–362. [Google Scholar] [CrossRef]
Welch, B.L. The generalization of ‘Student’s’ problem when several different population variances are involved. Biometrika 1947, 34, 28–35. [Google Scholar] [CrossRef] [PubMed]
Coelho, C.A. The Generalized Integer Gamma distribution—A basis for distributions in Multivariate Stastistics. J. Multivar. Anal. 1998, 64, 86–102. [Google Scholar] [CrossRef]
Coelho, C.A. The wrapped Gamma distribution and wrapped sums and linear combinations of independent Gamma and Laplace distributions. J. Stat. Theory Pract. 2007, 1, 1–29. [Google Scholar] [CrossRef]
Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions, 9th ed.; Dover: New York, NY, USA, 1974. [Google Scholar]

Figure 1. The left panel is based on

σ_{1}^{2} = 5

,

σ_{2}^{2} = 45

.

(n_{1}, n_{2}) = (5 + 2 k, 5 + 2 k)

for

k = 0, 1, \dots, 12

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{5}{5 + 2 k} + \frac{45}{5 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 15

,

σ_{2}^{2} = 27

and

(n_{1}, n_{2}) = (15 + 2 k, 3 + 2 k)

for

k = 0, 1, \dots, 9

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{15}{15 + 2 k} + \frac{27}{3 + 2 k}}

. Red and blue lines represent the power curves from exact distribution and Welch’s test, respectively.

Figure 1. The left panel is based on

σ_{1}^{2} = 5

,

σ_{2}^{2} = 45

.

(n_{1}, n_{2}) = (5 + 2 k, 5 + 2 k)

for

k = 0, 1, \dots, 12

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{5}{5 + 2 k} + \frac{45}{5 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 15

,

σ_{2}^{2} = 27

and

(n_{1}, n_{2}) = (15 + 2 k, 3 + 2 k)

for

k = 0, 1, \dots, 9

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{15}{15 + 2 k} + \frac{27}{3 + 2 k}}

. Red and blue lines represent the power curves from exact distribution and Welch’s test, respectively.

Figure 2. The left panel is based on

σ_{1}^{2} = 8

,

σ_{2}^{2} = 72

.

(n_{1}, n_{2}) = (8 + 2 k, 8 + 2 k)

for

k = 0, 1, \dots, 17

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{8}{8 + 2 k} + \frac{72}{8 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 12

,

σ_{2}^{2} = 36

and

(n_{1}, n_{2}) = (12 + 2 k, 4 + 2 k)

for

k = 0, 1, \dots, 11

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{12}{12 + 2 k} + \frac{36}{4 + 2 k}}

Red and blue lines represent the power curves from the nearly-exact distribution and Welch’s test, respectively.

Figure 2. The left panel is based on

σ_{1}^{2} = 8

,

σ_{2}^{2} = 72

.

(n_{1}, n_{2}) = (8 + 2 k, 8 + 2 k)

for

k = 0, 1, \dots, 17

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{8}{8 + 2 k} + \frac{72}{8 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 12

,

σ_{2}^{2} = 36

and

(n_{1}, n_{2}) = (12 + 2 k, 4 + 2 k)

for

k = 0, 1, \dots, 11

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{12}{12 + 2 k} + \frac{36}{4 + 2 k}}

Red and blue lines represent the power curves from the nearly-exact distribution and Welch’s test, respectively.

Figure 3. The left panel is based on

σ_{1}^{2} = 8

,

σ_{2}^{2} = 63

.

(n_{1}, n_{2}) = (8 + 2 k, 7 + 2 k)

for

k = 0, 1, \dots, 14

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{8}{8 + 2 k} + \frac{63}{7 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 12

,

σ_{2}^{2} = 27

and

(n_{1}, n_{2}) = (12 + 2 k, 3 + 2 k)

for

k = 0, 1, \dots, 9

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{12}{12 + 2 k} + \frac{27}{3 + 2 k}}

. Red and blue lines represent the power curves from the nearly exact distribution and Welch’s test, respectively.

Figure 3. The left panel is based on

σ_{1}^{2} = 8

,

σ_{2}^{2} = 63

.

(n_{1}, n_{2}) = (8 + 2 k, 7 + 2 k)

for

k = 0, 1, \dots, 14

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{8}{8 + 2 k} + \frac{63}{7 + 2 k}}

. The right panel is based on

σ_{1}^{2} = 12

,

σ_{2}^{2} = 27

and

(n_{1}, n_{2}) = (12 + 2 k, 3 + 2 k)

for

k = 0, 1, \dots, 9

and

δ_{k} = (μ_{1} - μ_{2}) / \sqrt{\frac{σ_{1}^{2}}{n_{1}} + \frac{σ_{2}^{2}}{n_{2}}} = \sqrt{40} / \sqrt{\frac{12}{12 + 2 k} + \frac{27}{3 + 2 k}}

. Red and blue lines represent the power curves from the nearly exact distribution and Welch’s test, respectively.

Table 1. Power values for

(n_{1}, n_{2}) = (5, 5)

.

Table 1. Power values for

(n_{1}, n_{2}) = (5, 5)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		E	W	E	W	E	W
$0.1$	$0.1$	$0.1009$	$0.1000$	$0.3578$	$0.3558$	$0.7059$	$0.7036$
$σ_{1}^{2} = 5$	$0.05$	$0.0518$	$0.0506$	$0.2234$	$0.2199$	$0.5368$	$0.5313$
$σ_{2}^{2} = 45$	$0.01$	$0.0123$	$0.0114$	$0.0731$	$0.0676$	$0.2291$	$0.2139$
$0.3$	$0.1$	$0.0971$	$0.0965$	$0.3664$	$0.3648$	$0.7205$	$0.7187$
$σ_{1}^{2} = 15$	$0.05$	$0.0480$	$0.0473$	$0.2271$	$0.2241$	$0.5528$	$0.5482$
$σ_{2}^{2} = 35$	$0.01$	$0.0100$	$0.0092$	$0.0636$	$0.0601$	$0.2376$	$0.2263$
$0.5$	$0.1$	$0.0958$	$0.0953$	$0.3628$	$0.3614$	$0.7221$	$0.7205$
$σ_{1}^{2} = 5$	$0.05$	$0.0462$	$0.0456$	$0.2243$	$0.2218$	$0.5582$	$0.5545$
$σ_{2}^{2} = 5$	$0.01$	$0.0094$	$0.0005$	$0.0641$	$0.0606$	$0.2375$	$0.2288$

Table 2. Power values for

(n_{1}, n_{2}) = (15, 3)

.

Table 2. Power values for

(n_{1}, n_{2}) = (15, 3)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		E	W	E	W	E	W
$0.1$	$0.1$	$0.1100$	$0.1048$	$0.3542$	$0.3402$	$0.6614$	$0.6441$
$σ_{1}^{2} = 15$	$0.05$	$0.0660$	$0.0595$	$0.2304$	$0.2090$	$0.4893$	$0.4522$
$σ_{2}^{2} = 27$	$0.01$	$0.0273$	$0.0053$	$0.1076$	$0.0843$	$0.2609$	$0.1987$
$0.3$	$0.1$	$0.1052$	$0.1013$	$0.3636$	$0.3538$	$0.7033$	$0.6900$
$σ_{1}^{2} = 45$	$0.05$	$0.0593$	$0.0557$	$0.2434$	$0.2287$	$0.5414$	$0.5133$
$σ_{2}^{2} = 21$	$0.01$	$0.0180$	$0.0161$	$0.0990$	$0.0872$	$0.2940$	$0.2533$
$0.5$	$0.1$	$0.1001$	$0.0976$	$0.3743$	$0.3679$	$0.7263$	$0.7185$
$σ_{1}^{2} = 15$	$0.05$	$0.0532$	$0.0507$	$0.2390$	$0.2301$	$0.5657$	$0.5488$
$σ_{2}^{2} = 3$	$0.01$	$0.0133$	$0.0119$	$0.0866$	$0.0788$	$0.2949$	$0.2687$

Table 3. Power values for

(n_{1}, n_{2}) = (8, 8)

.

Table 3. Power values for

(n_{1}, n_{2}) = (8, 8)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		NE	W	NE	W	NE	W
$0.1$	$0.1$	$0.1002$	$0.0998$	$0.3731$	$0.3722$	$0.7319$	$0.7311$
$σ_{1}^{2} = 8$	$0.05$	$0.0525$	$0.0519$	$0.2384$	$0.2371$	$0.5775$	$0.5750$
$σ_{2}^{2} = 72$	$0.01$	$0.0111$	$0.0106$	$0.0755$	$0.0726$	$0.2747$	$0.2672$
$0.3$	$0.1$	$0.1001$	$0.0992$	$0.3746$	$0.3741$	$0.7392$	$0.7395$
$σ_{1}^{2} = 24$	$0.05$	$0.0497$	$0.0488$	$0.2373$	$0.2360$	$0.5927$	$0.5922$
$σ_{2}^{2} = 56$	$0.01$	$0.0105$	$0.0094$	$0.0760$	$0.0744$	$0.2929$	$0.2879$
$0.5$	$0.1$	$0.0981$	$0.0968$	$0.3775$	$0.3772$	$0.7370$	$0.7385$
$σ_{1}^{2} = 8$	$0.05$	$0.0511$	$0.0495$	$0.2403$	$0.2393$	$0.5942$	$0.5947$
$σ_{2}^{2} = 8$	$0.01$	$0.0103$	$0.0087$	$0.0757$	$0.0739$	$0.2911$	$0.2890$

Table 4. Power values for

(n_{1}, n_{2}) = (12, 4)

.

Table 4. Power values for

(n_{1}, n_{2}) = (12, 4)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		NE	W	NE	W	NE	W
$0.1$	$0.1$	$0.1061$	$0.1020$	$0.3554$	$0.3458$	$0.6925$	$0.6823$
$σ_{1}^{2} = 12$	$0.05$	$0.0578$	$0.0539$	$0.2293$	$0.2129$	$0.5233$	$0.4974$
$σ_{2}^{2} = 36$	$0.01$	$0.0175$	$0.0148$	$0.0894$	$0.0725$	$0.2568$	$0.2082$
$0.3$	$0.1$	$0.1031$	$0.1009$	$0.3666$	$0.3612$	$0.7186$	$0.7120$
$σ_{1}^{2} = 36$	$0.05$	$0.0545$	$0.0523$	$0.2361$	$0.2266$	$0.5657$	$0.5492$
$σ_{2}^{2} = 28$	$0.01$	$0.0131$	$0.0117$	$0.0864$	$0.0769$	$0.2781$	$0.2498$
$0.5$	$0.1$	$0.0991$	$0.0976$	$0.3730$	$0.3701$	$0.7332$	$0.7308$
$σ_{1}^{2} = 12$	$0.05$	$0.0511$	$0.0492$	$0.2388$	$0.2340$	$0.5802$	$0.5718$
$σ_{2}^{2} = 4$	$0.01$	$0.0117$	$0.0103$	$0.0785$	$0.0732$	$0.2841$	$0.2683$

Table 5. Power values for

(n_{1}, n_{2}) = (8, 7)

.

Table 5. Power values for

(n_{1}, n_{2}) = (8, 7)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		NE	W	NE	W	NE	W
$0.1$	$0.1$	$0.1015$	$0.1012$	$0.3684$	$0.3674$	$0.7279$	$0.7269$
$σ_{1}^{2} = 8$	$0.05$	$0.0504$	$0.0497$	$0.2377$	$0.2354$	$0.5669$	$0.5638$
$σ_{2}^{2} = 63$	$0.01$	$0.0117$	$0.0111$	$0.0744$	$0.0712$	$0.2642$	$0.2549$
$0.3$	$0.1$	$0.0980$	$0.0971$	$0.3736$	$0.3730$	$0.7366$	$0.7367$
$σ_{1}^{2} = 24$	$0.05$	$0.0496$	$0.0486$	$0.2354$	$0.2341$	$0.5874$	$0.5857$
$σ_{2}^{2} = 49$	$0.01$	$0.0115$	$0.0104$	$0.0747$	$0.0722$	$0.2876$	$0.2820$
$0.5$	$0.1$	$0.1015$	$0.0995$	$0.3711$	$0.3706$	$0.7389$	$0.7405$
$σ_{1}^{2} = 8$	$0.05$	$0.0494$	$0.0475$	$0.2410$	$0.2393$	$0.5897$	$0.5896$
$σ_{2}^{2} = 7$	$0.01$	$0.0116$	$0.0089$	$0.0737$	$0.0701$	$0.2844$	$0.2806$

Table 6. Power values for

(n_{1}, n_{2}) = (12, 3)

.

Table 6. Power values for

(n_{1}, n_{2}) = (12, 3)

.

$θ$	$α$	$δ = 0$		$δ = 1$		$δ = 2$
		NE	W	NE	W	NE	W
$0.1$	$0.1$	$0.1099$	$0.1045$	$0.3495$	$0.3355$	$0.6590$	$0.6418$
$σ_{1}^{2} = 12$	$0.05$	$0.0632$	$0.0574$	$0.2311$	$0.2099$	$0.4807$	$0.4455$
$σ_{2}^{2} = 27$	$0.01$	$0.0254$	$0.0206$	$0.1066$	$0.0849$	$0.2586$	$0.1994$
$0.3$	$0.1$	$0.1074$	$0.1035$	$0.3632$	$0.3523$	$0.6966$	$0.6817$
$σ_{1}^{2} = 36$	$0.05$	$0.0582$	$0.0546$	$0.2417$	$0.2261$	$0.5417$	$0.5120$
$σ_{2}^{2} = 21$	$0.01$	$0.0157$	$0.0137$	$0.0929$	$0.0808$	$0.2825$	$0.2428$
$0.5$	$0.1$	$0.1006$	$0.0986$	$0.3671$	$0.3611$	$0.7185$	$0.7102$
$σ_{1}^{2} = 12$	$0.05$	$0.0503$	$0.0486$	$0.2351$	$0.2263$	$0.5664$	$0.5493$
$σ_{2}^{2} = 3$	$0.01$	$0.0115$	$0.01058$	$0.0817$	$0.0755$	$0.2826$	$0.2581$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hong, S.; Coelho, C.A.; Park, J. An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem. Mathematics 2022, 10, 2953. https://doi.org/10.3390/math10162953

AMA Style

Hong S, Coelho CA, Park J. An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem. Mathematics. 2022; 10(16):2953. https://doi.org/10.3390/math10162953

Chicago/Turabian Style

Hong, Serim, Carlos A. Coelho, and Junyong Park. 2022. "An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem" Mathematics 10, no. 16: 2953. https://doi.org/10.3390/math10162953

APA Style

Hong, S., Coelho, C. A., & Park, J. (2022). An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem. Mathematics, 10(16), 2953. https://doi.org/10.3390/math10162953

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem

Abstract

1. Introduction

2. The Exact Distribution of the Behrens–Fisher Statistic for Odd-Numbered Sample Sizes

3. The Exact and Near-Exact Distribution of the Behrens–Fisher Statistic for Even-Numbered Sample Sizes

3.1. The Exact Distribution

3.2. Near-Exact Distribution

4. One of the Sample Sizes Is Even and the Other Is Odd

5. Comparison of the Exact or Near-Exact Distribution and Welch’s t Test

5.1. Odd n₁ and n₂

5.2. Even $n_{1}$ and $n_{2}$

5.3. $n_{1}$ Is Even and $n_{2}$ Is Odd

5.4. Brief Study of Power Evolution for Increasing Sample Sizes

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

An Exact and Near-Exact Distribution Approach to the Behrens–Fisher Problem

Abstract

1. Introduction

2. The Exact Distribution of the Behrens–Fisher Statistic for Odd-Numbered Sample Sizes

3. The Exact and Near-Exact Distribution of the Behrens–Fisher Statistic for Even-Numbered Sample Sizes

3.1. The Exact Distribution

3.2. Near-Exact Distribution

4. One of the Sample Sizes Is Even and the Other Is Odd

5. Comparison of the Exact or Near-Exact Distribution and Welch’s t Test

5.1. Odd n1 and n2

5.2. Even n 1 and n 2

5.3. n 1 Is Even and n 2 Is Odd

5.4. Brief Study of Power Evolution for Increasing Sample Sizes

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5.1. Odd n₁ and n₂

5.2. Even $n_{1}$ and $n_{2}$

5.3. $n_{1}$ Is Even and $n_{2}$ Is Odd