Constrained Bayesian Method for Testing Equi-Correlation Coefficient

Kachiashvili, Kartlos; SenGupta, Ashis

doi:10.3390/axioms13100722

Open AccessArticle

Constrained Bayesian Method for Testing Equi-Correlation Coefficient

by

Kartlos Kachiashvili

^1,2,3,*

and

Ashis SenGupta

^4,5,6

¹

Faculty of Informatics and Control Systems, Georgian Technical University, Tbilisi 0160, Georgia

²

Ilia Vekua Institute of Applied Mathematics, Ivane Javakhishvili Tbilisi State University, Tbilisi 0186, Georgia

³

Muskhelishvili Institute of Computational Mathematics, Georgian Technical University, Tbilisi 0159, Georgia

⁴

Applied Statistics Unit, Indian Statistical Institute, Kolkata 700108, India

⁵

Medical College of Georgia, Augusta University, Augusta, GA 30912, USA

⁶

Department of Statistics, Middle East Technical University, 06800 Ankara, Turkey

^*

Author to whom correspondence should be addressed.

Axioms 2024, 13(10), 722; https://doi.org/10.3390/axioms13100722

Submission received: 27 August 2024 / Revised: 27 September 2024 / Accepted: 9 October 2024 / Published: 17 October 2024

(This article belongs to the Special Issue Applications of Bayesian Methods in Statistical Analysis)

Download

Browse Figure

Versions Notes

Abstract

The problem of testing the equi-correlation coefficient of a standard symmetric multivariate normal distribution is considered. Constrained Bayesian and classical Bayes methods, using the maximum likelihood estimation and Stein’s approach, are examined. For the investigation of the obtained theoretical results and choosing the best among them, different practical examples are analyzed. The simulation results showed that the constrained Bayesian method (CBM) using Stein’s approach has the advantage of making decisions with higher reliability for testing hypotheses concerning the equi-correlation coefficient than the Bayes method. Also, the use of this approach with the probability distribution of linear combinations of chi-square random variables gives better results compared to that of using the integrated probability distributions in terms of providing both the necessary precisions as well as convenience of implementation in practice. Recommendations towards the use of the proposed methods for solving practical problems are given.

Keywords:

symmetric multivariate normal distribution; hypothesis; constrained Bayesian method; Bayes method; Stein’s approach

MSC:

62F15; 62F03

1. Introduction

Symmetric multivariate normal distribution (SMND) is widely used in many applications of different spheres of human activities such as psychology, education, genetics, and so on [1]. It is also used extensively in statistical inference procedures, e.g., in analysis of variance (ANOVA) for the modeling of the error part [2]. A random vector has an SMND if its components have equal means, equal variances, and equal correlation coefficients between the pairs of the components. The last is called the equi-correlation coefficient [1,3,4,5,6]. A vector has a standard symmetric multivariate normal distribution (SSMND) when the components have zero mean and unit variances. The consideration of SSMND instead of SMND does not reduce generality when the means and variances are known. On the other hand, SSMND is interesting from several theoretical aspects [1] (p. 2): it is an invariant model which belongs to a curved exponential family [6,7], allows us to find a simple estimator of the correlation coefficient [8,9], and is amenable to the derivation of a small-sample optimal test for the correlation coefficient [1].

Extensive research on inference for the correlation coefficient in SMND exists, starting from the early 1940s [10] up to recent years [11,12,13,14]. In the majority of the works, the problem of finding estimators of the correlation coefficient was considered (see, e.g., [14,15,16,17,18]), compared to the testing problem [1,3,4,5,10]. The likelihood ratio test (LRT) for the general case of testing

H_{0} : ρ = ρ_{0}

against

H_{2} : ρ \neq ρ_{0}

and the locally most powerful test (LMPT) for testing

H_{0} : ρ = 0

against

H_{1} : ρ > (<) 0

are presented in [1]. The LMPT is based on the best (minimum variance) natural unbiased estimator (BNUE) of

ρ

. The beta-optimal test and the power envelope for testing

H_{0} : ρ = 0

vs.

H_{1} : ρ > 0

is considered in [5]. The relative performances of these tests are compared with a locally best test proposed in [1]. The analogous test of [1] for the symmetric multivariate normal distribution is constructed in [10] where it is shown that the proposed test is uniformly most powerful (invariant) even in the presence of a nuisance parameter, σ².

In the present work, we for the first time consider the CBM for testing hypotheses

H_{0} : ρ = ρ_{0}

vs.

H_{2} : ρ \neq ρ_{0}

concerning the equi-correlation coefficient using the exact and asymptotical probability density functions (pdfs) of test statistics obtained in [1,18,19]. It allows us to make a decision with the restricted criteria of optimality on the desired levels, i.e., permits us to specify both the significance level and the power of the test, which is not achieved by the testing rules given in [1,3,4,5,10]. Along with the theoretical results, the results of the simulation are presented for the demonstration of the validity of the obtained results and the investigation of their properties.

The layout of the rest of this paper is as follows: A statement of the problem is given in Section 2. Approaches for handling directional hypotheses tests concerning the equi-correlation coefficient based on both the maximum ratio test and on Stein’s approach using CBM methodology are developed in Section 3. The general statement of the CBM of testing hypotheses is offered in Section 4, and the application of the CBM’s two possible statements to testing formulated hypotheses is given in Section 5 and Section 6. Statistical computations under concrete examples are presented in Section 7 followed by a discussion and brief conclusions included in Section 8 and Section 9.

2. The Problem Under Consideration

Consider first the problem of the testing of hypotheses concerning a specified value of the correlation coefficient of SMND.

Let the

k

-dimensional random vector

X

follow the

k

-variate normal distribution with zero mean vector and a correlation matrix

W

of the following structure:

W = (1 - ρ) \cdot I_{k \times k} + ρ \cdot J_{k \times k},

(1)

where

I_{k \times k}

is an identity matrix and

J_{k \times k}

is a matrix of ones.

It is known [1] that

W^{- 1} = {(c_{i j})}_{k \times k},

where

c_{i i} = \{1 + (k - 2) ρ\} / \{(1 - ρ) [1 + (k - 1) ρ]\},

c_{i j} = - ρ / \{(1 - ρ) [1 + (k - 1) ρ]\}, i \neq j,

and the density function of

X

for non-singular

W

is the following:

p (X; ρ) = \frac{1}{{(2 π)}^{k / 2} {|W|}^{1 / 2}} \exp \{- \frac{1}{2} [\frac{(\sum_{i = 1}^{k} x_{i}^{2})}{(1 - ρ)} + \frac{{(\sum_{i = 1}^{k} x_{i})}^{2} (- ρ)}{(1 + (k - 1) ρ) (1 - ρ)}]\},

(2)

- \infty < x_{i} < + \infty

,

i = 1, \dots, k

;

- 1 / (k - 1) < ρ < 1

.

The support of

ρ

,

(- 1 / (k - 1), 1)

assures the non-singularity of the correlation matrix

W

[4].

Let us introduce the Helmert orthogonal transformation

Y_{1}, Y_{2}, \dots

on the uncorrelated multivariate normal vectors sequence

X_{1}, X_{2}, \dots

, i.e.,

Y_{i} = H \cdot X_{i},

i = 1, 2, \dots,

and

H

is the Helmert matrix given as [2]

H = [\begin{matrix} \begin{matrix} \frac{1}{\sqrt{k}} & \frac{1}{\sqrt{k}} & \frac{1}{\sqrt{k}} & \dots & \frac{1}{\sqrt{k}} \\ \frac{1}{\sqrt{2}} & - \frac{1}{\sqrt{2}} & 0 & \dots & 0 \\ \frac{1}{\sqrt{6}} & \frac{1}{\sqrt{6}} & - \frac{2}{\sqrt{6}} & 0 & \dots & 0 \end{matrix} \\ ........................................................ \\ \frac{1}{\sqrt{k (k - 1)}} \dots \frac{1}{\sqrt{k (k - 1)}} - \frac{(k - 1)}{\sqrt{k (k - 1)}} \end{matrix}] .

(3)

Then, each of

Y_{1}, Y_{2}, \dots

are i.i.d.

k

-dimensional random vectors with zero means and a diagonal correlation matrix

V = H W H^{T} = d i a g (v_{1}, \dots, v_{k}),

(4)

where

v_{1}, \dots, v_{k}

are the eigenvalues of the correlation matrix

W

, i.e.,

Y_{i} ~ N_{k} (0, V)

[4]. It is known that [6]

v_{1} = 1 + (k - 1) ρ and v_{2} = \dots = v_{k} = 1 - ρ .

(5)

Let

Y_{1}, \dots, Y_{n}

be i.i.d. observations, i.e., a random sample from

N_{k} (0, V)

, and introduce variables

V_{1 n} = \sum_{i = 1}^{n} y_{i 1}^{2} and V_{2 n} = \sum_{i = 1}^{n} \sum_{j = 2}^{k} y_{i j}^{2},

(6)

where

Y_{i} = (y_{i 1}, y_{i 2}, \dots, y_{i k})

,

i = 1, \dots, n

.

The maximum likelihood estimation (MLE) of the correlation coefficient

ρ

is [2,19]

{\hat{ρ}}_{n} = \frac{V_{1 n} - V_{2 n} {(k - 1)}^{- 1}}{V_{1 n} + V_{2 n}} .

(7)

The problem we want to solve can be formulated as follows: to test

H_{0} : ρ = ρ_{0}, vs . H_{1} : ρ < (>) ρ_{0},

(8a)

or

H_{0} : ρ = ρ_{0} vs . H_{2} : ρ \neq ρ_{0}, ({2.8}^{2})

(8b)

on the basis of the sample

Y_{1}, \dots, Y_{n}

. Here,

Y_{i}

is a

k

-dimensional random vector with independent components each of which has zero mean and variances determined by (5).

The pdf of

Y_{i}

is

p (Y_{i} | ρ) = {(2 π)}^{- \frac{k}{2}} {|V|}^{- \frac{1}{2}} \cdot \exp \{- \frac{1}{2} Y_{i} \cdot V^{- 1} \cdot Y_{i}^{T}\}, i = 1, \dots, n .

(9)

The joint pdf of the sample

Y_{1}, \dots, Y_{n}

has the following form [1]:

p (Y_{1}, \dots, Y_{n} | ρ) = {(2 π)}^{- k n / 2} \cdot {|V|}^{- \frac{n}{2}} \cdot \exp \{- \frac{1}{2} \sum_{i = 1}^{n} Y_{i} \cdot V^{- 1} \cdot Y_{i}^{T}\} = = {(2 π)}^{- k n / 2} {(1 + (k - 1) ρ)}^{- \frac{n}{2}} {(1 - ρ)}^{- \frac{n (k - 1)}{2}} \cdot \exp \{- \frac{1}{2} (\frac{V_{1 n}}{1 + (k - 1) ρ} + \frac{V_{2 n}}{1 - ρ})\},

(10)

where

V_{1 n}

and

V_{2 n}

are defined by (6).

3. Testing (2.8) Hypotheses

Let us transform hypotheses (8) into directional ones, i.e., instead of (8), consider the following hypotheses:

H_{0} : ρ = ρ_{0} vs . H_{-} : - 1 / (k - 1) < ρ < ρ_{0} or H_{+} : ρ_{0} < ρ < 1,

(11)

ρ, ρ_{0} \in (- 1 / (k - 1), 1)

.

3.1. The Test Using Maximum Ratio Estimation of the Parameter

Consider testing the hypothesis

H_{0} : ρ = ρ_{0} vs . H_{A} : ρ = {\hat{ρ}}_{n}, ρ_{0} \neq {\hat{ρ}}_{n},

(12)

where

{\hat{ρ}}_{n}

is defined by (7).

We use the sample

Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})

and the pdf (10) with the parameters

ρ_{0}

and

{\hat{ρ}}_{n}

under the hypotheses

H_{0}

and

H_{A}

, respectively, i.e., pdfs at null and alternative hypotheses are determined by (10) using the values of

ρ_{0}

and

{\hat{ρ}}_{n}

for null and alternative specifications, respectively.

The algorithm for making decisions is the following. Let us designate

Y_{1}, \dots, Y_{n}, Y_{n + 1}, \dots, Y_{n + m}

i.i.d. random vectors obtained by the Helmert orthogonal transformation of the sample

X_{1}, \dots, X_{n}, X_{n + 1}, \dots, X_{n + m}

, i.e.,

Y_{i} = H \cdot X_{i},

i = 1, 2, \dots, n + m

. On the basis of the first half of the vectors

Y_{1}, \dots, Y_{n},

we compute the MLE of the parameter

ρ

by (7), and on the basis of the second half of the sample

Y_{n + 1}, \dots, Y_{n + m}

, we test the hypotheses (12). The pdfs used for testing are the following:

p (Y_{n + 1}, \dots, Y_{n + m} | H_{0}) = p (Y^{m} | ρ_{0})

and

p (Y_{n + 1}, \dots, Y_{n + m} | H_{A}) = p (Y^{m} | {\hat{ρ}}_{A})

determined by (10). When testing (12), the decision is made in favor of the alternative hypothesis, i.e., we accept

H_{-}

if

{\hat{ρ}}_{n} < ρ_{0}

, otherwise

H_{+}

.

3.2. Stein’s Approach

Stein’s method [20,21] integrates the density over

ρ

using special measures to obtain the density of the maximum invariant statistic, which can then be used to analyze the problems, for example, to find the uniformly most powerful invariant test [22].

The integrated conditional pdfs of the sample

Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})

are necessary in this case for the hypotheses under consideration, i.e., the following pdfs:

H_{0} : p (Y^{m} | H_{0}) = {(2 π)}^{- \frac{m k}{2}} {(1 + (k - 1) ρ_{0})}^{- \frac{m}{2}} {(1 - ρ_{0})}^{- \frac{m (k - 1)}{2}} \cdot \exp \{- \frac{1}{2} (\frac{V_{1 m}}{1 + (k - 1) ρ_{0}} + \frac{V_{2 m}}{1 - ρ_{0}})\},

(13)

H_{-} : p (Y^{m} | H_{-}) = \int_{- 1 / (k - 1)}^{ρ_{0}} p (Y^{m} | ρ) \cdot γ_{-} (ρ) d ρ,

(14)

H_{+} : p (Y^{m} | H_{+}) = \int_{ρ_{0}}^{1} p (Y^{m} | ρ) \cdot γ_{+} (ρ) d ρ,

(15)

where

p (Y^{m} | ρ)

is defined by (10) for the sample

Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})

;

γ_{-} (ρ)

and

γ_{+} (ρ)

are the densities of the parameter

ρ

at the alternative specifications

H_{-}

and

H_{+}

, respectively.

Because at alternative hypotheses

H_{-}

and

H_{+}

the value of the parameter

ρ

belongs to the intervals

(- 1 / (k - 1), ρ_{0})

and

(ρ_{0}, 1)

, respectively, and no information is available about the preference of some values from these intervals, we use the uniform distributions of

ρ

in the appropriate intervals, i.e., (14) and (15) pdfs transform in the following forms:

H_{-} : p (Y^{m} | H_{-}) = {(2 π)}^{- m k / 2} \cdot \frac{1}{ρ_{0} + {(k - 1)}^{- 1}} \cdot \cdot \int_{- 1 / (k - 1)}^{ρ_{0}} {(1 + (k - 1) ρ)}^{- \frac{m}{2}} {(1 - ρ)}^{- \frac{m (k - 1)}{2}} \cdot \exp \{- \frac{1}{2} (\frac{V_{1 m}}{1 + (k - 1) ρ} + \frac{V_{2 m}}{1 - ρ})\} d ρ,

(16)

H_{+} : p (Y^{m} | H_{+}) = {(2 π)}^{- m k / 2} \cdot \frac{1}{1 - ρ_{0}} \cdot

\cdot \int_{ρ_{0}}^{1} {(1 + (k - 1) ρ)}^{- \frac{m}{2}} {(1 - ρ)}^{- \frac{m (k - 1)}{2}} \cdot \exp \{- \frac{1}{2} (\frac{V_{1 m}}{1 + (k - 1) ρ} + \frac{V_{2 m}}{1 - ρ})\} d ρ,

(17)

where

V_{1 m}

and

V_{2 m}

are computed by (6) for the sample

Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})

.

For testing hypotheses (11) and (12), using pdfs (10) and (13), (16), and (17), respectively, let us use the CBM—a brief introduction to it is given below [22,23].

4. Constrained Bayesian Method of Testing Hypotheses

Let

x^{T} = (x_{1}, \dots, x_{n})

be generated from

p (x; θ)

, and the problem of interest is to test

H_{i} : θ_{i} \in Θ_{i}

,

i = 1, 2, \dots, S

, where

Θ_{i} \subset R^{m}

,

i = 1, 2, \dots, S

, are disjoint subsets with

\cup Θ_{i} = R^{m}

. The number of tested hypotheses is

S

. Let the prior on

θ

be denoted by

\sum_{i = 1}^{S} π (θ | H_{i}) p (H_{i})

, where for each

i = 1, 2, \dots, S

,

p (H_{i})

is the a priori probability of hypothesis

H_{i}

, and

π (θ | H_{i})

is a prior density with support

Θ_{i}

;

p (x | H_{i})

denotes the marginal density of

x

given

H_{i}

, i.e.,

p (x | H_{i}) = \int_{Θ_{i}} p (x | θ) π (θ | H_{i}) d θ

, and

D = \{d\}

is the set of solutions, where

d = \{d_{1}, \dots, d_{S}\}

, it being so that

d_{i} = \{\begin{cases} 1, i f h y p o t h e i s H_{i} i s a c c e p t e d, \\ 0, o t h e r w i s e, \end{cases}

δ (x) = \{δ_{1} (x), δ_{2} (x), \dots, δ_{S} (x)\}

is the decision function that associates each observation vector

x

with a certain decision:

x \overset{δ (x)}{\to} d \in D;

(notation: depending upon the choice of

x

, there is a possibility that

δ_{j} (x) = 1

for more than one

j

or

δ_{j} (x) = 0

for all

j = 1, \dots, S

).

Γ_{j}

is the region of acceptance of the hypothesis

H_{j}

, i.e.,

Γ_{j} = \{x : δ_{j} (x) = 1\}

. It is obvious that

δ (x)

is completely determined by the

Γ_{j}

regions, i.e.,

δ (x) = \{Γ_{1}, Γ_{2}, \dots, Γ_{S}\}

.

Let us introduce the loss function

L (H_{i}, δ (x))

which determines the value of the loss in the case where the sample has a probability distribution corresponding to hypothesis

H_{i}

, but the decision

δ (x)

is made wrongly. In the general case, the loss function

L (H_{i}, δ (x))

consists of two components:

L (H_{i}, δ (x)) = \sum_{j = 1}^{S} L_{1} (H_{i}, δ_{j} (x) = 1) + \sum_{j = 1}^{S} L_{2} (H_{i}, δ_{j} (x) = 0),

(18)

where

L_{1} (H_{i}, δ_{j} (x) = 1)

is the loss for the incorrect acceptance of

H_{i}

when

H_{j}

is true and

L_{2} (H_{i}, δ_{j} (x) = 0)

is the incorrect rejection of

H_{i}

in favor of

H_{j}

.

It is possible to formulate nine different statements of the CBM depending on what type of restriction is desired, determined by the aim of the specific problem to be solved [22,23].

For concreteness, let us introduce one of the possible statements, namely Task 1, as an example, for a demonstration of the specificity of the CBM. In this case, we have to minimize the averaged loss of the incorrectly accepted hypotheses

r_{δ} = \min_{\{Γ_{j}\}} \{\sum_{i = 1}^{S} p (H_{i}) \sum_{j = 1}^{S} \int_{Γ_{j}} L_{1} (H_{i}, δ_{j} (x) = 1) p (x | H_{i}) d x\},

(19)

subject to the averaged loss of the incorrectly rejected hypotheses

\sum_{i = 1}^{S} p (H_{i}) \sum_{j = 1}^{S} \int_{R^{n} - Γ_{j}} L_{2} (H_{i}, δ_{j} (x) = 0) p (x | H_{i}) d x = = \sum_{i = 1}^{S} p (H_{i}) \sum_{j = 1}^{S} \int_{R^{n}} L_{2} (H_{i}, δ_{j} (x) = 0) p (x | H_{i}) d x -

- \sum_{i = 1}^{S} p (H_{i}) \sum_{j = 1}^{S} \int_{Γ_{j}} L_{2} (H_{i}, δ_{j} (x) = 0) p (x | H_{i}) d x \leq r_{1},

(20)

where

r_{1}

is some real number determining the level of the averaged loss of the incorrectly rejected hypotheses.

By solving problem (19) and (20), we have

Γ_{j} = \{x : \sum_{i = 1}^{S} L_{1} (H_{i}, δ_{j} (x) = 1) p (H_{i}) p (x | H_{i}) < λ \sum_{i = 1}^{S} L_{2} (H_{i}, δ_{j} (x) = 0) p (H_{i}) p (x | H_{i})\},

j = 1, \dots, S,

(21)

where the Lagrange multiplier

λ

(

λ > 0

) is defined so that the equality holds in (20).

The acceptance regions (21) of the hypotheses differ from the classical cases where the observation space is divided into two complementary sub-spaces for acceptance and rejection regions. Here, the observation space contains the regions where decisions can be made as well as the regions where they cannot be made. This also gives us the opportunity to develop the corresponding sequential tests [22,23,24].

5. CBM for Testing Hypotheses in (3.1)

Let us consider the CBM for testing the hypotheses (11) with a stepwise loss function.

One of possible statements of the CBM, namely, the so-called Task 7, has the following form [22,23,24]:

G_{δ} = \max_{\{Γ_{-}, Γ_{0}, Γ_{+}\}} \{K_{0} \cdot [p (H_{-}) \cdot P (Y^{m} \in Γ_{-} | H_{-}) + p (H_{0}) \cdot P (Y^{m} \in Γ_{0} | H_{0}) +

+ p (H_{+}) \cdot P (Y^{m} \in Γ_{+} | H_{+})]\},

(22)

subject to

K_{1} \cdot [p (H_{0}) \cdot P (Y^{m} \in Γ_{-} | H_{0}) + p (H_{+}) \cdot P (Y^{m} \in Γ_{-} | H_{+})] \leq r_{7}^{-},

K_{1} \cdot [p (H_{-}) \cdot P (Y^{m} \in Γ_{0} | H_{-}) + p (H_{+}) \cdot P (Y^{m} \in Γ_{0} | H_{+})] \leq r_{7}^{0},

K_{1} \cdot [p (H_{-}) \cdot P (Y^{m} \in Γ_{+} | H_{-}) + p (H_{0}) \cdot P (Y^{m} \in Γ_{+} | H_{0})] \leq r_{7}^{+},

(23)

where

p (H_{0})

,

p (H_{-})

, and

p (H_{+})

are a priori probabilities of the appropriate hypotheses,

P (Y^{m} \in Γ_{i} | H_{j})

,

i, j \in (-, 0, +)

, is the probability of the acceptance of the

H_{i}

hypothesis when the

H_{j}

hypothesis is true, on the basis of

Y^{m}

, and

K_{1}

and

K_{0}

define the loss functions of incorrectly accepting and incorrectly rejecting the hypotheses given by

L_{1} (H_{i}, δ_{j} (Y^{m}) = 1) = \{\begin{cases} 0 a t i = j, \\ K_{1} a t i \neq j; \end{cases} and L_{2} (H_{i}, δ_{j} (Y^{m}) = 0) = \{\begin{cases} K_{0} a t i = j, \\ 0 a t i \neq j; \end{cases}

(24)

δ (Y^{m}) = \{δ_{-} (Y^{m}), δ_{0} (Y^{m}), δ_{+} (Y^{m})\}

is a decision function,

Γ_{j} = \{Y^{m} : δ_{j} (Y^{m}) = 1\}

,

j \in (-, 0, +)

, is the acceptance region of the hypothesis

H_{j}

, and

r_{7}^{-}

,

r_{7}^{0}

, and

r_{7}^{+}

are the specified levels of the averaged losses of the incorrect acceptances of the hypotheses

H_{-}

,

H_{0}

, and

H_{+}

, respectively.

The solution of the problem (22) and (23) using undetermined Lagrange multipliers yields the acceptance regions for the hypotheses as follows:

Γ_{-} = \{Y^{m} : K_{1} \cdot (p (H_{0} | Y^{m}) + p (H_{+} | Y^{m})) < \frac{1}{λ_{7}^{-}} \cdot K_{0} \cdot p (H_{-} | Y^{m})\},

Γ_{0} = \{Y^{m} : K_{1} \cdot (p (H_{-} | Y^{m}) + p (H_{+} | Y^{m})) < \frac{1}{λ_{7}^{0}} \cdot K_{0} \cdot p (H_{0} | Y^{m})\},

Γ_{+} = \{Y^{m} : K_{1} \cdot (p (H_{-} | Y^{m}) + p (H_{0} | Y^{m})) < \frac{1}{λ_{7}^{+}} \cdot K_{0} \cdot p (H_{+} | Y^{m})\},

(25)

where the Lagrange multipliers

λ_{7}^{-}

,

λ_{7}^{0}

, and

λ_{7}^{+}

are determined so that the equalities hold in the conditions (23).

In accordance with the theorems proven in [24], when the circumstances

\frac{λ_{7}^{-} + λ_{7}^{+}}{K_{1} \cdot P_{\min}} = q

,

\frac{λ_{7}^{-} + λ_{7}^{0} + λ_{7}^{+}}{K_{1} \cdot P_{\min}} = q

(

0 < q < 1

) are satisfied, where

P_{\min} = \min \{p (H_{-}), p (H_{0}), p (H_{+})\}

, the conditions

m d F D R = S E R R_{I I I} \leq q

or

m d F D R + F A R \leq q

are fulfilled. Here, the following criteria of optimality of arriving at the decisions are used: the mixed directional false discovery rate (

m d F D R

), the summary type III error rate (

S E R R_{I I I}

), and the false acceptance rate (

F A R

). They are determined by the following formulae:

m d F D R = P (Y^{m} \in Γ_{-} | H_{+}) + P (Y^{m} \in Γ_{-} | H_{0}) + P (Y^{m} \in Γ_{+} | H_{-}) + P (Y^{m} \in Γ_{+} | H_{0}),

(26)

F A R = P (Y^{m} \in Γ_{0} | H_{-}) + P (Y^{m} \in Γ_{0} | H_{+}) .

(27)

Another possible statement of the CBM, namely, the so-called Task 2, has the following form [22,23,24]:

R_{δ} = \min_{\{Γ_{-}, Γ_{0}, Γ_{+}\}} \{p (H_{-}) \cdot K_{1} \cdot [P (Y^{m} \in Γ_{0} | H_{-}) + P (Y^{m} \in Γ_{+} | H_{-})] +

+ p (H_{0}) \cdot K_{1} \cdot [P (Y^{m} \in Γ_{-} | H_{0}) + P (Y^{m} \in Γ_{+} | H_{0})] +

+ p (H_{+}) \cdot K_{1} \cdot [P (Y^{m} \in Γ_{-} | H_{+}) + P (Y^{m} \in Γ_{0} | H_{+})]\},

(28)

subject to

P (Y^{m} \in Γ_{-} | H_{-}) \geq 1 - \frac{r_{2}^{-}}{p (H_{-}) \cdot K_{0}}, P (Y^{m} \in Γ_{0} | H_{0}) \geq 1 - \frac{r_{2}^{0}}{p (H_{0}) \cdot K_{0}},

P (Y^{m} \in Γ_{+} | H_{+}) \geq 1 - \frac{r_{2}^{+}}{p (H_{+}) \cdot K_{0}},

(29)

where

r_{2}^{-}

,

r_{2}^{0}

, and

r_{2}^{+}

are specification levels in the considered statement.

The solution of the problem of (28) and (29) by the Lagrange method yields the following acceptance regions for the hypotheses:

Γ_{-} = \{Y^{m} : K_{1} \cdot (p (H_{0} | Y^{m}) + p (H_{+} | Y^{m})) < λ_{2}^{-} \cdot K_{0} \cdot p (H_{-} | Y^{m})\},

Γ_{0} = \{Y^{m} : K_{1} \cdot (p (H_{-} | Y^{m}) + p (H_{+} | Y^{m})) < λ_{2}^{0} \cdot K_{0} \cdot p (H_{0} | Y^{m})\},

Γ_{+} = \{Y^{m} : K_{1} \cdot (p (H_{-} | Y^{m}) + p (H_{0} | Y^{m})) < λ_{2}^{+} \cdot K_{0} \cdot p (H_{+} | Y^{m})\},

(30)

where the Lagrange multipliers

λ_{2}^{-}

,

λ_{2}^{0}

, and

λ_{2}^{+}

are determined so that the equalities hold in the conditions (29).

Theorems are proven in [24] in accordance with which (30) decision regions ensure specified error rates of type I and type II, determined by the following ratios:

α \leq \frac{r_{2}^{0}}{K_{0} \cdot p (H_{0})}, β \leq \frac{r_{2}^{-}}{K_{0} \cdot p (H_{-})} + \frac{r_{2}^{+}}{K_{0} \cdot p (H_{+})} .

(31)

They also ensure the specification of the averaged loss of incorrectly accepted hypotheses

R_{δ}

and the averaged loss of incorrectly rejected hypotheses

α_{δ}

by the following ratios:

R_{δ} \leq \frac{K_{1}}{K_{0}} \cdot [r_{2}^{-} + r_{2}^{0} + r_{2}^{+}], α_{δ} \leq r_{2}^{-} + r_{2}^{0} + r_{2}^{+},

(32)

where

α_{δ} = p (H_{-}) \cdot K_{0} \cdot (1 - P (Y^{m} \in Γ_{-} | H_{-})) + p (H_{0}) \cdot K_{0} \cdot (1 - P (Y^{m} \in Γ_{0} | H_{0})) +

+ p (H_{+}) \cdot K_{0} \cdot (1 - P (Y^{m} \in Γ_{+} | H_{+})) .

6. Evolution of CBM 2 for Testing (3.1) Hypotheses

6.1. Using the Maximum Ratio Estimation

Like (30), the hypotheses acceptance regions, testing (12) hypotheses using the CBM 2, have the following forms:

Γ_{0} = \{Y^{m} : K_{1} \cdot p (H_{A}) \cdot P (Y^{m} | H_{A}) < λ_{2}^{0} \cdot K_{0} \cdot p (H_{0}) \cdot P (Y^{m} | H_{0})\},

Γ_{A} = \{Y^{m} : K_{1} \cdot p (H_{0}) \cdot P (Y^{m} | H_{0}) < λ_{2}^{A} \cdot K_{0} \cdot p (H_{A}) \cdot P (Y^{m} | H_{A})\},

(33)

where the Lagrange multipliers

λ_{2}^{0}

and

λ_{2}^{A}

are determined from the following conditions [23]:

K_{0} \cdot p (H_{0}) \cdot (1 - P (Y^{m} \in Γ_{0} | H_{0})) = r_{2}^{0},

K_{0} \cdot p (H_{A}) \cdot (1 - P (Y^{m} \in Γ_{A} | H_{A})) = r_{2}^{A},

(34)

where

r_{2}^{0}

and

r_{2}^{A}

are the specified levels under

H_{0}

and

H_{A}

, respectively.

For the conditional pdfs of

Y^{m}

, we have (13) under

H_{0}

and the following under

H_{A}

:

H_{A} : P (Y^{m} | H_{A}) = {(2 π)}^{- \frac{m k}{2}} {(1 + (k - 1) {\hat{ρ}}_{n})}^{- \frac{m}{2}} {(1 - {\hat{ρ}}_{n})}^{- \frac{m (k - 1)}{2}} \cdot \exp \{- \frac{1}{2} (\frac{V_{1 m}}{1 + (k - 1) {\hat{ρ}}_{n}} + \frac{V_{2 m}}{1 - {\hat{ρ}}_{n}})\} .

(35)

Let us determine the Lagrange multipliers

λ_{2}^{0}

and

λ_{2}^{A}

in (33) of (34). The acceptance regions (33) of the hypotheses under the conditional densities (13) and (35), after simple transformations, take the forms

Γ_{0} = \{Y^{m} : (d_{1}^{0} \cdot V_{1 m} + d_{2}^{0} \cdot V_{2 m}) < 2 (g_{2}^{0} - g_{1}^{0})\},

(36)

where

d_{1}^{0} = \frac{1}{1 + (k - 1) ρ_{0}} - \frac{1}{1 + (k - 1) {\hat{ρ}}_{n}},

d_{2}^{0} = \frac{1}{1 - ρ_{0}} - \frac{1}{1 - {\hat{ρ}}_{n}},

g_{1}^{0} = \frac{m}{2} \cdot \ln (\frac{1 + (k - 1) ρ_{0}}{1 + (k - 1) {\hat{ρ}}_{n}}) + \frac{m (k - 1)}{2} \cdot \ln (\frac{1 - ρ_{0}}{1 - {\hat{ρ}}_{n}}),

g_{2}^{0} = \ln (λ_{2}^{0} \cdot \frac{p (H_{0}) \cdot K_{0}}{p (H_{A}) \cdot K_{1}}) .

(37)

Γ_{A} = \{Y^{m} : (d_{1}^{A} \cdot V_{1 m} + d_{2}^{A} \cdot V_{2 m}) < 2 (g_{2}^{A} - g_{1}^{A})\},

(38)

where

d_{1}^{A} = - d_{1}^{0}, d_{2}^{A} = - d_{2}^{0}, g_{1}^{A} = - g_{1}^{0}, g_{2}^{A} = \ln (λ_{2}^{A} \cdot \frac{p (H_{A}) \cdot K_{0}}{p (H_{0}) \cdot K_{1}}) .

(39)

For the determination of the Lagrange multipliers

λ_{2}^{0}

and

λ_{2}^{A}

from the conditions (34), the computation of the probabilities

P (Y^{m} \in Γ_{0} | H_{0})

and

P (Y^{m} \in Γ_{A} | H_{A})

is essential. For this purpose, knowledge of the pdfs of the random variables

ξ_{0} = d_{1}^{0} \cdot V_{1 m} + d_{2}^{0} \cdot V_{2 m},

ξ_{A} = d_{1}^{A} \cdot V_{1 m} + d_{2}^{A} \cdot V_{2 m},

(40)

is necessary.

The properties of

V_{1 m}

and

V_{2 m}

are given in [1,4]:

(i) V_{1 m} ~ (1 + (k - 1) ρ) χ_{m}^{2};

(ii) V_{2 m} ~ (1 - ρ) χ_{m (k - 1)}^{2};

(41)

(iii) V_{1 m} and V_{2 m} are independent .

The distribution function of a linear combination of chi-square random variables is considered in many works [24,25,26,27]. Below, we use the results of [26], in accordance with which the distribution function

F^{l} (z, ρ)

of

ξ_{l}

,

l \in (0, A)

, from (40) is

F^{l} (z, ρ) = b_{2}^{l} \sum_{j = 0}^{\infty} a_{j}^{l} \int_{0}^{z} f_{j}^{l} (y, ρ) d y,

(42)

where the formulae for the determination of the coefficients

b_{2}^{l}

,

a_{j}^{l}

,

j = 0, 1, \dots

, and

f_{j}^{l} (y, ρ)

density are given in Appendix A.

Using (42), conditions (34) will be written as follows:

F^{0} (2 \cdot (g_{2}^{0} - g_{1}^{0}), ρ_{0}) = 1 - \frac{r_{2}^{0}}{p (H_{0}) \cdot K_{0}},

F^{A} (2 \cdot (g_{2}^{A} - g_{1}^{A}), {\hat{ρ}}_{n}) = 1 - \frac{r_{2}^{A}}{p (H_{A}) \cdot K_{0}},

(43)

We solve the above with respect to the Lagrange multipliers

λ_{2}^{0}

and

λ_{2}^{A}

. Then, using the sample

Y^{m}

, we can make the decision depending on which of the conditions (33) is satisfied.

Note 1. Because of the specificity on the acceptance regions of the hypotheses, in the CBM, they can be intersected or their union may not lead to the entire observation space. Thus, the situation can arise such that none of the (12) hypotheses are accepted on the basis of the sample $Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})$ . In this case, we enhance the sequential approach, i.e., we increase the sample size by one and test hypotheses (12). If a unique decision is not made on the basis of the sample $Y^{m + 1} = (Y_{n + 1}, \dots, Y_{n + m}, Y_{n + m + 1})$ , we again increase its size by one, i.e., we obtain the sample $Y^{m + 2} = (Y_{n + 1}, \dots, Y_{n + m}, Y_{n + m + 1}, Y_{n + m + 2})$ and so on until a unique decision is made.
Note 2. The decision-making procedure can be purely sequential when we start testing (12) hypotheses for $m = 1$ . Otherwise, it is combined: after the original testing approach, if on the basis of $m$ observations a simple decision is not made, we adopt the sequential approach and proceed as above until a decision is made.

Notes 1 and 2 concern the Stein’s approach too, which is described below.

6.2. Using the Stein’s Approach

We generate the sample vectors

Y_{i, l} ~ N_{k} (0, V)

,

i = 1, \dots, n

,

l \in (-, 0, +)

, where

N_{k} (0, V)

is a

k

-variate normal distribution. As a result, we have

Y_{i, l} = (y_{i 1}^{l}, \dots, y_{i k}^{l})

,

i = 1, \dots, n

, where the components of the vectors

Y_{i, l}

are independent and are given by the ratios

y_{i 1}^{l} ~ N (0, 1 + (k - 1) ρ_{l}), y_{i j}^{l} ~ N (0, 1 - ρ_{l}), j = 2, \dots, k .

(44)

Here,

N (0, θ)

is a one-dimensional normal distribution with zero mean and a variance equal to

θ

. Depending on which of equation from (5.8) we solve,

ρ_{l}

takes values

ρ_{l} \in (- 1 / (k - 1), ρ_{0}), ρ_{l} = ρ_{0}, or ρ_{l} \in (ρ_{0}, 1),

(45)

under

H_{-}

,

H_{0}

, or

H_{+}

hypotheses, respectively.

Let us suppose that for the vectors

Y_{i, l}

,

i = 1, \dots, n

, condition

Y_{i, l} \in Γ_{l}

is fulfilled

n_{l}

times; then, we have

P (Y_{l} \in Γ_{l} | H_{l}) \approx \frac{n_{l}}{n},

(46)

where

Y_{l} = (y_{1}^{l}, \dots, y_{k}^{l})

and

Γ_{l}

,

l \in (-, 0, +)

, are defined by (30).

By changing

λ_{2}^{l}

in

Γ_{l}

, we achieve the fulfilment of the condition

|P (Y_{l} \in Γ_{l} | H_{l}) - (1 - \frac{r_{2}^{l}}{p (H_{l}) \cdot K_{0}})| \leq ε,

(47)

where

ε

is the desired accuracy of the solution of the Equation (29).

For the next generated value

Y_{n + 1, l}

and already determined

λ_{2}^{l}

,

l \in (-, 0, +)

, we test the conditions (30) using distribution densities

p (Y_{n + 1, l} | H_{t}) = {(2 π)}^{- \frac{k}{2}} \cdot {(1 + (k - 1) ρ_{t})}^{- \frac{1}{2}} \cdot {(1 - ρ_{t})}^{- \frac{k - 1}{2}} \cdot \exp \{\frac{1}{2} (\frac{{(y_{n + 1, 1}^{l})}^{2}}{(1 + (k - 1) ρ_{t})} + \sum_{j = 2}^{k} \frac{{(y_{n + 1, j}^{l})}^{2}}{(1 - ρ_{t})})\}, l, t \in (-, 0, +) .

That particular hypothesis is accepted which belongs to the acceptance region

Y_{n + 1, l}

corresponding to it. If the accepted hypothesis is

H_{l}

, a decision is correct; otherwise, the decision is incorrect.

If none of the hypotheses are accepted, a new random vector

Y_{n + 2, l}

is generated. Then, the conditions of (30) are tested using (10) (in which

n = 2

) until one of the tested hypotheses is not accepted.

Note 3. The fact that ${\hat{ρ}}_{n}$ is asymptotically normally distributed when $n \to \infty$ is shown in [19], i.e.,

\sqrt{n} ({\hat{ρ}}_{n} - ρ) \to N (0, σ_{ρ}^{2}), where σ_{ρ}^{2} = \frac{2 {(1 - ρ)}^{2} {(1 + (k - 1) ρ)}^{2}}{k (k - 1)} .

(48)

On the basis of this fact, for a quite big

n

, we can test the directional hypotheses (11) using the CBM for the following conditional distributions, instead of (13), (16), and (17):

H_{0} : p (z | H_{0}) = N (z; ({\hat{ρ}}_{n} - ρ_{0}), σ_{ρ_{0}}^{2}),

(49)

H_{-} : p (z | H_{-}) = \frac{1}{ρ_{0} + 1} \int_{- 1}^{ρ_{0}} N (z; ({\hat{ρ}}_{n} - ρ), σ_{ρ}^{2}) d ρ,

(50)

H_{+} : p (z | H_{+}) = \frac{1}{1 - ρ_{0}} \int_{ρ_{0}}^{1} N (z; ({\hat{ρ}}_{n} - ρ), σ_{ρ}^{2}) d ρ .

(51)

For testing hypotheses (12) using the maximum ratio test, we have to use the following densities:

H_{0} : p (z | H_{0}) = N (z; ({\hat{ρ}}_{n} - ρ_{0}), σ_{ρ_{0}}^{2}),

H_{A} : p (z | H_{A}) = N (z; 0, σ_{{\hat{ρ}}_{n}}^{2}) .

(52)

Here,

N (z; a, σ^{2})

is a normal distribution with a mathematical expectation

a

and variance

σ^{2}

.

7. Computation Results

The CBM 2 for both the maximum ratio test and for Stein’s approach with the uniform distribution of the densities of the parameter

ρ

at the alternative specifications

H_{-}

and

H_{+}

was obtained through MATLAB R2021b codes, both for making computations and the investigation of the theoretical results, which are given above. The CBM with the maximum likelihood estimation using both densities (13) and (35) or densities (52) gives very unreliable results due to the informational closeness of the null and alternative hypotheses. Therefore, the computational results for only the Stein’s approach for both the cases when the densities are given by formulae (13), (16), and (17) as well as when they are given by formulae (49), (50), and (51) are given below. The Bayes method when the distribution densities are given by the formulae (49), (50), and (51) gives very unreliable results. Therefore, its results for only the distribution densities of (13), (16), and (17) are given below.

For the implementation of the CBM 2 using the Stein’s approach for testing the hypotheses of (11) for the distributions (13), (16), and (17), the following algorithm was used:

Estimation of the parameter $ρ$ is computed by (7) using the sample $Y_{1}, \dots, Y_{n}$ .
The values of the Lagrange multipliers $λ_{2}^{0}$ , $λ_{2}^{-}$ , and $λ_{2}^{+}$ are determined by solving equations (29) using densities (13), (16), and (17); necessary integrals are computed by the Monte Carlo method.
The sample $Y^{m} = (Y_{n + 1}, \dots, Y_{n + m})$ with the size $m$ distributed by one of the distributions (13), (16), and (17), depending on which hypothesis we test, is used for making a decision using (30) acceptance regions for the hypotheses.

The same algorithm was used for testing (11) hypotheses in the second case where the densities (13), (16), and (17) were changed by the densities (49), (50), and (51).

Example: The results of testing (11) hypotheses using the CBM 2 (acceptance regions for the hypotheses are (30) where Lagrange multipliers are defined from (29)) or Bayes rules (hypotheses acceptance regions are (30) where Lagrange multipliers are equal to 1) for making a decision are given in Appendix B and Appendix C. The following notations are used there: AN—averaged number of observations necessary for making a decision;

k

—the size of the observation vector;

n

—the number of observations for computing the

ρ

parameter’s estimation;

ρ_{0}

—the value of the correlation coefficient at hypothesis

H_{0}

;

λ_{2}^{0}

,

λ_{2}^{-}

, and

λ_{2}^{+}

—Lagrange multipliers under hypotheses

H_{0}

,

H_{-}

, and

H_{+}

, respectively;

r_{2}^{0}

,

r_{2}^{-}

, and

r_{2}^{+}

—restriction levels in (29) determining probabilities of correct decisions at

H_{0}

,

H_{-}

, and

H_{+}

, respectively;

m

—the number of generated random variables for computing probabilities of acceptance of hypothesis

H_{i}

under hypothesis

H_{j}

, i.e., for computing probabilities

P (x \in Γ_{i} | H_{j})

,

i, j \in (-, 0, +)

; and Averaged—averaged values of the appropriate variables obtained at different computations (runs of the program) for one scenario.

In all computations presented in Appendix B and Appendix C, the following values were used:

k = 5

;

m = 5000

; a priori probabilities of hypotheses

p (H_{0}) =

1/3,

p (H_{-}) =

1/3, and

p (H_{A}) =

1/3;

r_{2}^{0} = r_{2}^{-} = r_{2}^{+} = 0.01 (6)

; and the accuracy of the solution of Equation (29) is equal to 0.001.

8. Discussion

The results of the computation showed the following characteristics of the considered problem.

When using the (13), (16), and (17) distribution laws in the CBM 2, the following was observed:

Distribution laws (13), (16), and (17) are very close to each other for hypotheses (11), as can be seen from the computed values of the divergences between hypotheses and from their graphical representations given in Figure A1 of Appendix D.
The minimum divergence between testing hypotheses that can be discriminated with given reliability decreases with increasing the size of the random vector. This fact is seen in Table of Appendix E and in the graphs of Figure A1.
The absolute value of the difference between correlation coefficients corresponding to the null and alternative hypotheses equal to 0.05, i.e., $|ρ_{0} - ρ_{A}| = 0.05$ , is sufficient in the CBM for making the correct decision with a high reliability for $k = 15$ .
For the case $k = 5$ , which is computed in Appendix B, $|ρ_{0} - ρ_{A}|$ must be no less than 0.18 or 0.20 in the CBM for different values of $ρ_{0}$ for testing hypotheses with given reliabilities (equal to 0.05). The Bayes method does not guarantee the reliability of a decision equal to 0.05 for some divergences (see Appendix B).
The number of observations necessary for making a decision is equal to 51.
The number of observations for computing the $ρ$ parameter’s estimation is equal to 50.
The Bayes method uses fifty observations for computing the estimation of $ρ$ and only one additional observation for making a decision. In general, it gives worse results than the CBM 2 (see Appendix B).
Conditions (31) and (32) are fulfilled for the CBM 2. This is violated for the Bayes method for some values of $|ρ_{0} - ρ_{A}|$ .

For the practical use of the presented method when the alternative values of the correlation coefficient are not known for testing hypotheses, we use Lagrange multipliers computed for a divergence equal to 0.18 between correlation coefficients. In this case, the necessary reliability of the decisions made when the divergence between the correlation coefficients of null and alternative hypotheses is greater than 0.18 is guaranteed. This is due to the fact that with the increase in the informational distance between the hypotheses, the errors of the decisions made decrease [28,29].

The probabilities of correct decisions for the different values of

ρ_{0}

and for different divergences between correlations of null and alternative hypotheses when the Lagrange multipliers correspond to 0.18—to the minimal value of the divergence—are given in Appendix E.

The following was found using the (49), (50), and (51) distribution densities in the CBM 2:

It is seen from the (49), (50), and (51) formulae that the distribution densities depend on $ρ_{0}$ and ${\hat{ρ}}_{n}$ (the last is ML estimator of $ρ_{0}$ ). Therefore, the computed values of $λ_{2}^{-}$ , $λ_{2}^{0}$ , and $λ_{2}^{+}$ are the same for all possible alternative hypotheses $H_{-}$ and $H_{+}$ when given $ρ_{0}$ , and correct decisions are made with identical reliability in all cases.
Based on what has been stated above, in practical applications, we compute the Lagrange multipliers for given $ρ_{0}$ and for the minimal distance between the (11) testing hypotheses (for example, equal to 0.05) and use them to test the $H_{0}$ hypothesis versus any possible $H_{-}$ and $H_{+}$ hypotheses.
The number of observations necessary for making a decision is equal to 26 in the considered case.
The number of observations for computing the $ρ$ parameter’s estimation, i.e., for ${\hat{ρ}}_{n}$ , is equal to 15 in the considered case.
Conditions (31) and (32) are fulfilled.

Despite that the divergences between distribution densities (49), (50), and (51) are also very close to each other for the (11) hypotheses, the CBM 2 using Stein’s approach for these distributions gives better results than the use of distribution laws (13), (16), and (17). The superiority is in the number of used observations, providing necessary reliabilities, and also in the convenience of implementation in practical situations. The basic superiority of this case is that for testing the (11) hypotheses the knowledge of

ρ_{A}

is not necessary. Only the knowledge of

ρ_{0}

and the Lagrange multipliers computed for the values

|ρ_{0} - ρ_{A}| = 0.05

are required, i.e., in its practical use, for any possible values of

ρ_{0}

and

ρ

-s, corresponding to alternative hypotheses

H_{-}

and

H_{+}

such that

|ρ_{0} - ρ_{A}| = 0.05

, the Lagrange multipliers are computed and they are used in real testing processes when the exact values of

ρ_{A}

are not known but

|ρ_{0} - ρ_{A}| \geq 0.05

. Theorems confirming the convergence of the testing algorithms for both the (13), (16), and (17) and (49), (50), and (51) distribution densities can be found in [24,29]. Also, as was mentioned above, the results in Table A1 clearly demonstrate that conditions (31) and (32) are fulfilled for both cases.

Because the Bayes method yields very bad results when

|ρ_{0} - ρ_{A}| < 0.18

, for this case, the computation results of the Bayes method are not provided when the distributions are (49), (50), and (51).

Note 4. Because the noted peculiarities remain in force for other possible values of $ρ_{0}$ , the computation results for only $ρ_{0} = 0$ are given in Appendix C to conserve space.

It is worth separately noting the fact that the method proposed in this paper can achieve better results than other methods known to us for the considered problem [1,5,10]. For example, comparing the results given in Appendix C of this paper with the results of the beta-optimal methods given in Table 5 of [5] clearly shows the superiority of the proposed method. On the other hand, in [5] (see p. 87 “Summary”) it is mentioned that the beta-optimal methods outperform SenGupta’s test, but those are motivated differently.

9. Conclusions

Constrained Bayesian and classical Bayes methods, using the maximum likelihood estimation and Stein’s approach, for testing the equi-correlation coefficient of a standard symmetric multivariate normal distribution are developed and investigated. For the investigation of the obtained theoretical results and choosing the best among them, different practical examples are computed using computer codes developed in MATLAB R2021b. The simulation results showed that the CBM using Stein’s approach gives opportunities to make decisions with higher reliability for testing the (11) hypotheses than the Bayes method. Also, the CBM 2 using Stein’s approach with distribution densities (49), (50), and (51) gives better results in comparison with the (13), (16), and (17) densities in terms of the number of observations needed and providing the necessary reliabilities, as well the convenience of their implementations in practice. Recommendations and guidelines for the use of the proposed methods for solving practical problems are given.

Author Contributions

Conceptualization, K.K. and A.S.; methodology, K.K.; software, K.K.; validation, K.K. and A.S.; formal analysis, K.K. and A.S.; investigation, K.K.; resources, A.S.; data curation, K.K.; writing—original draft preparation, K.K.; writing—review and editing, A.S.; visualization, K.K.; supervision, A.S.; project administration, K.K.; funding acquisition, no funding. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All computations were made on the basis of simulated data, which can be repeated by any interested researcher based on the data described in Item 7.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. An Algorithm for Computation of (42) Distribution Function

The algorithm is developed on the basis of the results of [25].

Let us consider random variable

ξ_{l} = q_{1}^{l} \cdot V_{1 m} + q_{2}^{l} \cdot V_{2 m}, l \in (0, A),

(A1)

where

q_{1}^{l} = d_{1}^{l} \cdot (1 + (k - 1) ρ) and q_{2}^{l} = d_{2}^{l} \cdot (1 - ρ),

(A2)

where

d_{1}^{l}

and

d_{2}^{l}

are determined by (37) and (39).

Then, the probability distribution function of

ξ_{l}

is [25]

F^{l} (z, ρ) = b_{2}^{l} \sum_{j = 0}^{\infty} a_{j}^{l} \int_{0}^{z} f_{j}^{l} (y, ρ) d y,

(A3)

where

m_{1} = \frac{m}{2}, m_{2} = \frac{m (k - 1)}{2}, s = m_{1} + m_{2} = \frac{m k}{2},

{(m)}_{0} = 1, {(m)}_{j} = m (m + 1) \dots (m + j - 1),

b_{2}^{l} = {(\frac{d_{1}^{l} \cdot (1 + (k - 1) ρ)}{d_{2}^{l} \cdot (1 - ρ)})}^{\frac{m (k - 1)}{2}},

a_{j}^{l} = {(m_{2})}_{j} \cdot {(1 - q_{1}^{l} / q_{2}^{l})}^{j} / j!,

f_{j}^{l} (y, ρ) = (y^{s + j - 1} \cdot \exp (- \frac{y}{2 q_{1}^{l}})) / ({(2 q_{1}^{l})}^{s + j} \cdot Γ (s + j)) .

Appendix B. Stein’s and Bayes Methods Using Distributions (13), (16), and (17)

(A). Hypothesis $H_{0}$ is true. The samples for making decisions are generated by (13) with the size $m = 5000$ .
$ρ_{0}$ $n =$ 50		CBM										Bayes Method
		AN		$P (x \in Γ_{-} \| H_{0})$			$P (x \in Γ_{0} \| H_{0})$		$P (x \in Γ_{+} \| H_{0})$	$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$		$P (x \in Γ_{-} \| H_{0})$			$P (x \in Γ_{0} \| H_{0})$			$P (x \in Γ_{+} \| H_{0})$
−0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.145 Averaged		(1) 51.0490 (2) 51.0569 (3) 51.0552 (4) 51.0520 (5) 51.0514 51.0529		0 0 0 0 0 0			0.9976 0.9974 0.9978 0.9966 0.9982 0.9975		0.0024 0.0014 0.0008 0.0009 0.0004 0.0012	0.3418/ 2.1034 × 10⁻¹⁴/ 0.2441		0.0008 0.0001 0.0004 0.0008 0.0001 0.00044			0.9850 0.9838 0.9864 0.9848 0.9828 0.9846			0.0142 0.0152 0.0132 0.0144 0.0162 0.01464
0 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51.0052 (2) 51.0098 (3) 51.0092 (4) 51.0082 (5) 51.0083 51.00814		0 0 0 0 0 0			0.9582 0.9622 0.9590 0.9554 0.9568 0.95832		0.0418 0.0378 0.0410 0.0446 0.0432 0.04168	0.5310/ 0.0057/ 1.8768		0.0026 0.0022 0.0022 0.0018 0.0012 0.002			0.9782 0.9778 0.9786 0.9798 0.9792 0.97872			0.0192 0.0200 0.0192 0.0184 0.0196 0.01928
0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51 (2) 51.0053 (3) 51.0035 (4) 51.0027 (5) 51 51.0023		0.0056 0.0063 0.0042 0.0036 0.0074 0.00542			0.9534 0.9548 0.9522 0.9562 0.9550 0.95432		0.0410 0.0389 0.0436 0.0402 0.0376 0.04026	0.6836/ 2.9297/ 2.5635		0.0052 0.0034 0.0042 0.0040 0.0044 0.00424			0.9730 0.9728 0.9740 0.9746 0.9740 0.97368			0.0218 0.0238 0.0218 0.0214 0.0216 0.02208
0.3 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged		(1) 51.0185 (2) 51.0225 (3) 51.0211 (4) 51.0202 (5) 51.0197 51.0204		0.0091 0.0050 0.0073 0.0077 0.0256 0.01094			0.9724 0.9877 0.9864 0.9878 0.9712 0.9811		0.0185 0.0073 0.0063 0.0045 0.0032 0.00796	0.8240/ 7.8125/ 0.6773		0.0052 0.0050 0.0054 0.0074 0.0068 0.00596			0.9722 0.9718 0.9698 0.9656 0.9676 0.9694			0.0226 0.0232 0.0248 0.0270 0.0256 0.02464
0.5 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged		(1) 51.0269 (2) 51.0338 (3) 51.0327 (4) 51.0317 (5) 51.0314 51.0313		0.0055 0.0036 0.0030 0.0020 0.0016 0.00314			0.9742 0.9759 0.9698 0.9710 0.9742 0.97302		0 1.746 × 10⁻⁷ 1.2001 × 10⁻¹¹ 1.5429 × 10⁻⁴ 4.1067 × 10⁻⁵	0.9277/ 3.9063/ 0.0134		0.0066 0.0040 0.0048 0.0062 0.0060 0.00552			0.9692 0.9690 0.9666 0.9704 0.9680 0.96864			0.0242 0.0270 0.0286 0.0234 0.0260 0.02584
0.7 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51.0368 (2) 51.0401 (3) 51.0396 (4) 51.0380 (5) 51.0368 51.03826		0.0012 0.0008 0.0002 0.0003 0.0002 0.00054			0.9988 0.9992 0.9998 0.9997 0.9998 0.99946		0 0 0 0 0 0	0.8789/ 0.2365/ 8.7041 × 10⁻¹⁰		0.0024 0.0038 0.0026 0.0044 0.0040 0.00344			0.9728 0.9738 0.9714 0.9712 0.9686 0.97156			0.0248 0.0224 0.0260 0.0244 0.0274 0.025
(B). Hypothesis $H_{-}$ is true. The samples for making decisions are generated by (10) with $ρ \in (- 1 / (k - 1), ρ_{0})$ for the size $m = 5000$ .
$ρ_{0}$ $n =$ 50		CBM												Bayes Method
		AN		$P (x \in Γ_{-} \| H_{-})$		$P (x \in Γ_{0} \| H_{-})$			$P (x \in Γ_{+} \| H_{-})$		$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$			$P (x \in Γ_{-} \| H_{-})$			$P (x \in Γ_{0} \| H_{-})$			$P (x \in Γ_{+} \| H_{-})$
−0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.145 Averaged		(1) 51.0502 (2) 51.0583 (3) 51.0567 (4) 51.0549 (5) 51.0549 51.055		1 1 1 1 1 1		0 0 0 0 0 0			0 0 0 0 0 0		0.3422/ 2.1684 × 10⁻¹⁴/ 0.2756			1 1 1 1 1 1			0 0 0 0 0 0			0 0 0 0 0 0
0 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51.0560 (2) 51.0613 (3) 51.0606 (4) 51.0589 (5) 51.0538 51.058		1 1 1 1 1 1		0 0 0 0 0 0			0 0 0 0 0 0		0.5127/ 0.0054/ 2.0142			1 0.9996 1 0.9998 1 0.99988			0 0.0004 0 0.0002 0 0.00012			0 0 0 0 0 0
0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51 (2) 51 (3) 51.0053 (4) 51.0035 (5) 51.0027 51.0023		0.9549 0.9555 0.9544 0.9541 0.9544 0.95466		0.0451 0.0445 0.0456 0.0459 0.0456 0.04534			0 0 0 0 0 0		0.6592/ 2.9297/ 2.5635			0.8868 0.8910 0.8986 0.8900 0.8922 0.89172			0.1132 0.1090 0.1014 0.1100 0.1078 0.10828			0 0 0 0 0 0
0.3 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged		(1) 51 (2) 51 (3) 51 (4) 51.0059 (5) 51.0039 51.00196		0.9566 0.9568 0.9533 0.9533 0.9536 0.95472		0.0372 0.0378 0.0402 0.0467 0.0464 0.04166			0 0 0 0 0 0		0.8423/ 7.8164/ 0.6580			0.8018 0.8122 0.8080 0.8068 0.8098 0.80772			0.1982 0.1878 0.1920 0.1932 0.1902 0.19228			0 0 0 0 0 0
0.5 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged		(1) 51 (2) 51.0054 (3) 51.0036 (4) 51.0027 (5) 51.0022 51.00278		0.9585 0.9568 0.9565 0.9561 0.9564 0.95686		0.0415 0.0432 0.0435 0.0439 0.0436 0.03484			0 0 0 0 0 0		0.9766/ 4.0893/ 0.0153			0.8978 0.8988 0.8998 0.8924 0.8934 0.89644			0.1022 0.1012 0.1002 0.1076 0.1066 0.10356			0 0 0 0 0 0
0.7 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged		(1) 51.0206 (2) 51.0271 (3) 51.0256 (4) 51.0260 (5) 51.0249 51.02484		0.9884 0.9891 0.9894 0.9896 0.9894 0.98918		0.0116 0.0109 0.0106 0.0104 0.0106 0.01082			0 0 0 0 0 0		0.8286/ 0.2651/ 8.5265 × 10⁻¹⁰			0.9898 0.9860 0.9854 0.9876 0.9872 0.9872			0.0102 0.0140 0.0146 0.0124 0.0128 0.0128			0 0 0 0 0 0
(C). Hypothesis $H_{+}$ is true. The samples for making decisions are generated by (10) with $ρ \in (ρ_{0}, 1)$ for the size $m = 5000$ .
$ρ_{0}$ $n =$ 50	CBM												Bayes Method
	AN		$P (x \in Γ_{-} \| H_{+})$		$P (x \in Γ_{0} \| H_{+})$			$P (x \in Γ_{+} \| H_{+})$		$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$			$P (x \in Γ_{-} \| H_{+})$			$P (x \in Γ_{0} \| H_{+})$			$P (x \in Γ_{+} \| H_{+})$
−0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.145 Averaged	(1) 51.0414 (2) 51.0408 (3) 51.0452 (4) 51.0434 (5) 51.0420 51.04256		0 0 0 0 0 0		0.0060 0.0058 0.0050 0.0050 0.0072 0.0058			0.9940 0.9942 0.9950 0.9950 0.9928 0.9942		0.3296/ 2.4293 × 10⁻¹⁴/ 0.2632			0 0 0 0 0 0			0.0202 0.0158 0.0180 0.0204 0.0196 0.0188			0.9798 0.9842 0.9820 0.9796 0.9804 0.9812
0 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged	(1) 51.0038 (2) 51.0089 (3) 51.0069 (4) 51.0061 (5) 51 51.00514		0 0 0 0 0 0		0.0446 0.0469 0.0473 0.0467 0.0457 0.04624			0.9554 0.9531 0.9527 0.9533 0.9543 0.95376		0.4883/ 0.0057/ 1.9531			0 0 0 0 0 0			0.0970 0.0888 0.0914 0.0912 0.0874 0.09116			0.9030 0.9112 0.9086 0.9088 0.9126 0.90884
0.1 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged	(1) 51.0004 (2) 51.0053 (3) 51.0037 (4) 51.0030 (5) 51.0025 51.00298		0 0 0 0 0 0		0.0474 0.0497 0.0442 0.0480 0.0488 0.04762			0.9526 0.9503 0.9558 0.9520 0.9512 0.95238		0.6348/ 0.7334/ 1.5869			0 0 0 0 0 0			0.0858 0.0870 0.0826 0.0734 0.0836 0.08248			0.9142 0.9130 0.9174 0.9266 0.9164 0.91752
0.3 $\|ρ_{0} - ρ_{A}\|$ = 0.2 Averaged	(1) 51.0336 (2) 51.0336 (3) 51.0393 (4) 51.0372 (5) 51.0384 51.03642		0 0 0 0 0 0		0.0304 0.0284 0.0275 0.0289 0.0294 0.02892			0.9696 0.9716 0.9725 0.9711 0.9706 0.97108		0.8446/ 7.0801/ 0.6084			0 0 0 0 0 0			0.0404 0.0376 0.0348 0.0370 0.0382 0.0376			0.9596 0.9624 0.9652 0.9630 0.9618 0.9624
0.5 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged	(1) 51.0734 (2) 51.0845 (3) 51.0790 (4) 51.0789 (5) 51.0781 51.07878		0 0 0 0 0 0		0.0002 0.0006 0.0004 0.0003 0.0004 0.00038			0.9998 0.9996 0.9996 0.9997 0.9996 0.99966		0.8667/ 4.5166/ 0.0143			0 0 0 0 0 0			0 0.0002 0 0.0004 0.0002 0.00016			1 0.9998 1 0.9996 0.9998 0.99984
0.7 $\|ρ_{0} - ρ_{A}\|$ = 0.18 Averaged	(1) 51.0840 (2) 51.0911 (3) 51.0863 (4) 51.0860 (5) 51.0851 51.0865		0 0 0 0 0 0		0 0 0 0 0 0			1 1 1 1 1 1		0.7813/ 0.2518/ 7.9581 × 10⁻¹⁰			0 0 0 0 0 0			0 0 0 0 0 0			1 1 1 1 1 1

Appendix C. Stein’s Method Using Distributions (49), (50), and (51)

(A). Hypothesis $H_{0}$ is true. The samples for making decisions are generated by $N (z; ({\hat{ρ}}_{n} - ρ_{0}), σ_{ρ_{0}}^{2})$ (see (49)) with the size $m = 5000$ .
$ρ_{0} = 0$ $n =$ 15	CBM
	AN	$P (x \in Γ_{-} \| H_{0})$		$P (x \in Γ_{0} \| H_{0})$		$P (x \in Γ_{+} \| H_{0})$			$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$
$\|ρ_{0} - ρ_{+}\|$ = 0.05 $n k =$ 25 Averaged	(1) 26.0506 (2) 26.0635 (3) 26.0563 (4) 26.0542 (5) 26.0540 6.05572	0.0071 0.0020 0.0017 0.0010 0.0016 0.00268		0.9810 0.9780 0.9770 0.9750 0.9710 0.9764		0 0 0 0 0 0			7.3590087890625/ 2.298583984375/ 0.58624267578125
$\|ρ_{0} - ρ_{+}\|$ = 0.10 $n k =$ 25 Averaged	(1) 26.0618 (2) 26.0569 (3) 26.0538 (4) 26.0516 (5) 26.0512 26.05506	0.0036 0.0031 0.0010 0.0010 0.0012 0.00198		0.9730 0.9680 0.9820 0.9780 0.9750 0.9752		0 0 0 0 0 0
$\|ρ_{0} - ρ_{+}\|$ = 0.15 $n k =$ 25 Averaged	(1) 26.0653 (2) 26.0592 (3) 26.0580 (4) 26.0596 (5) 26.0587 26.06016	0.0031 0.0014 0.0015 0.0004 0.0010 0.00148		0.9780 0.9779 0.9740 0.9790 0.9760 0.97698		0 0 0 0 0 0
$\|ρ_{0} - ρ_{+}\|$ = 0.20 $n k =$ 25 Averaged	(1) 26.0565 (2) 26.0562 (3) 26.0567 (4) 26.0557 (5) 26.0556 26.05614	0.0007 0.0005 0.0006 0.0004 0.0004 0.00052		0.9800 0.9830 0.9760 0.9780 0.9780 0.9790		0 0 0 0 0 0
$\|ρ_{0} - ρ_{+}\|$ = 0.249 $n k =$ 25 Averaged	(1) 26.0564 (2) 26.0556 (3) 26.0558 (4) 26.0559 (5) 26.0557 26.05588	0.0007 0.0002 0.0007 0.0006 0.0003 0.0005		0.9750 0.9780 0.9860 0.9680 0.9730 0.9760		0 0 0 0 0 0
$n k$ —the number of observations distributed in accordance with $p (z \| H_{0}) = N (z; ({\hat{ρ}}_{n} - ρ_{0}), σ_{ρ_{0}}^{2})$ , the arithmetic mean of which is used for making a decision. $n$ —the number of observations used for the computation of ML estimator ${\hat{ρ}}_{n}$ .
(B). Hypothesis $H_{-}$ is true. The samples for making decisions are generated by $N (z; ({\hat{ρ}}_{n} - ρ), σ_{ρ}^{2})$ (see (50)) with $ρ \in (- 1 / (k - 1), ρ_{0})$ for the size $m = 5000$ .
$ρ_{0} = 0$ $n =$ 15	CBM
	AN		$P (x \in Γ_{-} \| H_{-})$		$P (x \in Γ_{0} \| H_{-})$		$P (x \in Γ_{+} \| H_{-})$			$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$
$\|ρ_{0} - ρ_{+}\|$ = 0.05 $n k =$ 25 Averaged	(1) 26.0530 (2) 26.0570 (3) 26.0497 (4) 26.0475 (5) 26.0476 26.05096		0.9960 0.9985 0.9957 0.9960 0.9960 0.99644		0.0040 0.0040 0.0050 0.0030 0.0040 0.0040		0 0 0 0 0 0			7.3590087890625/ 2.298583984375/ 0.58624267578125
$\|ρ_{0} - ρ_{+}\|$ = 0.10 $n k =$ 25 Averaged	(1) 26.0397 (2) 26.0340 (3) 26.0298 (4) 26.0445 (5) 26.0560 26.0408		0.9967 0.9971 0.9975 1 1 0.99826		4.003 × 10⁻⁶ 4.003 × 10⁻⁶ 4.003 × 10⁻⁶ 0 0 2.4018 × 10⁻⁶		0 0 0 0 0 0
$\|ρ_{0} - ρ_{+}\|$ = 0.15 $n k =$ 25 Averaged	(1) 26.0297 (2) 26.0223 (3) 26.0178 (4) 26.0148 (5) 26.0127 26.01946		1 1 1 1 1 1		0 0 0 0 0 0		0 0 0 0 0 0
$n k$ —the number of observations distributed in accordance with $p (z \| H_{-}) = N (z; ({\hat{ρ}}_{n} - ρ), σ_{ρ}^{2})$ , $ρ \in (- 1 / (k - 1), ρ_{0})$ , the arithmetic mean of which is used for making a decision. $n$ —the number of observations used for the computation of ML estimator ${\hat{ρ}}_{n}$ .
(C). Hypothesis $H_{+}$ is true. The samples for making decisions are generated by $N (z; ({\hat{ρ}}_{n} - ρ_{0}), σ_{ρ_{0}}^{2})$ (see (51)) with $ρ \in (ρ_{0}, 1)$ for the size $m = 5000$ .
$ρ_{0}$ $n =$ 15	CBM
	AN	$P (x \in Γ_{-} \| H_{+})$		$P (x \in Γ_{0} \| H_{+})$		$P (x \in Γ_{+} \| H_{+})$		$λ_{2}^{0} /$ $λ_{2}^{-} /$ $λ_{2}^{+}$
$\|ρ_{0} - ρ_{+}\|$ = 0.05 $n k =$ 25 Averaged	(1) 26.0126 (2) 26.0126 (3) 26.0125 (4) 26.0124 (5) 26.0123 26.01248	2.0298 × 10⁻⁵ 2.8654 × 10⁻⁵ 4.0069 × 10⁻⁵ 5.5761 × 10⁻⁵ 7.7157 × 10⁻⁵ 4.43878 × 10⁻⁵		0.0390 0.0450 0.0470 0.0351 0.0410 0.04142		0.9945 0.9881 0.9789 0.9741 0.9686 0.98084		7.3590087890625/ 2.298583984375/ 0.58624267578125
$\|ρ_{0} - ρ_{+}\|$ = 0.10 $n k =$ 25 Averaged	(1) 26.0274 (2) 26.0209 (3) 26.0168 (4) 26.0140 (5) 26.0119 26.0182	0 0 0 0 0 0		4.6000 × 10⁻⁸ 4.6000 × 10⁻⁸ 4.6000 × 10⁻⁸ 4.6000 × 10⁻⁸ 4.6000 × 10⁻⁸ 4.6000 × 10⁻⁸		0.9516 0.9631 0.9704 0.9753 0.9789 0.96786
$\|ρ_{0} - ρ_{+}\|$ = 0.15 $n k =$ 25 Averaged	(1) 26.0367 (2) 26.0244 (3) 26.0183 (4) 26.0147 (5) 26.0122 6.02126	0 0 0 0 0 0		1.4000 × 10⁻⁵ 1.4000 × 10⁻⁵ 1.4000 × 10⁻⁵ 1.4000 × 10⁻⁵ 1.4000 × 10⁻⁵ 1.4000 × 10⁻⁵		0.9930 0.9953 0.9965 0.9972 0.9977 0.99594
$\|ρ_{0} - ρ_{+}\|$ = 0.20 $n k =$ 25 Averaged	(1) 26.0440 (2) 26.0293 (3) 26.0220 (4) 26.0176 (5) 26.0147 6.02552	0 0 0 0 0 0		0 0 0 0 0 0		1 1 1 1 1 1
$n k$ —the number of observations distributed in accordance with $p (z \| H_{+}) = N (z; ({\hat{ρ}}_{n} - ρ), σ_{ρ}^{2})$ , $ρ \in (ρ_{0}, 1)$ , the arithmetic mean of which is used for making a decision. $n$ —the number of observations used for the computation of ML estimator ${\hat{ρ}}_{n}$ .

Appendix D. The Kullback–Leibler Divergence Between the Distributions Corresponding to the Basic and Alternative Hypotheses

The formula for computing the Kullback–Leibler divergence [29] between the distributions corresponding to the basic and alternative hypotheses (11) has the following form:

J (H_{0}, H_{A}) = \frac{1}{2} (t r (V_{A}^{- 1} \cdot V_{0}) - k + \ln (\frac{\det V_{A}}{\det V_{0}})) .

Table A1. The Kullback–Leibler divergence between the considered distributions.

Absolute Value of the Difference between Correlation Coefficients	Divergence between Hypotheses
	$k$ —Dimension of the Random Vector
	2	3	4	5	10	15
0	−0.500000	−2	−2	−2	−2	−2
0.05	−0.499998	−1.998452	−1.999800	−1.999868	−1.978545	−1.913498
0.10	−0.499975	−1.996207	−1.999897	−1.9981428	−1.884038	−1.508130
0.15	−0.499870	−1.994991	−1.999923	−1.991384	−1.647123	−0.361850
0.20	−0.499578	−1.995222	−1.998287	−1.974225	−1.140647	2.5478240
0.25	−0.498936	−1.996664	−1.992357	−1.938667	−0.118422	9.863595
0.30	−0.497705	−1.998624	−1.978181	−1.872546	1.921221	28.907530
0.35	−0.495539	−1.999952	−1.949830	−1.756739	6.057835	81.612806
0.40	−0.491939	−1.998882	−1.898169	−1.560040	14.7615462	239.804571
0.45	−0.486178	−1.992709	−1.808639	−1.229397	34.099183	764.261963
0.50	−0.477174	−1.977174	−1.657081	−0.670322	80.263477	2723.863943
0.55	−0.463281	−1.945327	−1.401508	0.294955	200.982274	1.118287 × 10⁴
0.60	−0.441894	−1.885285	−0.964754	2.023792	554.973471	5.477538 × 10⁴
0.65	−0.408723	−1.775568	−0.194875	5.296226	1755.615892	3.355081 × 10⁵
0.70	−0.356280	−1.574484	1.234836	12.003869	6677.652705	2.750979 × 10⁶
0.75	−0.270482	−1.193147	4.113706	27.420558	32759.954823	3.3554421 × 10⁷
0.80	−0.121937	−0.415705	10.684338	69.403428	2.325059 × 10⁵	7.2660899 × 10⁸
0.85	0.160835	1.394187	29.065156	218.913453	2.955932 × 10⁶	3.8925993 × 10¹⁰
0.90	0.801213	6.830008	103.504803	1080.614380	1.086956 × 10⁸	1.086957 × 10¹³
0.95	2.964254	36.955242	827.124043	16,658.959510	5.333333 × 10¹⁰	1.706667 × 10¹⁷

Figure A1. Dependence of the divergence between hypotheses on the difference between correlation coefficients, i.e., on

|ρ_{0} - ρ_{A}|

, for different

k

-dimensions of the random vector.

Figure A1. Dependence of the divergence between hypotheses on the difference between correlation coefficients, i.e., on

|ρ_{0} - ρ_{A}|

, for different

k

-dimensions of the random vector.

Appendix E. The Probabilities of Correct Decisions for the Different Values of $ρ_{0}$ and for the Different Divergences Between Correlations of Basic and Alternative Hypotheses When the Lagrange Multipliers Correspond to the Minimal Value of the Divergence Equal to 0.20

$Divergence between ρ_{0}$ $and ρ_{+}$	Probabilities of correct decisions $for ρ_{0} = 0 / 0.3 / 0.5$		$Divergence between ρ_{0}$ $and ρ_{-}$	$Probabilities of correct decisions for ρ_{0} = 0 / - 0.1$
$Divergence between ρ_{0}$ $and ρ_{+}$	$P (x \in Γ_{0} \| H_{0})$	$P (x \in Γ_{+} \| H_{+})$	$Divergence between ρ_{0}$ $and ρ_{-}$	$P (x \in Γ_{-} \| H_{-})$	$P (x \in Γ_{0} \| H_{0})$
0.20	0.9814/0.9724/0.9708	0.9732/0.9769/0.9998	0.145	0.9670/1	0.9776/1
0.25	0.9810/0.9760/0.9694	0.9842/0.9794/0.9999	0.146	0.9681/1	0.9754/1
0.30	0.9774/0.9774/0.9716	0.9892/0.9815/0.99993	0.147	0.9701/1	0.9792/1
0.35	0.9776/0.9762/0.9708	0.9919/0.9832/0.99995	0.148	0.9712/1	0.9794/1
0.40	0.9804/0.9766/0.9658	0.9935/0.9846/0.99996	0.149	0.9712/1	0.9752/1
0.45	0.9780/0.9760/0.9684	0.9946/0.9858/0.99997
0.50	0.9788/0.9760/0.9684	0.9953/0.9868/0.99997

References

SenGupta, A. On Tests for Equi-correlation Coefficient of a Standard Symmetric Multivariate Normal Distribution. Aust. J. Stat. 1987, 29, 49–59. [Google Scholar] [CrossRef]
De, S.K.; Mukhopadhyay, N. Two-stage fixed-width and bounded-width confidence interval estimation methodologies for the common correlation in an equi-correlated multivariate normal distribution. Seq. Anal. 2019. [Google Scholar] [CrossRef]
SenGupta, A.; Pal, C. Locally Optimal Test for no Contamination in Standard Symmetric Multivariate Normal Mixtures. J. Stat. Plan. Inference 1991, 29, 145–155. [Google Scholar] [CrossRef]
SenGupta, A.; Pal, C. Optimal Tests for No Contamination in Symmetric Multivariate Normal Mixtures. Ann. Inst. Stat. Math. 1993, 45, 137–146. [Google Scholar] [CrossRef]
Bhatti, M.I.; King, M.L. A Beta-optimal Test of the Equicorrelation Coefficient. Aust. J. Stat. 1990, 32, 87–97. [Google Scholar] [CrossRef]
Rao, C.R. Linear Statistical Inference and Its Applications; Wiley: New York, NY, USA, 1973. [Google Scholar]
Efron, B. Defining the curvature of a statistical problem. Ann. Stat. 1975, 3, 1189–1242. [Google Scholar] [CrossRef]
Anderssen, S. Invariant normal models. Ann. Stat. 1976, 3, 132–154. [Google Scholar] [CrossRef]
Sampson, A.R. Stepwise BAN estimators for exponential families with multivariate normal applications. J. Multivar. Anal. 1976, 6, 175–176. [Google Scholar] [CrossRef]
Mahalanobis, P.C. A Sample Survey of the Acreage Under Jute in Bengal. Sankhya 1940, 4, 511–530. [Google Scholar]
Bhatti, M.I. On optimal testing for the equality of Equi-correlation: An example of loss in power. Stat. Pap. 2000, 41, 345–352. [Google Scholar] [CrossRef]
Mukhopadhyay, N.; De Silva, B.M. Sequential Methods and Their Applications; CRC: Boca Raton, FL, USA, 2009. [Google Scholar]
Zacks, S. The Exact Distributions of the Stopping Times and Their Functionals in Two-Stage and Sequential Fixed-Width Confidence Intervals of the Exponential Parameter. Seq. Anal. 2009, 28, 69–81. [Google Scholar] [CrossRef]
Banerjee, S.; Mukhopadhyay, N. A General Sequential Fixed-Accuracy Confidence Interval Estimation Methodology for a Positive Parameter: Illustrations Using Health and Safety Data. Ann. Inst. Stat. Math. 2016, 68, 541–570. [Google Scholar] [CrossRef]
Sampson, A.R. Simple BAN estimators of correlations for certain multivariate normal models with known variances. J. Amer. Stat. Assoc. 1978, 73, 859–862. [Google Scholar] [CrossRef]
Ghosh, B.K.; Sen, P.K. Handbook of Sequential Analysis; Edited volume; Dekker: New York, NY, USA, 1991. [Google Scholar]
Ghosh, M.; Mukhopadhyay, N.; Sen, P.K. Sequential Estimation; Wiley: New York, NY, USA, 1997. [Google Scholar]
De, S.K.; Mukhopadhyay, N. Fixed Accuracy Interval Estimation of the Common Variance in an Equi-Correlated Normal Distribution. Seq. Anal. 2015, 34, 364–386. [Google Scholar] [CrossRef]
Zacks, S.; Ramig, P.F. Confidence Intervals for the Common Variance of Equicorrelated Normal Random Variables. In Contributions to the Theory and Applications of Statistics, Volume in Honor of Herbert Solomon; Gelfand, A.E., Ed.; Academic Press: New York, NY, USA, 1987; pp. 511–544. [Google Scholar]
Andersson, S. Distributions of Maximal Invariants Using Quotient Measures. Ann. Stat. 1982, 10, 955–961. [Google Scholar] [CrossRef]
Wijsman, R.A. Cross-Sections of Orbits and Their Application to Densities of Maximal Invariants. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 21 June–18 July 1965 and 27 December 1965–7 January 1966; Volume 1, pp. 389–400. [Google Scholar]
Kachiashvili, K.J. Constrained Bayesian Methods of Hypotheses Testing: A New Philosophy of Hypotheses Testing in Parallel and Sequential Experiments; Nova Science Publishers, Inc.: New York, NY, USA, 2018. [Google Scholar]
Kachiashvili, K.J. Testing Statistical Hypotheses with Given Reliability; Cambridge Scholars Publishing: Cambridge, UK, 2023. [Google Scholar]
Kachiashvili, K.J.; Kachiashvili, J.K.; Prangishvili, I.A. CBM for Testing Multiple Hypotheses with Directional Alternatives in Sequential Experiments. Seq. Anal. 2020, 39, 115–131. [Google Scholar] [CrossRef]
Fleiss, J.L. On the Distribution of a Linear Combination of Independent Chi Squares. J. Am. Stat. Assoc. Theory Method 1971, 66, 142–144. [Google Scholar] [CrossRef]
Solomon, H.; Stephens, M.A. Distribution of a Sum of Weighted Chi-Square Variables. J. Am. Stat. Assoc. Theory Method 1977, 72, 881–885. [Google Scholar]
Moschopoulos, P.G.; Canada, W.B. The distribution function of a linear Combination of Chi-Squares. Comput. Math. Appl. 1984, 10, 383–386. [Google Scholar] [CrossRef]
Coelho, C.A. On the Distribution of Linear Combinations of Chi-Square Random Variables. In Computational and Methodological Statistics and Biostatistics: Contemporary Essays in Advancement; Bekker, A., Chen, D., Ferreira, J., Eds.; Emerging Topics in Statistics and Biostatistics; Springer: Berlin/Heidelberg, Germany, 2020; Volume 1, pp. 211–252. [Google Scholar] [CrossRef]
Kullback, S. Information Theory and Statistics; Mass. Peter Smith: Gloucester, UK, 1978. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kachiashvili, K.; SenGupta, A. Constrained Bayesian Method for Testing Equi-Correlation Coefficient. Axioms 2024, 13, 722. https://doi.org/10.3390/axioms13100722

AMA Style

Kachiashvili K, SenGupta A. Constrained Bayesian Method for Testing Equi-Correlation Coefficient. Axioms. 2024; 13(10):722. https://doi.org/10.3390/axioms13100722

Chicago/Turabian Style

Kachiashvili, Kartlos, and Ashis SenGupta. 2024. "Constrained Bayesian Method for Testing Equi-Correlation Coefficient" Axioms 13, no. 10: 722. https://doi.org/10.3390/axioms13100722

APA Style

Kachiashvili, K., & SenGupta, A. (2024). Constrained Bayesian Method for Testing Equi-Correlation Coefficient. Axioms, 13(10), 722. https://doi.org/10.3390/axioms13100722

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Constrained Bayesian Method for Testing Equi-Correlation Coefficient

Abstract

1. Introduction

2. The Problem Under Consideration

3. Testing (2.8) Hypotheses

3.1. The Test Using Maximum Ratio Estimation of the Parameter

3.2. Stein’s Approach

4. Constrained Bayesian Method of Testing Hypotheses

5. CBM for Testing Hypotheses in (3.1)

6. Evolution of CBM 2 for Testing (3.1) Hypotheses

6.1. Using the Maximum Ratio Estimation

6.2. Using the Stein’s Approach

7. Computation Results

8. Discussion

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. An Algorithm for Computation of (42) Distribution Function

Appendix B. Stein’s and Bayes Methods Using Distributions (13), (16), and (17)

Appendix C. Stein’s Method Using Distributions (49), (50), and (51)

Appendix D. The Kullback–Leibler Divergence Between the Distributions Corresponding to the Basic and Alternative Hypotheses

Appendix E. The Probabilities of Correct Decisions for the Different Values of $ρ_{0}$ and for the Different Divergences Between Correlations of Basic and Alternative Hypotheses When the Lagrange Multipliers Correspond to the Minimal Value of the Divergence Equal to 0.20

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Constrained Bayesian Method for Testing Equi-Correlation Coefficient

Abstract

1. Introduction

2. The Problem Under Consideration

3. Testing (2.8) Hypotheses

3.1. The Test Using Maximum Ratio Estimation of the Parameter

3.2. Stein’s Approach

4. Constrained Bayesian Method of Testing Hypotheses

5. CBM for Testing Hypotheses in (3.1)

6. Evolution of CBM 2 for Testing (3.1) Hypotheses

6.1. Using the Maximum Ratio Estimation

6.2. Using the Stein’s Approach

7. Computation Results

8. Discussion

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. An Algorithm for Computation of (42) Distribution Function

Appendix B. Stein’s and Bayes Methods Using Distributions (13), (16), and (17)

Appendix C. Stein’s Method Using Distributions (49), (50), and (51)

Appendix D. The Kullback–Leibler Divergence Between the Distributions Corresponding to the Basic and Alternative Hypotheses

Appendix E. The Probabilities of Correct Decisions for the Different Values of ρ 0 and for the Different Divergences Between Correlations of Basic and Alternative Hypotheses When the Lagrange Multipliers Correspond to the Minimal Value of the Divergence Equal to 0.20

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Appendix E. The Probabilities of Correct Decisions for the Different Values of $ρ_{0}$ and for the Different Divergences Between Correlations of Basic and Alternative Hypotheses When the Lagrange Multipliers Correspond to the Minimal Value of the Divergence Equal to 0.20