Capturing a Change in the Covariance Structure of a Multivariate Process

Andriette Bekker; Johannes T. Ferreira; Schalk W. Human; Karien Adamski

doi:10.3390/sym14010156

,

and

¹

Department of Statistics, Faculty of Natural and Agricultural Sciences, University of Pretoria, Pretoria 0002, South Africa

²

Centre of Excellence in Mathematical and Statistical Sciences, Johannesburg 2000, South Africa

^*

Author to whom correspondence should be addressed.

Symmetry2022, 14(1), 156;https://doi.org/10.3390/sym14010156

This article belongs to the Special Issue Symmetry in Multivariate Analysis

Version Notes

Order Reprints

Abstract

This research is inspired from monitoring the process covariance structure of q attributes where samples are independent, having been collected from a multivariate normal distribution with known mean vector and unknown covariance matrix. The focus is on two matrix random variables, constructed from different Wishart ratios, that describe the process for the two consecutive time periods before and immediately after the change in the covariance structure took place. The product moments of these constructed random variables are highlighted and set the scene for a proposed measure to enable the practitioner to calculate the run-length probability to detect a shift immediately after a change in the covariance matrix occurs. Our results open a new approach and provides insight for detecting the change in the parameter structure as soon as possible once the underlying process, described by a multivariate normal process, encounters a permanent/sustained upward or downward shift.

Keywords:

generalised bimatrix variate beta type II distribution; Meijer’s G-function; run-length; sequential; shift

1. Introduction

1.1. Problem Description and Approach

Ref. [1] investigated the problem of monitoring an attribute from the start of production, whether or not prior information is available, and presented Q-charts assuming that the observations from each sample are independent and identically distributed normal random variables. The run-length is a measure to gain insight into the performance of a control chart. Ref. [2] proposed an accurate, analytic approximation, while the approach of [3] was embedded in a nonstationary, discrete-time Markov chain to compute the run-length distribution. Ref. [4] considered independent samples observed from a normal distribution and monitors the variance of the sequential process when it encounters an unknown sustained shift. Only a permanent upward or downward step shift in the variance was considered. The focus of [5,6] were to develop exact expressions for the probabilities of run-lengths, and as a result the joint distribution of the charting statistics is needed. The statistical property that is of interest is the moments of the random variables (charting statistics) to illustrate the behaviour of this distribution. The property is of relevance since, once the process is out-of-control, the charting statistics are no longer independent. The correlation structure is then of particular interest. We refer to the papers of [4,5,6] that provide an overview of the practical problem which is the genesis of the following random variables:

Q_{0} = \frac{λ T_{0}}{Z} and Q_{j} = \frac{λ T_{j}}{Z + λ \sum_{k = 0}^{j - 1} T_{k}}, j = 1, 2, \dots, p with λ > 0

(1)

(see expression (6) p. 1049 of [5]) where Z and

T_{j},

j = 0, 1, \dots, p

are central chi-squared random variables and

λ

indicates the shift parameter when the process variance parameter has changed from

σ^{2}

to

σ_{1}^{2} = λ σ^{2}

.

The direct focus of this paper is, however, on a multivariate process. Suppose the covariance structure of q attributes of the items of a single process are monitored simultaneously where the samples are independently observed, and at each point in time

(i)

a sample of size

n_{i}

is collected. Assume that these samples are collected from a multivariate normal distribution with known mean vector (

{\underset{̲}{μ}}_{0}

) and unknown covariance matrix (

Σ : q \times q

) which we’ll denote as

M V N ({\underset{̲}{μ}}_{0}, Σ)

. Let

Y^{(i)} : n_{i} \times q

denote the matrix of observations for time period

i,

where

{\underset{̲}{Y}}_{1}^{(i)}, {\underset{̲}{Y}}_{2}^{(i)}, \dots, {\underset{̲}{Y}}_{q}^{(i)}

denote the column vectors (i.e., the

n_{i}

observations of each attribute) and

{\underset{̲}{Y}}_{(1)}^{(i)}, {\underset{̲}{Y}}_{(2)}^{(i)}, \dots, {\underset{̲}{Y}}_{(n_{i})}^{(i)}

denote the row vectors (i.e., observations of each sample) of

Y^{(i)}

, i.e.,

Y^{(i)} : n_{i} \times q = (\begin{matrix} Y_{11}^{(i)} & Y_{12}^{(i)} & \dots & Y_{1 q}^{(i)} \\ Y_{21}^{(i)} & Y_{22}^{(i)} & \dots & Y_{2 q}^{(i)} \\ ⋮ & ⋮ & ⋮ \\ Y_{n_{i} 1}^{(i)} & Y_{n_{i} 2}^{(i)} & \dots & Y_{n_{i} q}^{(i)} \end{matrix}) .

Assume further that the observations within each sample are independent, therefore the row vectors

{\underset{̲}{Y}}_{(1)}^{(i)}, {\underset{̲}{Y}}_{(2)}^{(i)}, \dots, {\underset{̲}{Y}}_{(n_{i})}^{(i)}

represent independent observations from a

M V N ({\underset{̲}{μ}}_{0}, Σ)

distribution. The sample covariance matrix at time i is denoted by

S_{i} : q \times q

, where it is known that

S_{i}

follows a Wishart distribution (see [7]). Since we assume that

Σ

is unknown, the first sample is used to obtain an initial point estimate of

Σ,

i.e., the sample covariance matrix

S_{1}

. At sample number two,

S_{2}

is compared to

S_{1}

to investigate whether the covariance structure is still the same. If so, a pooled sample covariance matrix is calculated (based on the observations in samples 1 and 2) which will be compared to

S_{3}

at time period three. This sequential updating and testing procedure continues until the process is observed (or rather, declared) to be out-of-control.

The scenario under consideration in this paper is described in Figure 1, and, without loss of generality it is assumed that the mean vector is the null vector. Suppose that between samples

κ - 1

and

κ

the covariance structure changes as shown in Figure 1, i.e., from

Σ

to

λ Σ

where

λ > 0

and

λ \neq 1

, where the location of the shift between these samples is unknown. In this paper the two matrix random variables,

U_{0}

and

U_{1},

that correspond to the two successive time periods immediately after the change in the covariance structure occurred (i.e., sample

κ

and sample

κ + 1)

will be the focus. Formally this can be described as

\begin{matrix} U_{0} & = & X^{- \frac{1}{2}} λ W_{0} X^{- \frac{1}{2}} \\ U_{1} & = & {(X + λ W_{0})}^{- \frac{1}{2}} λ W_{1} {(X + λ W_{0})}^{- \frac{1}{2}} \end{matrix}

(2)

where

C^{\frac{1}{2}}

denotes the unique positive definite square root of a matrix

C

. In this case

X

has a Wishart distribution with parameters

v_{1}

and

Σ,

denoted

W_{q} (v_{1}, Σ),

W_{0}

is

W_{q} (v_{2}, Σ)

distributed and

W_{1}

has a

W_{q} (v_{3}, Σ)

distribution with

X

,

W_{0}

and

W_{1}

independent

(v_{i} \geq q, i = 1, 2, 3)

. In terms of the statistical process control (SPC) literature the parameters are interpretable as

v_{1} = \sum_{i = 1}^{κ - 1} n_{i},

v_{2} = n_{κ}

and

v_{3} = n_{κ + 1}

.

Figure 1. Schematic description of the multivariate process.

If

q = 1

in Equation (2), then the random variables simplify to the case for two successive time periods

(Q_{0} and Q_{1})

in Equation (1), and so Equation (2) can be represented as

\begin{matrix} U_{0} & = & X^{- \frac{1}{2}} W_{0} X^{- \frac{1}{2}} \\ U_{1} & = & {(X + W_{0})}^{- \frac{1}{2}} W_{1} {(X + W_{0})}^{- \frac{1}{2}} \end{matrix}

(3)

where

X \sim W_{q} (v_{1}, Σ),

W_{0} \sim W_{q} (v_{2}, λ Σ)

and

W_{1} \sim W_{q} (v_{3}, λ Σ)

. Refs. [8,9] previously described constructions of this nature. The advantage of this approach lies in the mathematical and statistical formulation of observing a change in the covariance matrix in such a sequential process, and to present this matrix-based framework and results for further future study within SPC.

1.2. Outline of Paper

In Section 2, the distribution of the matrix random variables (3) is unknown and will be investigated; and expressions for the marginal distributions and moments

E [| U_{i} |^{h_{j}} |]

are given for

i = 0, 1, j = 1, 2

, which are used to obtain exact expressions for the pdfs of

| U_{0} |

and

| U_{1} |

(relying on mathematical tools reviewed in Section 1.3. The cumulative distribution function (cdf) of

|U_{0}|

and

|U_{1}|

are given and used as part of the numerical example within an SPC environment in Section 3. In particular, a measure is proposed to determine the probability that a control chart will signal immediately after a change in the covariance matrix. The expressions are given in computable terms of Meijer’s G-function, and also theoretical terms involving zonal polynomials and hypergeometric functions with matrix argument which are often encountered in the literature (see [7,10,11,12,13,14,15]). Finally, Section 4 contains discussions and conclusions.

1.3. Mathematical Toolbox

Some essential mathematical tools and definitions for this paper are listed below.

(Ref. [16]) The multivariate gamma function, denoted $Γ_{q} (α),$ is defined as

$\begin{matrix} Γ_{q} (α) & = & \int_{S > 0} etr (- S) {|S|}^{α - \frac{1}{2} (q + 1)} d S \\ = & π^{\frac{1}{4} q (q - 1)} \prod_{i = 1}^{q} Γ [α - \frac{1}{2} (i - 1)] \end{matrix}$

(4)

where $ℜ (α) > \frac{1}{2} (q - 1)$ , and the integral is over the space of $q \times q$ positive definite matrices. For $q = 1$ it simplifies to the gamma function. The generalised gamma function of weight $τ$ is defined as

$\begin{matrix} Γ_{q} (α, τ) & = & π^{\frac{1}{4} q (q - 1)} \prod_{j = 1}^{q} Γ [α + t_{j} - \frac{1}{2} (j - 1)] \\ = & {(α)}_{τ} Γ_{q} (α) \end{matrix}$

(5)

where the integral is over the space of $q \times q$ positive definite matrices, ${(α)}_{τ}$ is the generalised hypergeometric coefficient, $ℜ (α) \geq \frac{1}{2} (q - 1) - t_{q}, τ = (t_{1}, \dots, t_{q})$ , $t_{1} \geq \dots \geq t_{q} \geq 0$ , $t_{1} + \dots + t_{q} = t$ and $Γ_{q} (α, 0) = Γ_{q} (α)$ . Finally then, the following Laplace transform is used subsequently and given by (see also [12]):

$\int_{S > 0} etr (- SX) {|S|}^{α - \frac{1}{2} (q + 1)} d S = Γ_{q} (α) {|S|}^{- α} .$

(6)
(Ref. [16]) The multivariate beta function, denoted by $β_{q} (α, b)$ , is defined as

$β_{q} (α, β) = \int_{0 < S < I_{q}} {|S|}^{α - \frac{1}{2} (q + 1)} {|I_{q} - S|}^{β - \frac{1}{2} (q + 1)} d S = \frac{Γ_{q} (α) Γ_{q} (β)}{Γ_{q} (α + β)}$

(7)

where $ℜ (α) > \frac{1}{2} (q - 1)$ , $ℜ (β) > \frac{1}{2} (q - 1)$ and $Γ_{q} (\cdot)$ is the multivariate gamma function. For $q = 1$ it simplifies to the usual beta function.
(Ref. [15]) Meijer’s G-function with the parameters $α_{1}, \dots, α_{r}$ and $β_{1}, \dots, β_{s}$ is defined as

$G_{r, s}^{m, n} ({x |}_{β_{1}, \dots, β_{s}}^{α_{1}, \dots, α_{r}}) = \frac{1}{2 π i} \int_{L} g (h) x^{- h} d h$

(8)

where $i = \sqrt{- 1},$ L is a suitable contour, $x \neq 0,$ and

$g (h) = \frac{\prod_{j = 1}^{m} Γ (β_{j} + h) \prod_{j = 1}^{n} Γ (1 - α_{j} - h)}{\prod_{j = m + 1}^{s} Γ (1 - β_{j} - h) \prod_{j = n + 1}^{r} Γ (α_{j} + h)}$

where $m,$ n, r and s are integers with $0 \leq n \leq r$ and $0 \leq m \leq s .$
(Refs. [15,17,18]) The hypergeometric function of matrix argument is defined by

$_{r} F_{s} (α_{1}, \dots, α_{r}; β_{1}, \dots, β_{s}; S) = \sum_{t = 0}^{\infty} \sum_{τ} \frac{{(α_{1})}_{τ} \dots {(α_{r})}_{τ}}{{(β_{1})}_{τ} \dots {(β_{s})}_{τ}} \frac{1}{t!} C_{τ} (S),$

(9)

where $α_{i}$ , $i = 1, \dots, r;$ $β_{j},$ $j = 1, \dots, s$ are arbitrary numbers, $S$ $(q \times q)$ is a real symmetric matrix, $\sum_{τ}$ denotes summation over all partitions $τ$ , $C_{τ} (S)$ is the zonal polynomial of $S$ , ${(α)}_{τ}$ is the generalised hypergeometric coefficient.
Two special cases of Equation (9) are of interest:
1.
If $X : (q \times q)$ is a symmetric matrix where $∥X∥ < 1,$ then

$_{1} F_{0} (α; X) = \frac{1}{Γ_{q} (α)} \int_{S > 0} etr [- S (I_{q} - X)] {|S|}^{α - \frac{1}{2} (q + 1)} d S = {|I_{q} - X|}^{- α},$

(10)

where $ℜ (α) > \frac{1}{2} (q - 1) .$
2.
If $X : (q \times q)$ is a symmetric matrix where $∥X∥ < 1,$ then

$\begin{matrix} _{2} F_{1} (α, β; c; X) \\ = \frac{Γ_{q} (c)}{Γ_{q} (α) Γ_{q} (c - α)} \int_{0 < S < I_{q}} {|S|}^{α - \frac{1}{2} (q + 1)} {|I_{q} - S|}^{c - α - \frac{1}{2} (q + 1)} {|I_{q} - XS|}^{- β} d S \end{matrix}$

(11)

where $ℜ (c) > \frac{1}{2} (q - 1)$ and $ℜ (c - α) > \frac{1}{2} (q - 1) .$ This is known as the Gauss hypergeometric function of matrix argument.
(Ref. [4]) Two particular results are of interest here.
1.
If $S : (q \times q)$ $> 0,$ $B : (q \times q) > 0, B$ free of elements of $S$ , then

$\begin{matrix} \int_{S > 0} {|S|}^{α - \frac{1}{2} (q + 1)} {|I_{q} + S|}^{- β} {|I_{q} + BS|}^{- c} d S \\ = & β_{q} (α, β + c - α) {|B|}^{- c}_{2} F_{1} (β + c - α, c; β + c; I_{q} - B^{- 1}) \end{matrix}$

(12)

where $∥I_{q} - B^{- 1}∥ < 1,$ $ℜ (β + c - α) > \frac{1}{2} (q - 1)$ , and $ℜ (α) > \frac{1}{2} (q - 1)$ .
2.
The confluent hypergeometric function $Ψ (\cdot)$ of symmetric matrix $R : (q \times q)$ is defined by

$Ψ (α, c, R) = \frac{1}{Γ_{q} (α)} \int_{S > 0} etr (- RS) {|S|}^{α - \frac{1}{2} (q + 1)} {|I_{q} + S|}^{c - α - \frac{1}{2} (q + 1)} d S$

(13)

where $R > 0$ and $ℜ (α) > \frac{1}{2} (q - 1) .$ Then

$\begin{matrix} \int_{Y > 0} {|Y|}^{β - \frac{1}{2} (q + 1)} etr (- XY) Ψ (α, c, Y) d Y \\ = & \frac{Γ_{q} (β) Γ_{q} (β - c + \frac{1}{2} (q + 1))}{Γ_{q} (α + β - c + \frac{1}{2} (q + 1))} \\ \times_{2} F_{1} (β - c + \frac{1}{2} (q + 1), β; α + β - c + \frac{1}{2} (q + 1); I_{q} - X) \end{matrix}$

(14)

where $∥I_{q} - X∥ < 1$ and $ℜ (α) > \frac{1}{2} (q - 1),$ $ℜ (β - c) > - 1 .$ Furthermore, let $B > 0$ . It can then be shown that

$\begin{matrix} \int_{Y > 0} {|Y|}^{β - \frac{1}{2} (q + 1)} etr (- XY) Ψ (α, c, B^{\frac{1}{2}} {YB}^{\frac{1}{2}}) d Y \\ = & {|B|}^{- β} \frac{Γ_{q} (β) Γ_{q} (β - c + \frac{1}{2} (q + 1))}{Γ_{q} (α + β - c + \frac{1}{2} (q + 1))} \\ \times_{2} F_{1} (β - c + \frac{1}{2} (q + 1), β; (α + β - c + \frac{1}{2} (q + 1)); I_{q} - B^{- \frac{1}{2}} {XB}^{- \frac{1}{2}}) \end{matrix}$

(15)

where $∥I_{q} - B^{- \frac{1}{2}} {XB}^{- \frac{1}{2}}∥ < 1$ and $ℜ (α) > \frac{1}{2} (q - 1),$ $ℜ (β - c) > - 1$ .

2. Methodology

In this section the focus is to derive the joint distribution of

U_{0}

and

U_{1}

that capture the change in the covariance structure as depicted in Figure 1. From this joint distribution, the distributions of

|U_{0}|

and

|U_{1}|

are investigated to pave the way for the calculation of run-length probabilities in this matrix setting. The joint distribution of

U_{0}

and

U_{1}

is referred to as a generalised bimatrix variate beta type II distribution (functional symmetry is assumed as the symmetrization technique of [19] is inappropriate for this scenario). Without loss of generality, we assume that

Σ = I_{q}

.

Theorem 1.

Suppose that

X \sim W_{q} (v_{1}, Σ)

is independent of

W_{0} \sim W_{q} (v_{2}, λ Σ)

and

W_{1} \sim W_{q} (v_{3}, λ Σ)

. Let

C^{- 1} = 2^{\frac{1}{2} q (v_{1} + v_{2} + v_{3})} \sum_{i = 1}^{3} Γ_{q} (\frac{1}{2} v_{i}) λ^{\frac{1}{2} q (v_{2} + v_{3})}

. Then, the pdf of:

1.: Equation (3) is given by

$\begin{matrix} f (U_{0}, U_{1}) \\ = & C {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|I_{q} + U_{0}|}^{\frac{1}{2} v_{3}} \\ \times {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} \int_{U > 0} {|U|}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) - \frac{1}{2} (q + 1)} \\ \times etr (- \frac{1}{2} U (I_{q} + \frac{1}{λ} U_{0})) etr (- \frac{1}{2 λ} U^{\frac{1}{2}} (I_{q} + U_{0}) U^{\frac{1}{2}} U_{1}) dU, \end{matrix}$

(16)
2.: $U_{0}$ is given by

$f (U_{0}) = \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}))}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{\frac{1}{2} q v_{1}} {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|λ I_{q} + U_{0}|}^{- \frac{1}{2} (v_{1} + v_{2})},$

(17)
3.: $U_{1}$ is given by

$\begin{matrix} f (U_{1}) \\ = & \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2} + v_{3}))}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2})) Γ_{q} (\frac{1}{2} v_{3})} λ^{\frac{1}{2} q v_{1}} {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} {|λ I_{q} + U_{1}|}^{- \frac{1}{2} (v_{1} + v_{2} + v_{3})} \\ \times_{2} F_{1} (\frac{1}{2} v_{2}, \frac{1}{2} (v_{1} + v_{2} + v_{3}); \frac{1}{2} (v_{1} + v_{2}); I_{q} - {(I_{q} + U_{1})}^{\frac{1}{2}} {(λ I_{q} + U_{1})}^{- 1} {(I_{q} + U_{1})}^{\frac{1}{2}}) \end{matrix}$

(18)

where $U_{i} > 0, i = 0, 1$ with $ℜ (v_{i}) > q - 1,$ $i = 1, 2, 3$ , and

$∥I_{q} - {(I_{q} + U_{1})}^{\frac{1}{2}} {(λ I_{q} + U_{1})}^{- 1} {(I_{q} + U_{1})}^{\frac{1}{2}}∥ < 1 .$

Proof.

1.: The joint pdf of $X, W_{0}, W_{1}$ is given by

$\begin{matrix} f (X, W_{0}, W_{1}) & = & C {|X|}^{\frac{1}{2} (v_{1} - q - 1)} {|W_{0}|}^{\frac{1}{2} (v_{2} - q - 1)} {|W_{1}|}^{\frac{1}{2} (v_{3} - q - 1)} \\ \times etr (- \frac{1}{2} X) etr (- \frac{1}{2} λ^{- 1} W_{0}) etr (- \frac{1}{2} λ^{- 1} W_{1}) . \end{matrix}$

(19)

Making the transformation

$U = X, U_{0} = X^{- \frac{1}{2}} W_{0} X^{- \frac{1}{2}}, U_{1} = {(X + W_{0})}^{- \frac{1}{2}} W_{1} {(X + W_{0})}^{- \frac{1}{2}},$

leaves

$X = U, W_{0} = U^{\frac{1}{2}} U_{0} U^{\frac{1}{2}}, W_{1} = {(U + U^{\frac{1}{2}} U_{0} U^{\frac{1}{2}})}^{\frac{1}{2}} U_{1} {(U + U^{\frac{1}{2}} U_{0} U^{\frac{1}{2}})}^{\frac{1}{2}} .$

From [16] p. 12, the Jacobian of the transformation is given by

$\begin{matrix} J (X, W_{0}, W_{1} \to U, U_{0}, U_{1}) & = & J (X \to U) J (W_{0} \to U_{0}) J (W_{1} \to U_{1}) \\ = & {|U|}^{q + 1} {|I_{q} + U_{0}|}^{\frac{1}{2} (q + 1)} . \end{matrix}$

(20)

Therefore, substituting in Equation (19) gives the joint pdf of $(U, U_{0}, U_{1})$ as

$\begin{matrix} f (U, U_{0}, U_{1}) \\ = & C^{- 1} {|U|}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) - \frac{1}{2} (q + 1)} {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|I_{q} + U_{0}|}^{\frac{1}{2} v_{3}} {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} \\ \times etr (- \frac{1}{2} U) etr (- \frac{1}{2} λ^{- 1} {UU}_{0}) etr (- \frac{1}{2} λ^{- 1} U^{\frac{1}{2}} (I_{q} + U_{0}) U^{\frac{1}{2}} U_{1}) \end{matrix}$

(21)

which leaves the final result.
2.: The marginal pdf of $U_{0}$ is obtained by integrating $f (U_{0}, U_{1})$ (see Equation (16)) with respect to $U_{1}$ using Equation (6):

$\begin{matrix} f (U_{0}) \\ = & C {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|I_{q} + U_{0}|}^{\frac{1}{2} v_{3}} \\ \times \int_{U > 0} {|U|}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) - \frac{1}{2} (q + 1)} etr (\frac{1}{2} U (I_{q} + λ^{- 1} U_{0})) \\ \times \int_{U_{1} > 0} {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} etr (- \frac{1}{2} λ^{- 1} U^{\frac{1}{2}} (I_{q} + U_{0}) U^{\frac{1}{2}} U_{1}) d U_{1} d U \\ = & C Γ_{q} (\frac{1}{2} v_{3}) {(2 λ)}^{\frac{1}{2} v_{3} q} {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} \\ \times \int_{U > 0} {|U|}^{\frac{1}{2} (v_{1} + v_{2}) - \frac{1}{2} (q + 1)} etr (- \frac{1}{2} U (I_{q} + λ^{- 1} U_{0})) d U \end{matrix}$

from where the result follows immediately.
3.: From Equations (16) and (13) and (14) it follows that

$\begin{matrix} f (U_{1}) \\ = & C {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} \int_{U > 0} {|U|}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) - \frac{1}{2} (q + 1)} etr (- \frac{1}{2} (I_{q} + λ^{- 1} U_{1}) U) \\ \int_{U_{0} > 0} {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|I_{q} + U_{0}|}^{\frac{1}{2} v_{3}} e t r (- \frac{1}{2} λ^{- 1} (U + U^{\frac{1}{2}} U_{1} U^{\frac{1}{2}}) U_{0}) d U_{0} d U \\ = & C Γ_{q} (\frac{1}{2} v_{2}) {|U_{1}|}^{\frac{1}{2} v_{3} - \frac{1}{2} (q + 1)} \int_{U > 0} {|U|}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) - \frac{1}{2} (q + 1)} etr (- \frac{1}{2} (I_{q} + λ^{- 1} U_{1}) U) \\ \times Ψ (\frac{1}{2} v_{2}, \frac{1}{2} v_{2} + \frac{1}{2} v_{3} + \frac{1}{2} (q + 1), \frac{1}{2} λ^{- 1} {(I_{q} + U_{1})}^{\frac{1}{2}} U {(I + U_{1})}^{\frac{1}{2}}) d U \end{matrix}$

(22)

from where the result follows. □

Remark 1.

Substituting

λ = 1

(i.e., there is no change in the covariance structure and therefore the process remains in-control) in Equation (17) gives the well-known matrix variate beta type II distribution with parameters (

\frac{1}{2} v_{1}, \frac{1}{2} v_{2}

) with pdf

\frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}))}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} {|U_{0}|}^{\frac{1}{2} v_{2} - \frac{1}{2} (q + 1)} {|I_{q} + U_{0}|}^{- \frac{1}{2} (v_{1} + v_{2})}

where

U_{0} > 0

.

The hth moments of

|U_{0}|

and

|U_{1}|

are given in the following corollary. The moments are used to determine the distribution of

|U_{0}|

and

|U_{1}|

, and exact expressions for the pdfs and cdfs of

|U_{0}|

and

|U_{1}|

are subsequently derived.

Corollary 1.

Suppose that

X \sim W_{q} (v_{1}, Σ)

is independent of

W_{0} \sim W_{q} (v_{2}, λ Σ)

and

W_{1} \sim W_{q} (v_{3}, λ Σ) .

If the joint pdf of Equation (3) is given by Equation (16), then

1.: The product moment $E ({|U_{0}|}^{h_{1}})$ is given by

$E ({|U_{0}|}^{h_{1}}) = \frac{Γ_{q} (\frac{1}{2} v_{1} - h_{1}) Γ_{q} (\frac{1}{2} v_{2} + h_{1})}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{q h_{1}}$

(23)

where $ℜ (\frac{1}{2} v_{1} - h_{1}) > \frac{1}{2} (q - 1),$ $ℜ (\frac{1}{2} v_{2} + h_{1}) > \frac{1}{2} (q - 1)$ .
2.: The product moment $E ({|U_{1}|}^{h_{2}})$ is given by

$\begin{matrix} E ({|U_{1}|}^{h_{2}}) & = & \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h_{2}) Γ_{q} (\frac{1}{2} v_{3} + h_{2}) λ^{\frac{1}{2} v_{1} q}}{Γ_{q} (\frac{1}{2} v_{3}) Γ_{q} (\frac{1}{2} (v_{1} + v_{2}))} \\ \times_{2} F_{1} (\frac{1}{2} v_{1}, \frac{1}{2} (v_{1} + v_{2}) - h_{2}; \frac{1}{2} (v_{1} + v_{2}); (1 - λ) I_{q}) \end{matrix}$

(24)

where $∥(1 - λ) I_{q}∥ < 1,$ $ℜ (\frac{1}{2} (v_{1} + v_{2}) - h_{2}) > \frac{1}{2} (q - 1),$ $ℜ (\frac{1}{2} v_{3} + h_{2}) > \frac{1}{2} (q - 1) .$

Theorem 2.

Suppose that

X \sim W_{q} (v_{1}, Σ)

is independent of

W_{0} \sim W_{q} (v_{2}, λ Σ)

and

W_{1} \sim W_{q} (v_{3}, λ Σ) .

If the joint pdf of Equation (3) is given by Equation (16) with marginal pdfs given in Equations (17) and (18) respectively, then

1.: the pdf of $|U_{0}|$ is given by

$f (|U_{0}|) = \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{- q} G_{q, q}^{q, q} (λ^{- q} |U_{0}| |_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}), |U_{0}| > 0$

(25)
2.: with cumulative distribution function (CDF)

$\begin{matrix} F_{|U_{0}|} (c) & = & Pr (|U_{0}| \leq c) \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} G_{q + 1, q + 1}^{q, q + 1} (λ^{- q} {c |}_{b_{1} + 1, \dots, b_{q} + 1, 0}^{1, a_{1} + 1, \dots, a_{q} + 1}), c > 0, \end{matrix}$

(26)

where $G (\cdot)$ denotes Meijer’s G-function Equation (8) and $a_{j} = - \frac{1}{2} v_{1} + \frac{1}{2} (j - 1)$ and $b_{j} = \frac{1}{2} v_{2} - \frac{1}{2} (j + 1)$ for $j = 1, 2, \dots, q .$
3.: The pdf of $|U_{1}|$ is given by

$\begin{matrix} f (|U_{1}|) \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) G_{q, q}^{q, q} (|U_{1}| |_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}), \end{matrix}$

(27)

such that $|U_{1}| > 0$ and where $C_{τ} (\cdot)$ is the corresponding zonal polynomial, with the values of the parameters such that $f (|U_{1}|)$ is a valid pdf,
4.: with CDF

$\begin{matrix} F_{|U_{1}|} (c) \\ = & Pr (|U_{1}| \leq c) \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) G_{q + 1, q + 1}^{q, q + 1} ({c |}_{b_{1} + 1, \dots, b_{q} + 1, 0}^{1, a_{1} + 1, \dots, a_{q} + 1}) \end{matrix}$

(28)

such that $c > 0$ , where $a_{j} = - \frac{1}{2} (v_{1} + v_{2}) - t_{j} + \frac{1}{2} (j - 1)$ and $b_{j} = \frac{1}{2} v_{3} - \frac{1}{2} (j + 1)$ for $j = 1, \dots, q,$ with the values of the parameters such that $F_{|U_{1}|} (c)$ is a valid CDF and $Γ_{q} (\cdot, \cdot)$ denotes the generalised gamma function (see Equation (5)). The proof can be found in the Appendix A.

As a theoretical validation of the results, consider the case when

q = 1

in (25) and (27). Using [15] p. 130 yields the marginal pdf of

U_{0}

:

\begin{matrix} f (u_{0}) & = & \frac{1}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{2})} λ^{- 1} G_{1, 1}^{1, 1} (λ^{- 1} u_{0} |_{\frac{1}{2} v_{2} - 1}^{- \frac{1}{2} v_{1}}) \\ = & \frac{Γ (\frac{1}{2} (v_{1} + v_{2}))}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{2})} λ^{\frac{1}{2} v_{1}} u_{0}^{\frac{1}{2} v_{2} - 1} {(λ + u_{0})}^{- \frac{1}{2} (v_{1} + v_{2})} \end{matrix}

where

u_{0} > 0

and the marginal pdf of

U_{1}

:

\begin{matrix} f (u_{1}) \\ = & \frac{λ^{\frac{1}{2} v_{1}}}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{3})} \sum_{t = 0}^{\infty} \frac{Γ_{1} (\frac{1}{2} v_{1}, t)}{Γ_{1} (\frac{1}{2} (v_{1} + v_{2}), t) t!} {(1 - λ)}^{t} C_{t} (I_{1}) G_{1, 1}^{1, 1} (u_{1} |_{\frac{1}{2} v_{3} - 1}^{- \frac{1}{2} (v_{1} + v_{2}) - t}) \\ = & \frac{λ^{\frac{1}{2} v_{1}}}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{3})} \sum_{t = 0}^{\infty} \frac{{(\frac{1}{2} v_{1})}_{t} Γ (\frac{1}{2} v_{1})}{{(\frac{1}{2} (v_{1} + v_{2}))}_{t} Γ (\frac{1}{2} (v_{1} + v_{2})) t!} {(1 - λ)}^{t} \\ \times \frac{Γ (\frac{1}{2} (v_{1} + v_{2} + v_{3}) + t) u_{1}^{\frac{1}{2} v_{3} - 1}}{{(1 + u_{1})}^{\frac{1}{2} (v_{1} + v_{2} + v_{3}) + t}} \\ = & \frac{Γ (\frac{1}{2} (v_{1} + v_{2} + v_{3})) λ^{\frac{1}{2} v_{1}}}{Γ (\frac{1}{2} v_{3}) Γ (\frac{1}{2} (v_{1} + v_{2}))} u_{1}^{\frac{1}{2} v_{3} - 1} {(1 + u_{1})}^{- \frac{1}{2} (v_{1} + v_{2} + v_{3})} \\ \times_{2} F_{1} (\frac{1}{2} v_{1}, \frac{1}{2} (v_{1} + v_{2} + v_{3}); \frac{1}{2} (v_{1} + v_{2}); \frac{1 - λ}{1 + u_{1}}) \end{matrix}

where

u_{1} > 0, |\frac{1 - λ}{1 + u_{1}}| < 1

. It is valuable to note the special case of both of these preceeding results (when

q = 1

) in the case when

λ = 1

, i.e., no shift occurs. In both cases for

U_{0}

and

U_{1}

, these marginal pdfs reflect beta type II distributions.

Remark 2.

Ref. [20] discussed the two kinds of Wilks’ statistic. If

U_{0} = X^{- \frac{1}{2}} W_{0} X^{- \frac{1}{2}}

with

X

and

W_{0}

Wishart matrices

(W_{q} (v_{i}, Σ)

,

i = 1, 2),

then

U_{0}

has the matrix variate beta type II distribution. They derived the exact expression for the pdf of Wilks’ statistic type II:

|U_{0}|

, the latter expressed as the product of q univariate betas of the second kind, which in turn, can be expressed as Meijer G-functions. Thus, Equations (25) and (27) can be considered as Wilks’ type II statistics.

3. Numerical Example

This section focusses on the calculation of run-length probabilities for this multivariate sequential process. In particular, some percentage points are calculated as an illustration for the probability to detect the shift instantly (i.e., a run-length of one). In this way, the calculation of run-length probabilities may be feasible and meaningful within the matrix environment.

The discrete random variable that defines the run-length is called the run-length random variable and often denoted by N with its distribution called the run-length distribution. Let

A_{j}

be the event that a univariate random variable

U_{j},

j = 0, 1, \dots, p,

plots inside its respective control limits, i.e.,

A_{j} = L C L_{κ + j} < U_{j} < U C L_{κ + j}

(29)

where

L C L

and

U C L

denotes the lower and upper control limits respectively. The probability of detecting a shift immediately, in other words, the probability of a run-length of one is then

Pr (N = 1) = Pr (A_{0}^{C}) = 1 - Pr (A_{0}) = 1 - Pr (L C L_{κ} < U_{0} < U C L_{κ}) .

(30)

This probability is the probability that the charting statistic will plot on or outside the control limits upon collecting the first sample after the change in the variance (see Equation (30)). In the matrix environment

|U_{0}|

is of interest as a test statistic for testing the null hypothesis at time

κ

that the covariance matrix structure did not change (practically, the process is in-control). Therefore, if the statistic

|U_{0}|

exceeds a critical value (say

c_{0})

it presents evidence that the covariance matrix structure changed and that the process is declared out-of-control. This proposed method deviates from the univariate case where a two sided hypothesis is considered (see Equation (30)). Thus, once the covariance matrix structure changes, the probability to detect this change immediately, in other words, the probability of a run-length of one is

Pr (N = 1) \equiv P [|U_{0}| \geq c_{0}] .

(31)

Take note that

c_{0}

indicates an upper critical value and not a control limit as before (see Equation (30)). If

q = 1

then

c_{0}

is comparable to the UCL of a one-sided hypothesis in the univariate case.

In this example, percentage points are calculated for a run-length of one for the scenario as illustrated in Figure 1 where the covariance matrix changes with a scale factor from

Σ

to

λ Σ .

Two cases with

q = 1

and 2 (i.e., a univariate and bimatrix process) is considered. From Equation (31)

Pr (N = 1) = 1 - F_{|U_{0}|} (c_{0}),

where

F_{|U_{0}|} (\cdot)

is the CDF of

|U_{0}|

given in Equation (26)).

In particular, for

q = 1

see that

F_{|U_{0}|} (c_{0}) = \frac{1}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{2})} G_{2, 2}^{1, 2} (λ^{- 1} c_{0} |_{\frac{1}{2} v_{2}, 0}^{1, - \frac{1}{2} v_{1} + 1}),

(32)

and for

q = 2

\begin{matrix} F_{|U_{0}|} (c_{0}) \\ = & \frac{1}{Γ (\frac{1}{2} v_{1}) Γ (\frac{1}{2} v_{1} - \frac{1}{2}) Γ (\frac{1}{2} v_{2}) Γ (\frac{1}{2} v_{2} - \frac{1}{2})} G_{3, 3}^{2, 3} (λ^{- 2} c_{0} |_{\frac{1}{2} v_{2}, \frac{1}{2} v_{2} - \frac{1}{2}, 0}^{1, - \frac{1}{2} v_{1} + 1, - \frac{1}{2} v_{1} + \frac{3}{2}}) . \end{matrix}

The percentage points

c_{0}

of

|U_{0}|

are obtained numerically by solving the equation

F_{|U_{0}|} (c_{0}) = \int_{0}^{c_{0}} f (|U_{0}|) d |U_{0}| = 1 - γ

(33)

where

γ

is a pre-specified probability of detecting a change in the covariance structure when the process is out-of-control. Solving for this value numerically involves the computation of Meijer’s G-function which is available in the R software as meijerG; in our case, we used MeijerG in the software Mathematica.

Table 1 provides the numerical values of

c_{0}

for different values of

λ

and

γ

for the case if

q = 1

(univariate) and

q = 2

(bimatrix). In this example, samples of four equal sizes are collected at each point in time, i.e.,

v_{2} = n = 4

. It is assumed that the covariance matrix changes with a scale factor

λ,

between samples

κ - 1

and

κ

where

κ = 3,

therefore

v_{1} = (κ - 1) \times n = 8 .

Table 1. Percentage points

c_{0}

of

|U_{0}|

.

Remark 3.

The upper percentage points

c_{0}

in the simple case when

q = 1

can be related to the control limits (see (30)). In the above example a one-sided test was considered, i.e., the chart would signal that the process is out-of-control if

|U_{0}| \geq c_{0} .

It is well-known that the type I error in hypothesis testing is

P (r e j e c t

H_{0}

|

H_{0}

t r u e

). In the SPC context this is similar as the false alarm rate (FAR). The FAR is defined as the probability for a single charting statistic to plot on or outside the control limits when the process is in-control. The probability that

U_{0}

plots on or outside the control limits given that the process variance did not encounter a shift, i.e.,

λ = 1

. Therefore

\begin{matrix} F A R & = & 1 - Pr (L C L_{κ} < U_{0} < U C L_{κ}| λ = 1) \\ = & 1 - \int_{L C L_{κ}}^{U C L_{κ}} f (u_{0}) d u_{0} (see Equation (17)) \\ = & 0.0027 . \end{matrix}

In the SPC environment it is desirable to have a FAR of

0.0027

. Substituting

γ = \frac{0.0027}{2} = 0.00135

in Equation (33) with

q = 1

and

λ = 1

, gives

c_{0} = 6.58684 .

This value corresponds to the UCL in the case of

q = 1

when, i.e.,

U C L_{κ = 3} = \frac{F_{n_{κ}, \sum_{i = 1}^{κ - 1} n_{i}}^{- 1} [Φ (3)]}{\frac{\sum_{i = 1}^{κ - 1} n_{i}}{n_{κ}}} = \frac{F_{4, 8}^{- 1} [Φ (3)]}{\frac{8}{4}} = 6.58684 .

where

F_{v_{1}, v_{2}}^{- 1} (.)

and

Φ (.)

denotes the inverse CDF of the

F_{v_{1}, v_{2}} (.)

distribution and the CDF of the standard normal distribution respectively. See also [5,6] in this regard.

4. Discussion and Conclusions

In this paper, we introduced a generalised bimatrix variate beta type II distribution which originated from “ratios” of Wishart random variates, emanating from monitoring the process covariance structure of q attributes where samples are independent, having been collected from a multivariate normal distribution with known mean and unknown covariance matrix. In particular, the (i) pdfs of the marginal distributions and (ii) the pdfs of the determinants of the components of the generalised bimatrix variate beta type II distribution were derived. This paves the way for the proposed measure to capture the change in a multivariate process momentarily. An illustrative example was included where some percentage points were calculated to address the run-length concept in the matrix environment. In a similar way as described in this paper, the two matrix random variables

U_{0}

and

U_{1}

can be used to test if the covariance structure has changed significantly between time periods

κ - 1

and

κ

as well as

κ

and

κ + 1

.

In a similar way the run-length of two implies that even though the covariance matrix changed, this change is not detected using the control chart at time

κ,

but that the chart only signals that the process is out-of-control at time

κ + 1 .

Therefore

\begin{matrix} Pr (N = 2) & \equiv Pr [|U_{0}| < c_{0}, |U_{1}| \geq c_{1}] \\ = Pr [|U_{0}| < c_{0}] - Pr [|U_{0}| < c_{0}, |U_{1}| < c_{1}] . \end{matrix}

(34)

From Equation (34) it is evident that the joint pdf of

(|U_{0}|, |U_{1}|)

is needed to calculate the probability of a run-length of two, but a closed form expression is not mathematically tractable. Another possibility is to assume independence of the statistics

|U_{0}|

and

|U_{1}|

, then the approximate run-length probability is

Pr (N = 2) \approx Pr [|U_{0}| < c_{0}] - Pr [|U_{0}| < c_{0}] Pr [|U_{1}| < c_{1}] .

Even in the case of the above approximation one still encounters computational challenges, see the pdf of

|U_{1}|

given in Equation (27).

Furthermore, as a two-sample statistic for testing the hypothesis at time

κ

that the two independent samples (i.e., all observations from time

i = 1

to

κ - 1

vs. the observations in sample

κ

) are from the same

q

-variate multivariate normal distributions with the same unknown covariance matrix

Σ

, the statistic

| U_{0} |

may be of interest as a test statistic. Subsequently

| U_{1} |

can be used at time

κ + 1

. Thus,

| U_{0} |

and

| U_{1} |

may be used as charting statistics for the multivariate process. In this scenario,

| U_{0} |

is in fact a test statistic to check whether

λ = 1

(i.e., the covariance matrices are the same) versus

λ \neq 1

(i.e., the covariance matrix change with the scale factor

λ

). For

λ = 1,

| U_{0} |

is the Wilks’ statistic type II ([20]).

Recent trends indicate a continued interest in modelling and theoretical capturing of shifts within covariance matrices within multivariate settings similar to the one under consideration in this paper. Ref. [21] develops a distribution-free control chart for this purpose, and [22,23] also refreshes the literature of methods for statistical surveillance of covariance structures with particularly developed control charts. The contribution of [24] is also a valuable contribution in literature based on machine learning approaches for the monitoring of the covariance matrix in multivariate SPC, and forms a basis for the departure of potential future studies. The case where there is a change in the mean vector in this sequential process may be considered as future work. As a future development, the practitioner may be interested in more than two successive time periods immediately after the change in the covariance structure occurred, which will lead to new matrix variate Dirichlet type II distributions.

Author Contributions

Conceptualization, A.B. and S.W.H.; methodology, A.B., J.T.F., S.W.H. and K.A.; software, K.A.; validation, A.B., S.W.H. and K.A.; formal analysis, A.B., J.T.F., S.W.H. and K.A.; investigation, A.B., J.T.F., S.W.H. and K.A.; resources, J.T.F.; writing—original draft preparation, K.A. and A.B.; writing—review and editing, A.B. and J.T.F.; supervision, A.B. and S.W.H.; funding acquisition, A.B., J.T.F. and S.W.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work enjoys support from the following grants: University of Pretoria: RDP 296/2019; National Research Foundation: SRUG190308422768 nr. 120839; National Research Foundation: SARChI Research Chair UID 71199, as well as the Centre of Excellence in Mathematical and Statistical Sciences at the University of the Witwatersrand.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors acknowledge the support of the Department of Statistics at the University of Pretoria, Pretoria, South Africa, and discussions with J.J.J. Roux and M. Arashi. In addition, the authors also acknowledge the valuable contribution of the two anonymous referees, whose comments led to an improved presentation of this work.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 2.

1.: From Equation (23),

$E ({|U_{0}|}^{h - 1}) = \frac{Γ_{q} (\frac{1}{2} v_{1} - h + 1) Γ_{q} (\frac{1}{2} v_{2} + h - 1)}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{q (h - 1)},$

therefore

$E ({|λ^{- 1} U_{0}|}^{h - 1}) = \frac{Γ_{q} (\frac{1}{2} v_{1} - h + 1) Γ_{q} (\frac{1}{2} v_{2} + h - 1)}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} .$

(A1)

Using the well-known Mellin transform of $f (|λ^{- 1} U_{0}|)$ :

$M_{f} (h) \equiv E ({|λ^{- 1} U_{0}|}^{h - 1}) .$

(A2)

Expressing the multivariate gamma functions in Equation (A1) as a product of gamma functions and substituting it in the Mellin transform Equation (A2), gives

$\begin{matrix} M_{f} (h) = \frac{π^{\frac{q (q - 1)}{2}} \sum_{j = 1}^{q} Γ [1 - a_{j} - h] \sum_{j = 1}^{q} Γ [b_{j} + h]}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})}, \\ w h e r e a_{j} = - \frac{1}{2} v_{1} + \frac{1}{2} (j - 1) and b_{j} = \frac{1}{2} v_{2} - \frac{1}{2} (j + 1), j = 1, 2, \dots, q . \end{matrix}$

(A3)

The pdf of $|λ^{- 1} U_{0}|$ is uniquely obtained from the inverse Mellin transform ([15]) of Equation (A3) and using Equation (8) and is given by

$\begin{matrix} f (|λ^{- 1} U_{0}|) \\ = & \frac{1}{2 π i} \int_{ω - i \infty}^{ω + i \infty} M_{f} (h) {|λ^{- 1} U_{0}|}^{- h} d h \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} \frac{1}{2 π i} \int_{ω - i \infty}^{ω + i \infty} \sum_{j = 1}^{q} Γ [1 - a_{j} - h] \sum_{j = 1}^{q} Γ [b_{j} + h] {|λ^{- 1} U_{0}|}^{- h} d h \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} G_{q, q}^{q, q} (λ^{- q} |U_{0}| |_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}) \end{matrix}$

(A4)

and the result follows.
2.: Let $u = |U_{0}|,$ $u > 0$ then from Equation (25) the CDF is defined as

$\begin{matrix} F_{|U_{0}|} (c) & = & Pr (|U_{0}| \leq c) \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{- q} \int_{0}^{c} G_{q, q}^{q, q} (λ^{- q} {u |}_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}) d u . \end{matrix}$

Applying [15] results from pages 142, 59, and 69, yields

$\begin{matrix} F_{|U_{0}|} (c) \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{- q} \int_{0}^{c} H_{q, q}^{q, q} (λ^{- q} {u |}_{(b_{1}, 1), \dots, (b_{q}, 1)}^{(a_{1}, 1), \dots, (a_{q}, 1)}) d u \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{- q} c H_{q + 1, q + 1}^{q, q + 1} (λ^{- q} {c |}_{(b_{1}, 1), \dots, (b_{q}, 1), (- 1, 1)}^{(0, 1), (a_{1}, 1), \dots, (a_{q}, 1)}) \\ = & \frac{π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{2})} λ^{- q} c G_{q + 1, q + 1}^{q, q + 1} (λ^{- q} {c |}_{b_{1}, \dots, b_{q}, - 1}^{0, a_{1}, \dots, a_{q}}) \end{matrix}$

and the result follows.
3.: From Equation (24) the Mellin transform ([15]) of $f (|U_{1}|)$ is

$\begin{matrix} M_{f} (h) = & \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h + 1) Γ_{q} (\frac{1}{2} v_{3} + h - 1) λ^{\frac{1}{2} v_{1} q}}{Γ_{q} (\frac{1}{2} v_{3}) Γ_{q} (\frac{1}{2} (v_{1} + v_{2}))} \\ \times_{2} F_{1} (\frac{1}{2} v_{1}, \frac{1}{2} (v_{1} + v_{2}) - h + 1; \frac{1}{2} (v_{1} + v_{2}); (1 - λ) I_{q}) . \end{matrix}$

(A5)

Using Equations (5) and (9) the Gauss hypergeometric function of matrix argument in Equation (A5) can be written as

$\begin{matrix} _{2} F_{1} (\frac{1}{2} v_{1}, \frac{1}{2} (v_{1} + v_{2}) - h + 1; \frac{1}{2} (v_{1} + v_{2}); (1 - λ) I_{q}) \\ = & \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} v_{1})} \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h + 1, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h + 1)} \frac{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}))}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ)} \frac{C_{τ} ((1 - λ) I_{q})}{t!} . \end{matrix}$

This gives

$\begin{matrix} M_{f} (h) & \equiv & \frac{Γ_{q} (\frac{1}{2} v_{3} + h - 1) λ^{\frac{1}{2} v_{1} q}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ) Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h + 1, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} C_{τ} ((1 - λ) I_{q}) . \end{matrix}$

(A6)

The multivariate gamma function in Equation (A6) can be written as

$\begin{matrix} \begin{matrix} Γ_{q} (\frac{1}{2} v_{3} + h - 1) = π^{\frac{q (q - 1)}{4}} \sum_{j = 1}^{q} Γ [b_{j} + h], \end{matrix} \\ \begin{matrix} where b_{j} = \frac{1}{2} v_{3} - \frac{1}{2} (j + 1) for j = 1, \dots, q, \end{matrix} \end{matrix}$

(A7)

and using Equation (5), the generalised gamma function of weight $τ$ can be written as

$\begin{matrix} \begin{matrix} Γ_{q} (\frac{1}{2} (v_{1} + v_{2}) - h + 1, τ) = π^{\frac{q (q - 1)}{4}} \sum_{j = 1}^{q} Γ [1 - a_{j} - h], \end{matrix} \\ \begin{matrix} w h e r e a_{j} = - \frac{1}{2} (v_{1} + v_{2}) - t_{j} + \frac{1}{2} (j - 1) f o r j = 1, 2, \dots, q . \end{matrix} \end{matrix}$

(A8)

Substituting Equations (A7) and (A8) in Equation (A6) gives

$\begin{matrix} M_{f} (h) & \equiv & \frac{λ^{\frac{1}{2} v_{1} q}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} π^{\frac{q (q - 1)}{2}} \\ \times \sum_{j = 1}^{q} Γ [1 - a_{j} - h] \sum_{j = 1}^{q} Γ [b_{j} + h] C_{τ} ((1 - λ) I_{q}) . \end{matrix}$

(A9)

The pdf of $|U_{1}|$ is obtained from the inverse Mellin transform ([15]) of Equation (A9) and from the definition of the Meijer’s G-function Equation (8) as

$\begin{matrix} f (|U_{1}|) \\ = & \frac{λ^{\frac{1}{2} v_{1} q}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} C_{τ} ((1 - λ) I_{q}) π^{\frac{q (q - 1)}{2}} \\ \times \frac{1}{2 π i} \int_{ω - i \infty}^{ω + i \infty} \sum_{j = 1}^{q} Γ [1 - a_{j} - h] \sum_{j = 1}^{q} Γ [b_{j} + h] {|U_{1}|}^{- h} d h \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) G_{q, q}^{q, q} (|U_{1}| |_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}) . \end{matrix}$
4.: Let $u = |U_{1}|,$ $u > 0$ then from Equation (27) the CDF is defined as

$\begin{matrix} F_{|U_{1}|} (c) & = & Pr (|U_{1}| \leq c) \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) \int_{0}^{c} G_{q, q}^{q, q} ({u |}_{b_{1}, \dots, b_{q}}^{a_{1}, \dots, a_{q}}) d u . \end{matrix}$

Applying [15] results from page 142, 59, and 69, yields

$\begin{matrix} F_{|U_{1}|} (c) \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) \int_{0}^{c} H_{q, q}^{q, q} ({v |}_{(b_{1}, 1), \dots, (b_{q}, 1)}^{(a_{1}, 1), \dots, (a_{q}, 1)}) d u \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) c H_{q + 1, q + 1}^{q, q + 1} ({c |}_{(b_{1}, 1), \dots, (b_{q}, 1), (- 1, 1)}^{(0, 1), (a_{1}, 1), \dots, (a_{q}, 1)}) \\ = & \frac{λ^{\frac{1}{2} v_{1} q} π^{\frac{q (q - 1)}{2}}}{Γ_{q} (\frac{1}{2} v_{1}) Γ_{q} (\frac{1}{2} v_{3})} \\ \times \sum_{t = 0}^{\infty} \sum_{τ} \frac{Γ_{q} (\frac{1}{2} v_{1}, τ)}{Γ_{q} (\frac{1}{2} (v_{1} + v_{2}), τ) t!} {(1 - λ)}^{t} C_{τ} (I_{q}) c G_{q + 1, q + 1}^{q, q + 1} ({c |}_{b_{1}, \dots, b_{q}, - 1}^{0, a_{1}, \dots, a_{q}}) \end{matrix}$

and the result follows. □

References

Quesenberry, C.P. SPC Q charts for start-up processes and short or long runs. J. Qual. Technol. 1991, 23, 213–224. [Google Scholar] [CrossRef]
Zantek, P.F. Run-length distributions of Q-chart schemes. IIE Trans. 2005, 37, 1037–1045. [Google Scholar] [CrossRef]
Zantek, P.F. A Markov-chain method for computing the run-length distribution of the self-starting cumulative sum scheme. J. Stat. Comput. Simul. 2008, 78, 463–473. [Google Scholar] [CrossRef]
Adamski, K. Generalised Beta Type II Distributions-Emanating from a Sequential Process. Ph.D. Thesis, University of Pretoria, Pretoria, South Africa, 2013. [Google Scholar]
Adamski, K.; Human, S.W.; Bekker, A. A generalized multivariate beta distribution: Control charting when the measurements are from an exponential distribution. Stat. Pap. 2012, 53, 1045–1064. [Google Scholar] [CrossRef]
Adamski, K.; Human, S.; Bekker, A.; Roux, J. Noncentral generalized multivariate beta type II distribution. REVSTAT- J. 2013, 11, 17–43. [Google Scholar]
Muirhead, R.J. Aspects of Multivariate Statistical Theory; John Wiley & Sons: Hoboken, NJ, USA, 2009; Volume 197. [Google Scholar]
Díaz-García, J.A.; Jáimez, R.G. Bimatrix variate generalised beta distributions: Theory and methods. S. Afr. Stat. J. 2010, 44, 193–208. [Google Scholar]
Bekker, A.; Roux, J.J.; Arashi, M. Wishart ratios with dependent structure: New members of the bimatrix beta type IV. Linear Algebra Its Appl. 2011, 435, 3243–3260. [Google Scholar] [CrossRef][Green Version]
Constantine, A.G. Some non-central distribution problems in multivariate analysis. Ann. Math. Stat. 1963, 34, 1270–1285. [Google Scholar] [CrossRef]
Constantine, A.G. The Distribution of Hotelling’s Generalised T squared. Ann. Math. Stat. 1966, 37, 215–225. [Google Scholar] [CrossRef]
Herz, C.S. Bessel functions of matrix argument. Ann. Math. 1955, 61, 474–523. [Google Scholar] [CrossRef]
James, A.T. Zonal polynomials of the real positive definite symmetric matrices. Ann. Math. 1961, 74, 456–469. [Google Scholar] [CrossRef]
James, A.T. Distributions of matrix variates and latent roots derived from normal samples. Ann. Math. Stat. 1964, 35, 475–501. [Google Scholar] [CrossRef]
Mathai, A.M. A Handbook of Generalized Special Functions for Statistical and Physical Sciences; Oxford University Press: Oxford, UK, 1993. [Google Scholar]
Gupta, A.K.; Nagar, D.K. Matrix Variate Distributions; CRC Press: Boca Raton, FL, USA, 2018; Volume 104. [Google Scholar]
Mathai, A.M.; Saxena, R.K.; Haubold, H.J. The H-Function: Theory and Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
Khatri, C. On certain distribution problems based on positive definite quadratic functions in normal vectors. Ann. Math. Stat. 1966, 37, 468–479. [Google Scholar] [CrossRef]
Greenacre, M. Symmetrised multivariate distributions. S. Afr. Stat. J. 1973, 7, 95–101. [Google Scholar]
Pham-Gia, T.; Turkkan, N. Distributions of ratios: From random variables to random matrices. Open J. Stat. 2011, 1, 93–104. [Google Scholar] [CrossRef]
Liang, W.; Xiang, D.; Pu, X.; Li, Y.; Jin, L. A robust multivariate sign control chart for detecting shifts in covariance matrix under the elliptical directions distributions. Qual. Technol. Quant. Manag. 2019, 16, 113–127. [Google Scholar] [CrossRef]
Ning, X.; Li, P. A simulation comparison of some distance-based EWMA control charts for monitoring the covariance matrix with individual observations. Qual. Reliab. Eng. Int. 2020, 36, 50–67. [Google Scholar] [CrossRef]
Machado, M.A.; Lee Ho, L.; Quinino, R.C.; Celano, G. Monitoring the covariance matrix of bivariate processes with the DVMAX control charts. Appl. Stoch. Model. Bus. Ind. 2021. [Google Scholar] [CrossRef]
Maboudou-Tchao, E.M. Kernel methods for changes detection in covariance matrices. Commun. Stat.-Simul. Comput. 2018, 47, 1704–1721. [Google Scholar] [CrossRef]

Figure 1. Schematic description of the multivariate process.

Table 1. Percentage points

c_{0}

of

|U_{0}|

.

Table 1. Percentage points

c_{0}

of

|U_{0}|

.

$λ$	q	$γ = 0.01$	$γ = 0.025$	$γ = 0.05$	$γ = 0.1$
2	1	7.00608	5.05263	3.83785	2.80643
1	1	3.50304	2.52632	1.91893	1.40321
0.5	1	1.75152	1.26316	0.95946	0.70161
2	2	7.29343	4.50351	2.97902	1.80558
1	2	3.64671	2.25176	1.48951	0.91746
0.5	2	1.82336	1.12588	0.74475	0.46006

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Capturing a Change in the Covariance Structure of a Multivariate Process

Abstract

1. Introduction

1.1. Problem Description and Approach

1.2. Outline of Paper

1.3. Mathematical Toolbox

2. Methodology

3. Numerical Example

4. Discussion and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics