Article

Combination Test for Mean Shift and Variance Change

1 School of Big Data and Statistics, Anhui University, Hefei 230601, China
2 Irving K. Barber Faculty of Science, University of British Columbia, Kelowna, BC V1V 1V7, Canada
* Author to whom correspondence should be addressed.
Symmetry 2023, 15(11), 1975; https://doi.org/10.3390/sym15111975
Submission received: 24 September 2023 / Revised: 15 October 2023 / Accepted: 18 October 2023 / Published: 25 October 2023
(This article belongs to the Special Issue Applications Based on Symmetry/Asymmetry in Functional Data Analysis)

Abstract

This paper considers a new mean-variance model with strong mixing errors and describes a combination test for the mean shift and variance change. Under some stationarity and symmetry conditions, the limiting distribution of the combination test statistic is obtained, from which the limiting distributions of the mean change test and the variance change test follow. As an application, a three-step algorithm to detect the change-points is given: the first step tests whether there is at least one change-point; the second and third steps detect the mean change-point and the variance change-point, respectively. To illustrate our results, some simulations and real-world data analyses are discussed. The analysis shows that our tests not only have high power, but can also classify a detected change-point as a mean change-point or a variance change-point. Compared with the existing methods cpt.meanvar and mosum from R packages, the new method has advantages in recognition capability and accuracy.

1. Introduction

Statistical methods of control for manufacturing processes were first proposed in [1]. The issue of change-points initially appeared in this quality-control context, where staff typically observe the output of a production line, aiming to detect signals deviating from acceptable levels. Since the seminal paper [2], the cumulative sum (CUSUM) test has been one of the most popular methods for detecting a parameter change in statistical models. CUSUM tests of the mean change-point and the variance change-point have played a central role in detecting abnormal signals in quality control, changes in financial time series, and other fields. For example, the authors of [3,4] considered CUSUM tests for the mean change-point model with independent errors and linear processes, respectively; the papers [5,6] investigated CUSUM tests for the variance change-point model with independent normal errors and independent errors, respectively. For further studies, reference can be made to [7,8,9,10] and the sources detailed therein. Combining the change-point of the mean with the change-point of the variance, this paper considers a change-point model with mean shift and variance change. For $T \geq 2$, we consider a time series $\{X_t\}$ following the mean-variance model

$$X_t = \mu_t + \sigma_t e_t, \quad 1 \le t \le T, \tag{1}$$

where $-\infty < \mu_t < \infty$ and $\sigma_t > 0$ are the mean and variance parameters, respectively. Since the condition of strong mixing ($\alpha$-mixing) is quite general in time series [11], we take the error sequence $\{e_t\}$ to be an $\alpha$-mixing sequence with mean zero and variance one. Let us recall the definition of $\alpha$-mixing. Let $\mathbb{N} = \{1, 2, \ldots\}$ and denote by $\mathcal{F}_k^T = \sigma(e_t, k \le t \le T, t \in \mathbb{N})$ the $\sigma$-field generated by the random variables $e_k, e_{k+1}, \ldots, e_T$, $1 \le k \le T$. For $t \ge 1$, we define

$$\alpha(t) = \sup_{m \in \mathbb{N}} \sup_{A \in \mathcal{F}_1^m,\, B \in \mathcal{F}_{m+t}^\infty} |P(AB) - P(A)P(B)|.$$
Definition 1.
If $\alpha(t) \to 0$ as $t \to \infty$, then $\{e_t, t \ge 1\}$ is called a strong mixing or $\alpha$-mixing sequence.
In the mean-variance model (1), we monitor the mean change-point with the CUSUM statistic
$$U_{k,1} = \frac{k(T-k)}{T}\left[\frac{1}{k}\sum_{t=1}^{k} X_t - \frac{1}{T-k}\sum_{t=k+1}^{T} X_t\right], \quad 1 \le k \le T-1, \tag{2}$$
and monitor the variance change-point with the CUSUM statistic
$$U_{k,2} = \frac{k(T-k)}{T}\left[\frac{1}{k}\sum_{t=1}^{k} (X_t-\bar X)^2 - \frac{1}{T-k}\sum_{t=k+1}^{T} (X_t-\bar X)^2\right], \quad 1 \le k \le T-1, \tag{3}$$

where $\bar X = \frac{1}{T}\sum_{t=1}^T X_t$.
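For concreteness, both CUSUM statistics can be computed for all $k$ at once from cumulative sums. The following R sketch is ours rather than the authors' published code, and the helper name cusum_stats is hypothetical:

```r
# Sketch: CUSUM statistics (2)-(3) for a numeric vector x of length T >= 2.
cusum_stats <- function(x) {
  T <- length(x)
  k <- 1:(T - 1)
  w <- k * (T - k) / T                 # weight k(T-k)/T in (2)-(3)
  s1 <- cumsum(x)                      # partial sums of X_t
  U1 <- w * (s1[k] / k - (s1[T] - s1[k]) / (T - k))
  s2 <- cumsum((x - mean(x))^2)        # partial sums of (X_t - Xbar)^2
  U2 <- w * (s2[k] / k - (s2[T] - s2[k]) / (T - k))
  list(U1 = U1, U2 = U2)               # each of length T - 1
}
```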
To improve upon the separate statistics for the mean change-point and the variance change-point, we use the combination statistic

$$g(T,k) := (g_1(T,k), g_2(T,k))^\top := \left(\frac{1}{\sqrt T}U_{k,1},\ \frac{1}{\sqrt T}U_{k,2}\right)^\top, \tag{4}$$

where $U_{k,1}$ and $U_{k,2}$ are defined by (2) and (3), respectively, and ⊤ denotes the transpose of a vector. Under the assumption of no change in the mean or variance, for all $T \ge 2$, the model (1) can be summarized in the null hypothesis $H_0$ as

$$\mu_1 = \mu_2 = \cdots = \mu_T = \mu_0 \quad \text{and} \quad \sigma_1 = \sigma_2 = \cdots = \sigma_T = \sigma_0, \tag{5}$$

where $-\infty < \mu_0 < \infty$ and $0 < \sigma_0 < \infty$. The change-point alternative hypothesis $H_A$ is that there is an integer $k_1$ such that $\mu_1 = \cdots = \mu_{k_1} \neq \mu_{k_1+1} = \cdots = \mu_T$, or there is an integer $k_2$ such that $\sigma_1 = \cdots = \sigma_{k_2} \neq \sigma_{k_2+1} = \cdots = \sigma_T$.
In this paper, we consider the mean-variance model (1) with $\alpha$-mixing errors and investigate the limiting distributions of the statistics related to $g(T,k)$ under the null hypothesis $H_0$ in (5). For example, if $\max_{1\le k\le T-1} g^\top(T,k)\,g(T,k)$ is smaller than a critical value (see details in Remark 2), then there is no evidence of a mean change or variance change in the mean-variance model (1). Otherwise, if $\max_{1\le k\le T-1}|g_1(T,k)|$ is larger than another critical value (see details in Remark 2), the mean change-point location is suggested by

$$\hat k_{T,1} = \operatorname*{argmax}_{1\le k\le T-1}|g_1(T,k)| = \operatorname*{argmax}_{1\le k\le T-1}|U_{k,1}|. \tag{6}$$

If $\max_{1\le k\le T-1}|g_2(T,k)|$ is larger than this critical value, the variance change-point location is suggested by

$$\hat k_{T,2} = \operatorname*{argmax}_{1\le k\le T-1}|g_2(T,k)| = \operatorname*{argmax}_{1\le k\le T-1}|U_{k,2}|. \tag{7}$$
Compared with existing methods for determining change-points, such as cpt.meanvar from the R package changepoint [12] and mosum from the R package mosum [13], we will show that our tests not only have high power, but can also classify the detected change-points as mean change-points or variance change-points. Further details are provided in Section 3, Section 4 and Section 5.
In addition to the change-point studies referred to above, many scholars have sought to extend the change-point analysis of the mean and variance to both independent and dependent data. For the mean change-point, CUSUM estimators with dependent errors were investigated in [14,15,16]; in [17], a weighted CUSUM estimator was studied for an infinite variance AR(p) process; in [18,19], a self-normalization method was used to test the mean change-point in a time series; in [20,21], data-driven methods were used to investigate the mean shift and variance change; in [22,23,24], CUSUM estimators of the mean change-point were discussed for panel data. For the variance change-point, the Schwarz information criterion (SIC) estimator of a variance change was studied with independent normal errors in [25]; ref. [26] extended the CUSUM estimator of [5] from normal data to infinite moving average processes; in [27], a weighted variance test was considered based on independent errors; a covariance structure change was studied for linear processes in [28]. In addition, the authors of [29] considered a CUSUM test of parameter changes in a time series model; refs. [30,31] reported changes in a variance inflation factor (VIF) regression model and a linear regression model, respectively; the authors of ref. [32] considered changes in parameters using the Shiryaev–Roberts statistics; in ref. [33], the change of covariance structure in multivariate time series was considered; refs. [34,35,36] considered multiple change-points; in ref. [37], the authors investigated a Bayesian method for the change-point; the authors of ref. [38] investigated a new class of weighted CUSUM estimators of the mean change-point; in ref. [39], the least sum of squared errors (LSSE) and maximum log-likelihood (MLL) methods for the estimation of the change-point were examined; in ref. [40], multivariate change-points in a mean vector and/or covariance structure were considered; and ref. [41] discussed a CUSUM estimator in an ARMA–GARCH model. Furthermore, a test for the detection of outliers in continuous distribution data was investigated in [42]; refs. [43,44] investigated change-point problems for a nonstationary time series and for the volatility of conditional heteroscedasticity, respectively.
It is pointed out that the $\alpha$-mixing condition is very general in time series. For example, consider an infinite order moving average (MA($\infty$)) process $X_t = \sum_{i=0}^\infty a_i e_{t-i}$, $t \ge 1$, where $a_i \to 0$ exponentially fast and $\{e_t\}$ is an i.i.d. sequence. If the probability density function of $e_t$ exists (as for the normal, Cauchy, exponential, and uniform distributions), then $\{X_t\}$ is $\alpha$-mixing with exponentially decaying coefficients. Strictly stationary time series, including autoregressive moving average (ARMA) processes and geometrically ergodic Markov chains, are $\alpha$-mixing processes. For further studies of $\alpha$-mixing, reference can be made to [45,46] for limit theorems, refs. [47,48] for the central limit theorem, refs. [49,50,51,52] for regression models, etc.
The rest of this paper is organized as follows. Some assumptions are provided in Section 2. Under some stationarity and symmetry conditions, the limiting distribution of the combination statistic $g(T,k)$ is established under the null hypothesis (5) in Section 2, from which the limiting distributions of the CUSUM statistics $g_1(T,k)$ for the mean change and $g_2(T,k)$ for the variance change are derived. As an application, we give a three-step algorithm to detect the change-points in Section 3: in the first step, we apply the combination test to check whether there is at least one change-point; in the second step, we apply the mean test to detect the mean change-point; in the third step, we apply the variance test to detect the variance change-point. The simulations in Section 4 show that our method performs better than the methods cpt.meanvar [12] and mosum [13]. We also use three examples of real-world data to detect the mean change-point and variance change-point in Section 5. In addition, some conclusions and future work are discussed in Section 6. Lastly, the proofs of the main results are presented in Section 7.
Throughout the paper, as $T\to\infty$, let $\xrightarrow{P}$ and $\xrightarrow{d}$ denote convergence in probability and convergence in distribution, respectively. Let $C, C_1, C_2, C_3, \ldots$ denote positive constants not depending on $T$, which may differ in various places. If $X$ and $Y$ have the same distribution, we write $X \stackrel{d}{=} Y$. In addition, second-order stationarity means that $(e_1, e_{1+k}) \stackrel{d}{=} (e_t, e_{t+k})$ for all $t\ge1$ and $k\ge1$.

2. Main Results

First, we list some assumptions as follows:
Assumption 1.
Consider the model (1), where $\{e_t, t\ge1\}$ is a stationary sequence of $\alpha$-mixing random variables with $E e_t = 0$, $E e_t^2 = 1$ for all $t \ge 1$. In addition, for some $\delta > 0$, let $\sup_{t\ge1} E|e_t|^{4+2\delta} < \infty$ and $\sum_{t=1}^\infty \alpha^{\delta/(2+\delta)}(t) < \infty$.
Assumption 2.
Let $\mu_0$ be defined in (5), and assume

$$\lim_{T\to\infty}\frac1T\mathrm{Var}\left(\sum_{t=1}^T X_t\right) = s_1^2 > 0, \quad \lim_{T\to\infty}\frac1T\mathrm{Var}\left(\sum_{t=1}^T(X_t-\mu_0)^2\right) = s_2^2 > 0, \tag{8}$$

$$\lim_{T\to\infty}\frac1T\sum_{i=1}^T\sum_{j=1}^T\mathrm{Cov}\left[(X_i-\mu_0),\ (X_j-\mu_0)^2\right] = 0. \tag{9}$$
Assumption 3.
Let $\{e_t, t\ge1\}$ be a second-order stationary sequence of $\alpha$-mixing random variables with $E e_1 = 0$ and $E e_1^2 = 1$. For some $\delta>0$, assume that $E|e_1|^{8+4\delta} < \infty$ and $\sum_{t=1}^\infty \alpha^{\delta/(2+\delta)}(t) < \infty$. In addition, let $E e_1 e_{1+j}^2 = 0$ for all $j \ge 0$.
Assumption 4.
Let

$$s_1^2 := \gamma_1(0) + 2\sum_{h=1}^\infty\gamma_1(h) > 0, \quad s_2^2 := \gamma_2(0) + 2\sum_{h=1}^\infty\gamma_2(h) > 0, \tag{10}$$

where $\gamma_1(h) = \mathrm{Cov}(X_1, X_{1+h}) = \mathrm{Cov}((X_1-\mu_0), (X_{1+h}-\mu_0))$ and $\gamma_2(h) = \mathrm{Cov}((X_1-\mu_0)^2, (X_{1+h}-\mu_0)^2)$ for $h = 0, 1, 2, \ldots$.
Assumption 5.
Let $\{h_T, T\ge1\}$ be a sequence of positive integers satisfying

$$h_T \to \infty \ \text{as}\ T\to\infty \quad \text{and} \quad h_T = O(T^\beta) \ \text{for some}\ \beta\in(0, 1/2). \tag{11}$$
Remark 1.
The moment conditions and mixing coefficients of the $\alpha$-mixing sequence $\{e_t\}$ in Assumption 1 are used by many researchers; see [46,51], etc. The conditions (8) in Assumption 2 give the limiting variances of the partial sums of $\{(X_t-\mu_0)/\sqrt{T}\}$ and $\{(X_t-\mu_0)^2/\sqrt{T}\}$. The condition (9) in Assumption 2 is a symmetry condition, which requires the limit of the partial sums of covariances between $\{(X_t-\mu_0)/\sqrt{T}\}$ and $\{(X_t-\mu_0)^2/\sqrt{T}\}$ to be zero. For example, let $f(x,y;k)$ denote the joint probability density function of the random variables $X_t-\mu_0$ and $X_{t+k}-\mu_0$ for all $t\ge1$ and $k\ge1$, and let $f(x,y;k)$ be symmetric, i.e., $f(x,y;k)=f(-x,-y;k)$ for all $x,y \in \mathbb{R}$. It is easy to check that $E[(X_t-\mu_0)(X_{t+k}-\mu_0)^2] = -E[(X_t-\mu_0)(X_{t+k}-\mu_0)^2]$, which implies $E[(X_t-\mu_0)(X_{t+k}-\mu_0)^2]=0$ for all $t\ge1$ and $k\ge1$. In addition, $E(X_t-\mu_0)=0$ and $E(X_t-\mu_0)^3=0$ for all $t\ge1$. Obviously, the bivariate normal distribution satisfies the conditions of this example; thus, the condition (9) is satisfied. The second-order stationarity condition in Assumption 3 is used to obtain $s_1^2$ and $s_2^2$ in Assumption 2; they are the long-run variances $s_1^2$ and $s_2^2$ in (10). To estimate these long-run variances, we use Assumption 4 and the sample autocovariance functions to give the estimators $\hat s_{T,1}^2$ and $\hat s_{T,2}^2$ in (16). A condition similar to (11) can be seen in [26].
Second, we study the limiting distribution of the combination statistic $g(T,k)$ in (4) under the null hypothesis $H_0$ in (5). We denote by $\lfloor x\rfloor$ the greatest integer not exceeding $x$. Throughout the paper, let $\{W_1^0(x), x\in[0,1]\}$ and $\{W_2^0(x), x\in[0,1]\}$ be two independent standard Brownian bridges, and let ⇒ denote convergence in distribution in the Skorokhod space $D[0,1]$.
Theorem 1.
In model (1), let Assumptions 1 and 2 be satisfied. Then, under the null hypothesis $H_0$ defined by (5), for $0 < k = \lfloor xT\rfloor < T$ and $x\in[0,1]$, we have

$$g^\top(T,k)\,g(T,k) \Rightarrow \sum_{i=1}^2 s_i^2\left(W_i^0(x)\right)^2, \quad \text{as } T\to\infty, \tag{12}$$

where $g(T,k)$, $s_1^2$ and $s_2^2$ are defined by (4) and (8), respectively. Combining this with the continuous mapping theorem, we obtain

$$\max_{1\le k\le T-1} g^\top(T,k)\,g(T,k) \xrightarrow{d} \sup_{0\le x\le1}\sum_{i=1}^2 s_i^2\left(W_i^0(x)\right)^2, \quad \text{as } T\to\infty. \tag{13}$$
Usually, $s_1^2$ and $s_2^2$ in (8) are unknown and must be estimated. By the second-order stationarity in Assumption 3, it is easy to obtain the long-run variances $s_1^2$ and $s_2^2$ defined by (10). Next, we discuss the estimators of $s_1^2$ and $s_2^2$. Let $Z_t = X_t - \bar X$, $1\le t\le T$. Then, $\gamma_1(h)$ and $\gamma_2(h)$ defined in (10) can be estimated by $\hat\gamma_1(h)$ and $\hat\gamma_2(h)$, respectively, as
$$\hat\gamma_1(h) = \frac1T\sum_{t=1}^{T-h}Z_tZ_{t+h}, \quad 0\le h < T, \tag{14}$$

and

$$\hat\gamma_2(h) = \frac1T\sum_{t=1}^{T-h}\left(Z_t^2 - \overline{Z^2}\right)\left(Z_{t+h}^2 - \overline{Z^2}\right), \quad 0\le h < T, \tag{15}$$

where $\overline{Z^2} = \frac1T\sum_{t=1}^T Z_t^2$ and $\bar X = \frac1T\sum_{t=1}^T X_t$.

Thus, the estimators of $s_1^2$ and $s_2^2$ are, respectively, given by

$$\hat s_{T,1}^2 := \hat\gamma_1(0) + 2\sum_{h=1}^{h_T}\hat\gamma_1(h), \quad \hat s_{T,2}^2 := \hat\gamma_2(0) + 2\sum_{h=1}^{h_T}\hat\gamma_2(h), \tag{16}$$

where $\hat\gamma_1(h)$ and $\hat\gamma_2(h)$ are defined by (14) and (15).
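In R, the estimators (14)–(16) amount to truncated sums of sample autocovariances. A minimal sketch (the helper name longrun_vars is ours; $h_T = \lfloor T^{1/5}\rfloor$ is the truncation lag used in Sections 4 and 5):

```r
# Sketch: long-run variance estimators (16) built from (14)-(15).
longrun_vars <- function(x, hT = floor(length(x)^(1/5))) {
  T <- length(x)
  Z <- x - mean(x)                                        # Z_t = X_t - Xbar
  V <- Z^2 - mean(Z^2)                                    # Z_t^2 - mean of Z^2
  g1 <- function(h) sum(Z[1:(T - h)] * Z[(1 + h):T]) / T  # (14)
  g2 <- function(h) sum(V[1:(T - h)] * V[(1 + h):T]) / T  # (15)
  s1sq <- g1(0) + 2 * sum(sapply(1:hT, g1))               # (16)
  s2sq <- g2(0) + 2 * sum(sapply(1:hT, g2))
  c(s1sq = s1sq, s2sq = s2sq)
}
```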
Lemma 1.
In model (1), let Assumptions 3–5 be satisfied. Under the null hypothesis $H_0$ defined by (5), we obtain

$$\hat s_{T,1}^2 \xrightarrow{P} s_1^2, \quad \hat s_{T,2}^2 \xrightarrow{P} s_2^2, \tag{17}$$

where $s_1^2$, $s_2^2$, $\hat s_{T,1}^2$ and $\hat s_{T,2}^2$ are defined by (10) and (16), respectively.
For $1\le k\le T-1$, denote the combination statistic

$$f(T,k) := \left(\frac{g_1(T,k)}{\sqrt{\hat s_{T,1}^2}},\ \frac{g_2(T,k)}{\sqrt{\hat s_{T,2}^2}}\right)^\top = \left(\frac{U_{k,1}}{\sqrt{T\hat s_{T,1}^2}},\ \frac{U_{k,2}}{\sqrt{T\hat s_{T,2}^2}}\right)^\top, \tag{18}$$

where $U_{k,1}$, $U_{k,2}$, $g_1(T,k)$, $g_2(T,k)$, $\hat s_{T,1}^2$ and $\hat s_{T,2}^2$ are defined by (2)–(4) and (16), respectively.
Combining Theorem 1 with Lemma 1, we obtain two corollaries as follows:
Corollary 1.
In model (1), let Assumptions 3–5 be fulfilled. Under the null hypothesis $H_0$ defined by (5), for $0 < k = \lfloor xT\rfloor < T$ and $x\in[0,1]$, we have

$$f^\top(T,k)\,f(T,k) \Rightarrow \sum_{i=1}^2\left(W_i^0(x)\right)^2, \quad \text{as } T\to\infty, \tag{19}$$

where $W_1^0(x)$ and $W_2^0(x)$ are defined in (12). Thus,

$$\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k) \xrightarrow{d} \sup_{0\le x\le1}\sum_{i=1}^2\left(W_i^0(x)\right)^2, \quad \text{as } T\to\infty. \tag{20}$$
Corollary 2.
In model (1), let Assumptions 3–5 be fulfilled. Under the null hypothesis $H_0$ defined by (5), we have

$$\max_{1\le k\le T-1}\left|\frac{g_1(T,k)}{\sqrt{\hat s_{T,1}^2}}\right| \xrightarrow{d} \sup_{0\le x\le1}|W_1^0(x)|, \quad \max_{1\le k\le T-1}\left|\frac{g_2(T,k)}{\sqrt{\hat s_{T,2}^2}}\right| \xrightarrow{d} \sup_{0\le x\le1}|W_2^0(x)|, \tag{21}$$

where $g_1(T,k)/\sqrt{\hat s_{T,1}^2}$ and $g_2(T,k)/\sqrt{\hat s_{T,2}^2}$ are defined by (18), and $W_1^0(x)$ and $W_2^0(x)$ are defined in (12).
Remark 2.
For $i = 1, 2$, by (11.38) in [53], it holds that

$$P\left(\sup_{x\in[0,1]}|W_i^0(x)| \le y\right) = 1 + 2\sum_{k=1}^\infty(-1)^k\exp(-2k^2y^2), \quad y > 0, \tag{22}$$

where $W_1^0(x)$ and $W_2^0(x)$ are defined in (12). Let $\alpha$ ($0 < \alpha < 1$) be the level of significance. For $l\ge1$, let $W_1^0(x), \ldots, W_l^0(x)$ be independent standard Brownian bridges for $x\in[0,1]$. Then, the distribution of

$$\sup_{0\le x\le1}\sum_{i=1}^l\left(W_i^0(x)\right)^2 \tag{23}$$

was derived by Kiefer [54] and has a Fourier–Bessel series expansion. It is not easy to calculate the critical values of this distribution. Lee et al. [29] considered the problem of testing for parameter changes in time series models based on CUSUM statistics and obtained the limiting distribution (23); they used the Monte Carlo method to obtain the critical values $c_\alpha$ for different $\alpha$ and $l$. For example, when $l = 2$, the critical values are $c_{0.05} = 2.408$ and $c_{0.1} = 2.054$ (see [29]).
If $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k) \le c_\alpha$, there is no evidence of a mean change or variance change. Otherwise, we conclude that there is at least one mean change-point or variance change-point.

Similar to multiple testing problems, by (22), we take the critical value $d_{\alpha/2}$ of the distribution of $\sup_{0\le x\le1}|W_1^0(x)|$ to carry out the tests of the mean change-point and the variance change-point, in order to control the type I error. For example, $d_{0.025} = 1.48$ and $d_{0.05} = 1.358$. If $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}| \le d_{\alpha/2}$, there is no evidence of a mean change. Otherwise, we conclude that there is a mean change-point, and its location $\hat k_{T,1}$ is defined in (6).

Meanwhile, by (22), the p-value of $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$ can be defined by $pv_1$ as

$$pv_1 = P\left(\sup_{x\in[0,1]}|W_1^0(x)| > y_0\right) = 2\sum_{k=1}^\infty(-1)^{k+1}\exp(-2k^2y_0^2), \tag{24}$$

where $y_0 = \max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$. Similarly, let $d_{\alpha/2}$ be the critical value of the distribution of $\sup_{0\le x\le1}|W_2^0(x)|$. If $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}| \le d_{\alpha/2}$, there is no evidence of a variance change. Otherwise, we conclude that there is a variance change-point, and its location $\hat k_{T,2}$ is suggested in (7).

In addition, the p-value $pv_2$ of $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$ can be defined by (24), where $|W_1^0(x)|$ is replaced by $|W_2^0(x)|$ and $y_0$ is $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$.
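The alternating series in (24) converges very fast, so the p-values $pv_1$ and $pv_2$ can be computed by simple truncation. A minimal R sketch (the function name pv_sup_bridge is ours):

```r
# Sketch: p-value (24), truncating the alternating series at a large K.
pv_sup_bridge <- function(y0, K = 100) {
  k <- 1:K
  2 * sum((-1)^(k + 1) * exp(-2 * k^2 * y0^2))
}
pv_sup_bridge(1.358)   # approximately 0.05, consistent with d_{0.05} = 1.358
```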

3. The Three-Step Algorithm

Based on the combination statistic $f(T,k)$ in Corollary 1 and the mean change statistic $g_1(T,k)$ and variance change statistic $g_2(T,k)$ in Corollary 2, we give a three-step algorithm to test for changes in the mean and variance in Algorithm 1, i.e., the combination test, the mean change test, and the variance change test. In the following algorithm, we assume that there is at most one mean change-point and one variance change-point in the time series. If there are more change-points, we discuss how to detect them in Remark 3.
Remark 3.
It can be seen that the CUSUM statistic $U_{k,2}$ for the variance change in (3) contains the sample mean, while the CUSUM statistic $U_{k,1}$ for the mean change in (2) does not contain the sample variance. One could use

$$U_{k,2}^* = \frac{k(T-k)}{T}\left[\frac{1}{k}\sum_{t=1}^{k}(X_t - \bar X_k)^2 - \frac{1}{T-k}\sum_{t=k+1}^{T}(X_t - \bar X_{T-k})^2\right], \quad 1\le k\le T-1,$$

to replace $U_{k,2}$, where $\bar X_k = \frac1k\sum_{t=1}^k X_t$ and $\bar X_{T-k} = \frac{1}{T-k}\sum_{t=k+1}^T X_t$. However, the proofs of the limiting distribution and of the consistency of the estimator based on $U_{k,2}^*$ would be complicated; thus, we use $U_{k,2}$ to construct the variance change-point estimator. That is also why we perform the mean change test in Step 2 before the variance change test. To reduce the impact of a mean change on the variance change test, we construct the modified data $X_1', X_2', \ldots, X_T'$ whenever a mean change-point is found, and then proceed to Step 3, the variance change test. Since it is assumed that there is at most one mean change-point and one variance change-point in the time series, Algorithm 1 terminates after Step 3. If there are more change-points, we can further modify the process $\{X_t\}$ by the variance change. For example, based on the data $\{X_1, X_2, \ldots, X_T\}$ (or the modified data $\{X_1', X_2', \ldots, X_T'\}$), define the modified process $X_t''$ as

$$X_t'' = \begin{cases} X_t, & t \le \hat k_{T,2}, \\ \lambda_T^{-1/2} X_t, & t > \hat k_{T,2}, \end{cases}$$

where $\lambda_T = \hat\sigma_{T,2}^2/\hat\sigma_{T,1}^2$, $\hat\sigma_{T,1}^2 = \frac{1}{\hat k_{T,2}}\sum_{t=1}^{\hat k_{T,2}}(X_t - \bar X)^2$, $\hat\sigma_{T,2}^2 = \frac{1}{T - \hat k_{T,2}}\sum_{t=1+\hat k_{T,2}}^{T}(X_t - \bar X)^2$ and $\bar X = \frac1T\sum_{t=1}^T X_t$. Then, based on the modified data $\{X_1'', \ldots, X_T''\}$, we can combine the three-step algorithm with iterative methods to detect more change-points. For further details, one can refer to [20,21] and the sources detailed therein.
For further studies of multiple change-point detection, reference can be made to [35] and the sources detailed therein. Next, we discuss the measure of accuracy for multiple change-point detection. Based on a time series observation $\{X_1, X_2, \ldots, X_T\}$, assume that there are $L_0$ true change-points, denoted by $k_1, k_2, \ldots, k_{L_0}$, and that a detection method reports $\hat L_T$ change-points, denoted by $\hat k_{T,1}, \hat k_{T,2}, \ldots, \hat k_{T,\hat L_T}$. Following [55,56], the set of correctly detected change-points, the True Positives (TP), is defined as

$$TP = \left\{k_i : \exists\, \hat k_{T,j} \text{ such that } |\hat k_{T,j} - k_i| \le m,\ 1\le j\le \hat L_T\right\}, \quad i = 1, 2, \ldots, L_0, \tag{25}$$

where $m > 0$ is a margin size. Then, the Precision, Recall, and F1-score are defined as follows

$$\text{Precision} = \frac{|TP|}{\hat L_T}, \quad \text{Recall} = \frac{|TP|}{L_0}, \quad \text{F1-score} = \frac{2\,\text{Precision}\times\text{Recall}}{\text{Precision}+\text{Recall}}, \tag{26}$$

where $|TP|$ denotes the cardinality of the set $TP$.
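A small R sketch of the scores (25)–(26), under our naming assumptions (truth and detected are integer vectors of true and detected locations, m is the margin; cp_scores is a hypothetical helper):

```r
# Sketch: Precision, Recall and F1-score from (25)-(26).
cp_scores <- function(truth, detected, m) {
  TP <- sum(sapply(truth, function(k) any(abs(detected - k) <= m)))
  prec <- TP / length(detected)   # fraction of detections that are correct
  rec  <- TP / length(truth)      # fraction of true change-points recovered
  c(Precision = prec, Recall = rec, F1 = 2 * prec * rec / (prec + rec))
}
```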
Algorithm 1 Three-step algorithm
  • Input: data $\{X_1, X_2, \ldots, X_T\}$; the level of significance $\alpha$; the critical values $c_\alpha$ and $d_{\alpha/2}$ defined in Remark 2. Denote by $K$ the estimator of the change-point locations.
  • Initialize: $K \leftarrow \emptyset$
  • /* Step 1: the combination test */
  • if $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k)$ in (20) is less than $c_\alpha$ then
  •   There is no evidence of a mean change or variance change, and the algorithm terminates.
  • else
  •   There is at least one mean change-point or variance change-point; start Step 2.
  •   /* Step 2: the mean change test */
  •   if $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$ in (21) is less than $d_{\alpha/2}$ then
  •     There is no evidence of a mean change; start Step 3.
  •   else
  •     There is a mean change-point suggested by $\hat k_{T,1}$ in (6). Denote it by $(\hat k_{T,1}, 1)$, where 1 stands for a change in mean. Do $K \leftarrow K \cup \{(\hat k_{T,1}, 1)\}$. In addition, modify the process $X_t$ by the mean change as
$$X_t' = \begin{cases} X_t, & t \le \hat k_{T,1}, \\ X_t - (\bar\theta_{T,2} - \bar\theta_{T,1}), & t > \hat k_{T,1}, \end{cases}$$
      where $\bar\theta_{T,1} = \frac{1}{\hat k_{T,1}}\sum_{t=1}^{\hat k_{T,1}} X_t$ and $\bar\theta_{T,2} = \frac{1}{T - \hat k_{T,1}}\sum_{t=\hat k_{T,1}+1}^{T} X_t$. Update $\{X_1, X_2, \ldots, X_T\} \leftarrow \{X_1', X_2', \ldots, X_T'\}$ and start Step 3.
  •   end if
  •   /* Step 3: the variance change test */
  •   if $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$ in (21) is less than $d_{\alpha/2}$ then
  •     There is no evidence of a variance change, and the algorithm terminates.
  •   else
  •     There is a variance change-point suggested by $\hat k_{T,2}$ in (7). Denote it by $(\hat k_{T,2}, 2)$, where 2 stands for a change in variance. Do $K \leftarrow K \cup \{(\hat k_{T,2}, 2)\}$.
  •   end if
  • end if
  • Output: $K$
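Assembling the pieces, a compact R sketch of Algorithm 1 reads as follows. It relies on the hypothetical helpers cusum_stats and longrun_vars sketched in Sections 1 and 2 and on the critical values $c_{0.1} = 2.054$ and $d_{0.05} = 1.358$ from Remark 2 (i.e., $\alpha = 0.1$); it is an illustration rather than the authors' implementation:

```r
# Sketch of Algorithm 1: combination test, mean test, then variance test.
three_step <- function(x, c_alpha = 2.054, d = 1.358) {
  T <- length(x)
  U <- cusum_stats(x); s <- longrun_vars(x)
  f2 <- U$U1^2 / (T * s["s1sq"]) + U$U2^2 / (T * s["s2sq"])  # f'(T,k) f(T,k)
  K <- list()
  if (max(f2) < c_alpha) return(K)            # Step 1: no change-point at all
  A1 <- abs(U$U1) / sqrt(T * s["s1sq"])
  if (max(A1) >= d) {                         # Step 2: mean change test
    k1 <- which.max(A1)
    K$mean <- k1
    shift <- mean(x[(k1 + 1):T]) - mean(x[1:k1])
    x[(k1 + 1):T] <- x[(k1 + 1):T] - shift    # modified data X_t'
    U <- cusum_stats(x); s <- longrun_vars(x) # recompute on modified data
  }
  A2 <- abs(U$U2) / sqrt(T * s["s2sq"])
  if (max(A2) >= d) K$var <- which.max(A2)    # Step 3: variance change test
  K
}
```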

4. Simulations

In this section, some simulations illustrate the empirical detection probabilities of the change-point statistics $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k)$, $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$ and $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$ defined by (18). For $T\ge2$, we consider the mean-variance model

$$X_t = \mu_1 I(1\le t\le k_1) + \mu_2 I(k_1+1\le t\le T) + \sigma_1 e_t I(1\le t\le k_2) + \sigma_2 e_t I(k_2+1\le t\le T), \quad 1\le t\le T, \tag{27}$$

where $\mu_1$ and $\mu_2$ are the mean parameters, $\sigma_1 > 0$ and $\sigma_2 > 0$ are the variance parameters, and $k_1$ and $k_2$ are the mean change-point location and the variance change-point location, respectively. Let $e = (e_1, e_2, \ldots, e_T)^\top$ be a random vector with $Ee = 0$ and $\mathrm{Cov}(e) = \Sigma_T$ satisfying $\Sigma_T = (\xi^{|i-j|})_{1\le i,j\le T}$ for some $|\xi| < 1$. It is easy to see that $e_1, e_2, e_3, \ldots$ are $\alpha$-mixing random variables with mixing coefficient $\alpha(t) = O(|\xi|^t)$.
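One sample from model (27) with this covariance structure can be generated by a stationary AR(1) recursion, since an AR(1) process with coefficient $\xi$ and unit marginal variance has exactly $\mathrm{Cov}(e_i, e_j) = \xi^{|i-j|}$. A minimal R sketch (sim_model is a hypothetical name; sig1 and sig2 denote the standard deviations $\sigma_1$ and $\sigma_2$):

```r
# Sketch: one sample X_1,...,X_T from model (27) with Cov(e) = (xi^|i-j|).
sim_model <- function(T, mu1, mu2, sig1, sig2, k1, k2, xi) {
  e <- numeric(T)
  e[1] <- rnorm(1)
  for (t in 2:T) e[t] <- xi * e[t - 1] + sqrt(1 - xi^2) * rnorm(1)
  mu  <- c(rep(mu1, k1), rep(mu2, T - k1))    # mean shift at k1
  sig <- c(rep(sig1, k2), rep(sig2, T - k2))  # variance change at k2
  mu + sig * e
}
```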
Consider the null hypothesis $H_0$: $\mu_1 = \mu_2 = \mu_0$ and $\sigma_1^2 = \sigma_2^2 = \sigma_0^2$, and the alternative hypothesis $H_A$: $\mu_1 \neq \mu_2$ or $\sigma_1^2 \neq \sigma_2^2$. For simplicity, we consider 4 different cases as follows:

Case 1: $\mu_1 = \mu_2 = 1$ and $\sigma_1^2 = \sigma_2^2 = 1$; Case 2: $\mu_1 = 1$, $\mu_2 = 1.5$ and $\sigma_1^2 = \sigma_2^2 = 1$;

Case 3: $\mu_1 = \mu_2 = 1$ and $\sigma_1^2 = 1$, $\sigma_2^2 = 2$; Case 4: $\mu_1 = 1$, $\mu_2 = 1.5$ and $\sigma_1^2 = 1$, $\sigma_2^2 = 2$.

The mean change-point location $k_1$ and the variance change-point location $k_2$ will be given later. Denote

$$A_{T,0} = \max_{1\le k\le T-1} f^\top(T,k)\,f(T,k), \quad A_{T,1} = \max_{1\le k\le T-1}\left|\frac{g_1(T,k)}{\sqrt{\hat s_{T,1}^2}}\right|, \quad A_{T,2} = \max_{1\le k\le T-1}\left|\frac{g_2(T,k)}{\sqrt{\hat s_{T,2}^2}}\right|.$$
The details of our algorithm to detect change-points in the mean-variance model can be found in Section 3.
First, we consider Cases 1–4 for the mean-variance model (27) based on the multivariate normal distribution. Let $e \sim N_T(0, \Sigma_T)$ with dependence parameter $\xi$ in $\Sigma_T$. The level of significance is taken as $\alpha = 0.05$. With $\xi \in \{-0.3, 0, 0.3\}$, $h_T = \lfloor T^{1/5}\rfloor$, $k_1 = \lfloor T/4\rfloor$, $k_2 = \lfloor T/2\rfloor$ and $T \in \{300, 600, 900\}$, we obtain the empirical sizes and powers of the statistics $A_{T,0}$, $A_{T,1}$ and $A_{T,2}$, denoted by $p_0$, $p_1$ and $p_2$, respectively. The results for $p_0$, $p_1$ and $p_2$ are reported in Table 1, based on 1000 replications.
By Table 1, we give some comments here:

For Case 1: the mean and variance are unchanged. The empirical sizes $p_0$ of $A_{T,0}$ are around the level of significance $\alpha = 0.05$, while the empirical sizes $p_1$ and $p_2$ of $A_{T,1}$ and $A_{T,2}$ are smaller than $\alpha/2 = 0.025$.

For Case 2: the mean changes, while the variance does not. The powers $p_0$ of $A_{T,0}$ and $p_1$ of $A_{T,1}$ go to 1 as the sample size $T$ increases, while the powers $p_2$ of $A_{T,2}$ remain smaller than 0.025.

For Case 3: the variance changes, while the mean does not. The powers $p_0$ of $A_{T,0}$ and $p_2$ of $A_{T,2}$ increase to 1 as the sample size $T$ increases, while the powers $p_1$ of $A_{T,1}$ remain around 0.025.

For Case 4: the mean and variance both change. The powers $p_0$ of $A_{T,0}$, $p_1$ of $A_{T,1}$ and $p_2$ of $A_{T,2}$ all go to 1 as the sample size $T$ increases.
Second, we consider the multivariate t distribution. Let $X_1 \sim N_T(0, \Sigma_T)$ and $X_2 \sim \chi^2(n)$, with $X_1$ and $X_2$ independent. Then $t = X_1/\sqrt{X_2/n}$ has a multivariate t distribution, denoted by $t_T(0, \Sigma_T, n)$. Similar to Table 1, we replace $e \sim N_T(0, \Sigma_T)$ by $e \sim t_T(0, \Sigma_T, 5)$ and obtain the results for $p_0$, $p_1$ and $p_2$ in Table 2.
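These heavier-tailed errors can be drawn by scaling the same AR(1) normal vector with an independent chi-square variable, following the construction above. A sketch (rmvt_ar1 is a hypothetical name; note that the resulting $e_t$ has variance $n/(n-2)$ rather than 1, which the tests absorb through the long-run variance estimators):

```r
# Sketch: e ~ t_T(0, Sigma_T, n): correlated normals over sqrt(chi^2_n / n).
rmvt_ar1 <- function(T, xi, n = 5) {
  z <- numeric(T)
  z[1] <- rnorm(1)
  for (t in 2:T) z[t] <- xi * z[t - 1] + sqrt(1 - xi^2) * rnorm(1)
  z / sqrt(rchisq(1, df = n) / n)   # one chi-square draw for the whole vector
}
```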
Compared to Table 1, we find that the powers $p_1$ for Cases 2 and 3 in Table 2 are not identical to those in Table 1, but the sizes and powers $p_0$ and $p_2$ for Cases 1–4 in Table 2 are as good as those in Table 1. This may be because the multivariate t distribution with 5 degrees of freedom has heavier tails, which affects the mean change test.
Thirdly, we discuss the accuracy in terms of Precision, Recall, and F1-score, defined by (26), for the change-point Cases 2–4. Killick and Eckley [12] studied methods of change-point detection and provided 'cpt.mean', 'cpt.meanvar' and 'cpt.var' in the R package changepoint for the mean change, mean-variance change and variance change, respectively. Recently, Meier et al. [13] provided the R package mosum to detect change-points using moving sum statistics. We refer to the corresponding procedures as the 'cpt.meanvar' algorithm and the 'mosum' algorithm, respectively, and compare these two methods with our method presented in Section 3. Under the same settings as in Table 1 and Table 2, we take $m = 0.1T$ in (25). When the sample size $T$ is 300, 600 and 900, the bandwidth $G$ in the mosum method is taken as 100, 120 and 150, respectively; here, $G$ should be less than one half of the sample size (see the R package mosum). We then obtain the results for Precision, Recall, and F1-score in Table 3 and Table 4 under the multivariate normal distribution and the multivariate t distribution, respectively.
Since the mosum method in [13] is mainly designed to detect mean change-points, the Precision, Recall, and F1-score of the mosum algorithm in Table 3 and Table 4 are worse than those of the cpt.meanvar algorithm and our algorithm under Cases 3 and 4. By Table 3, under the multivariate normal case, the results of our algorithm for Cases 2 and 3 are as good as those of the cpt.meanvar algorithm, while the results of our algorithm for Case 4 are better than those of the cpt.meanvar algorithm. Furthermore, by Table 4, the results of our algorithm are better than those of the cpt.meanvar algorithm under the multivariate t distribution.

5. The Real Data Analysis

In this section, we give three examples with real-world data to illustrate our three-step test for the change-point detection of mean and variance. The statistics $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k)$, $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$ and $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$ can be found in Section 3.
Example 1.
The dataset is the annual flow of the river Nile at Aswan from 1871 to 1970 (see [57]), and it contains 100 observations, denoted by $x_t$, $1\le t\le 100$ (see Figure 1). It measures the annual discharge at Aswan in $10^8$ m$^3$ and is depicted in Figure 1. The sample autocorrelation function (ACF) is also presented in Figure 1. On the right side of Figure 1, the autocorrelation coefficient is relatively large when the lag is small, but it approaches zero as the lag increases; therefore, the data are consistent with the $\alpha$-mixing property. From Figure 1, it seems that there is a mean change-point in the time series of the annual flow of the river Nile.

To judge the existence of change-points, we set the null hypothesis that the annual flow of the river Nile has no change in mean or variance, and use the three-step algorithm given in Section 3. Based on $x_1, x_2, \ldots, x_{100}$, in Step 1 we take $\alpha = 0.1$, $T = 100$, $h_T = \lfloor T^{1/5}\rfloor$ and obtain $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k) = 4.6048 > 2.054$. Therefore, we reject the null hypothesis and conclude that there is at least one change-point of mean or variance. In Step 2, $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}| = 1.7838 > 1.358$ with p-value $pv_1 = 0.0034 < 0.05$, which means that there is a mean change-point located at $\hat k_{T,1} = \operatorname{argmax}_{1\le k\le T-1}|U_{k,1}| = 28$. Meanwhile, Step 3 with the data modified for the mean change gives $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}| = 1.0415 < 1.358$ with p-value $pv_2 = 0.2282 > 0.05$, so there is no evidence of a variance change-point. Consequently, we conclude that there is only one mean change-point in the time series of the annual flow of the river Nile. It is pointed out that change-point 28 is the year 1898, when the Aswan dam was built; the dam significantly changed the mean annual flow of the river Nile. In addition, Zeileis et al. [58] used the F test to detect the same change-point 28. We also used the cpt.meanvar method (see [12]) and the mosum method with bandwidth $G = 20$ (see [13]) in R and obtained the same change-point 28.
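For reproducibility, the Nile series ships with base R (datasets::Nile), so the sketch from Section 3 can be applied directly; the output below assumes the hypothetical three_step helper sketched there:

```r
# Sketch: applying the three-step sketch to the Nile annual flow data.
x <- as.numeric(Nile)   # 100 annual observations, 1871-1970
three_step(x)           # a mean change-point near k = 28 (year 1898) is expected
```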
Example 2.
The dataset consists of the prices of AMD stock downloaded with Python; it contains 212 observations from 3 March 2008 to 31 December 2008. Let $P_t$ be the closing price of AMD stock; the return is defined as $r_t = \log P_t - \log P_{t-1}$ with $P_0 = 1$, for $1\le t\le 212$. Figure 2 shows the time series of returns of AMD stock and its sample ACF.

From the right side of Figure 2, the time series of returns is consistent with the $\alpha$-mixing property. In addition, from the left side of Figure 2, the returns are around zero, but the variance of the returns seems to change. Therefore, we use the three-step algorithm to find the change-points, with the null hypothesis that the return of AMD stock has no change in mean or variance. Based on the sample $r_1, r_2, \ldots, r_{212}$, in Step 1 we take $\alpha = 0.1$, $T = 212$, $h_T = \lfloor T^{1/5}\rfloor$ and obtain $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k) = 4.4340 > 2.054$. In Step 2, $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}| = 0.8974 < 1.358$ with $pv_1 = 0.3963 > 0.05$, which means that there is no evidence of a mean change-point. In Step 3, $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}| = 1.9398 > 1.358$ with $pv_2 = 0.0011 < 0.05$. Therefore, we detect a variance change-point located at $\hat k_{T,2} = \operatorname{argmax}_{1\le k\le T-1}|U_{k,2}| = 136$ (12 September 2008). On the other hand, for independent normal random variables, the change-point detection of variance was investigated in [5,25]; using the methods in [5,25], we detect the variance change-points 136 and 137, respectively. We also used the cpt.meanvar method in R and obtained the change-point 136, whereas the mosum method did not detect any change-point. Obviously, the difference between 136 and 137 is only one observation. Furthermore, it is known that Lehman Brothers declared bankruptcy on 15 September 2008, i.e., point 137, which added financial risk to the stock market. Consequently, the variance of the returns of AMD stock began to increase around the time of the bankruptcy of Lehman Brothers.
Example 3.
The dataset is the quarterly US ex-post real interest rate from 1961:Q1 to 1986:Q3 provided by the Citibase data bank (see [59]). The data are also available from the R package strucchange (see [60]) and are denoted by $x_t$, $1\le t\le 103$ (see Figure 3). The sample ACF of the quarterly US ex-post real interest rate is also shown in Figure 3.

Similarly, from the right side of Figure 3, the time series of the US ex-post real interest rate is consistent with the $\alpha$-mixing property, and from the left side of Figure 3, it seems that there are some change-points of mean or variance. We again use the three-step algorithm, with the null hypothesis that the quarterly US ex-post real interest rate has no change in mean or variance. Based on $x_1, x_2, \ldots, x_{103}$, with $\alpha = 0.1$, $T = 103$, $h_T = \lfloor T^{1/5}\rfloor$, Step 1 gives $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k) = 3.9193 > 2.054$, which means that there exist some change-points in the time series $\{x_1, x_2, \ldots, x_{103}\}$. In Step 2, $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}| = 1.5973 > 1.358$ with $pv_1 = 0.0122 < 0.05$, so there is a mean change-point at $\hat k_{T,1} = \operatorname{argmax}_{1\le k\le T-1}|U_{k,1}| = 76$ (1979:Q4). Then, Step 3 with the data modified for the mean change gives $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}| = 1.3779 > 1.358$ with $pv_2 = 0.0449 < 0.05$; in other words, there is a variance change-point located at $\hat k_{T,2} = \operatorname{argmax}_{1\le k\le T-1}|U_{k,2}| = 51$ (1973:Q3). Consequently, we detect a mean change-point at 76 and a variance change-point at 51. In addition, the mosum method in R with bandwidth $G = 25$ detects two change-points, 47 and 76, and cpt.meanvar finds two change-points, 47 and 79. The differences between the change-point sets $\{51, 76\}$, $\{47, 76\}$ and $\{47, 79\}$ are small. However, we identify point 76 as a mean change-point and point 51 as a variance change-point, while the mosum and cpt.meanvar methods do not specify the types of these change-points; thus, our method has an advantage over theirs. Furthermore, it is pointed out that the sudden jump in oil prices in 1973 added to the volatility of the US ex-post real interest rate, and that the Federal Reserve's change in operating procedures in October 1979 increased the mean of the US ex-post real interest rate (see [59]).

6. Conclusions

Many researchers have studied mean change-point models and variance change-point models and obtained the limiting distributions of the CUSUM statistics for the mean change-point and the variance change-point (see [3,6]). As far as we know, few papers study a change-point model with both a mean change and a variance change. In this paper, we consider the mean-variance change-point model (1) with $\alpha$-mixing errors. Based on the CUSUM statistics of the mean and variance, we give the combination statistic $g(T,k)$ in (4). To determine whether there is a change-point in the mean or variance, the limiting distributions of $g^\top(T,k)\,g(T,k)$ and $\max_{1\le k\le T-1} g^\top(T,k)\,g(T,k)$ are obtained under the null hypothesis that there is no change in mean or variance. Consistent estimators $\hat s_{T,1}^2$ and $\hat s_{T,2}^2$ of the long-run variances $s_1^2$ and $s_2^2$ are presented in (17) of Lemma 1. Then, we obtain the limiting distributions of the combination statistic $\max_{1\le k\le T-1} f^\top(T,k)\,f(T,k)$, the mean CUSUM statistic $\max_{1\le k\le T-1}|g_1(T,k)/\sqrt{\hat s_{T,1}^2}|$ and the variance CUSUM statistic $\max_{1\le k\le T-1}|g_2(T,k)/\sqrt{\hat s_{T,2}^2}|$ in Corollaries 1 and 2. As an application, we give a three-step algorithm for change-point detection: the first step tests whether there is at least one change-point; the second and third steps detect the mean change-point and the variance change-point, respectively. To illustrate our three-step test, some simulations and three real data examples are presented in Section 4 and Section 5, respectively. Our algorithm has an advantage over the existing methods cpt.meanvar [12] and mosum [13]: it not only has high power but can also classify a change-point as a mean change-point or a variance change-point. On the other hand, multiple change-point problems for the mean, variance, mean vector, and covariance matrix have gained much attention. In this article, we consider the limiting distribution under the null hypothesis of no change-point; it would be important to investigate the limiting distribution under the alternative hypotheses. It would also be interesting to study these problems for dependent panel data, high-dimensional data, and other dependent data in future work.

7. Proofs of Main Results

Lemma 2
(Lemma 1 in [49]). Let $Y_t = g_t(e_t, \ldots, e_{t-\tau})$, where $g_t$ is a measurable function onto $\mathbb{R}^\upsilon$, and $\tau$ and $\upsilon$ are finite positive integers. If $\{e_t, t\ge1\}$ is $\alpha$-mixing with $\alpha(t) = O(t^{-\lambda})$ for some $\lambda > 0$, then $\{Y_t, t\ge1\}$ is also $\alpha$-mixing with $\alpha(t) = O(t^{-\lambda})$.
Lemma 3
(Proposition 2.5 in [51]). Let $X \in \mathcal{F}_1^k$ and $Y \in \mathcal{F}_{k+t}^\infty$. If $E|X|^p < \infty$ and $E|Y|^q < \infty$ for some $p, q \ge 1$ and $1/p + 1/q < 1$, then

$$|\mathrm{Cov}(X, Y)| \le 8\left(E|X|^p\right)^{1/p}\left(E|Y|^q\right)^{1/q}\alpha^{1 - 1/p - 1/q}(t).$$
Lemma 4
(Lemma 1.4 in [52]). For some $\delta > 0$, let $\{e_t, t\ge1\}$ be a mean zero $\alpha$-mixing sequence with $E|e_t|^{2+\delta} < \infty$ for all $t\ge1$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$. Then

$$E\left(\sum_{t=1}^T e_t\right)^2 \le \left[1 + 16\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t)\right]\sum_{t=1}^T\left(E|e_t|^{2+\delta}\right)^{2/(2+\delta)}, \quad T\ge1.$$
Lemma 5
(Corollary 1 in [47] and Theorem 0 in [48]). For some $\delta > 0$, let $\{e_t, t\ge1\}$ be an $\alpha$-mixing sequence with $Ee_t = 0$ for all $t\ge1$, $\sup_{t\ge1}E|e_t|^{2+\delta} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$. For $T\ge1$, denote $S_T = \sum_{t=1}^T e_t$ and suppose

$$E S_T^2/T \to \sigma^2 > 0, \quad \text{as } T\to\infty. \tag{28}$$

Then

$$\frac{S_T}{\sqrt{T\sigma^2}} \xrightarrow{d} N(0,1), \quad \text{as } T\to\infty. \tag{29}$$

Furthermore,

$$W_T(x) \Rightarrow W(x), \quad \text{as } T\to\infty, \tag{30}$$

where $W_T(x) = S_{\lfloor Tx\rfloor}/\sqrt{T\sigma^2}$ for $x\in[0,1]$, and $\{W(x), x\in[0,1]\}$ is a Wiener process (standard Brownian motion). Then, for $x\in[0,1]$,

$$W_T(x) - xW_T(1) \Rightarrow W(x) - xW(1) \stackrel{d}{=} W^0(x), \quad \text{as } T\to\infty,$$

where $\{W^0(x); x\in[0,1]\}$ is a standard Brownian bridge.
Lemma 6.
Let Assumptions 1 and 2 be satisfied. Denote

$$\xi_T = \frac{\sigma_0}{\sqrt{Ts_1^2}}\sum_{t=1}^T e_t \quad \text{and} \quad \eta_T = \frac{\sigma_0^2}{\sqrt{Ts_2^2}}\sum_{t=1}^T\left(e_t^2 - Ee_t^2\right),$$

where $\sigma_0$ is defined by (5), and $s_1^2$ and $s_2^2$ are defined by (10), respectively. Under the null hypothesis $H_0$ defined by (5), we have

$$(\xi_T, \eta_T) \xrightarrow{d} (\xi, \eta), \quad \text{as } T\to\infty, \tag{31}$$

where $\xi$ and $\eta$ are two independent $N(0,1)$ random variables. For $x\in[0,1]$, let $k = \lfloor xT\rfloor$. Then

$$\left(\frac{\sigma_0}{\sqrt{Ts_1^2}}\sum_{t=1}^{k} e_t,\ \frac{\sigma_0^2}{\sqrt{Ts_2^2}}\sum_{t=1}^{k}\left(e_t^2 - Ee_t^2\right)\right) \Rightarrow (W_1(x), W_2(x)), \tag{32}$$

and

$$\left(\frac{\sigma_0}{\sqrt{Ts_1^2}}\left(\sum_{t=1}^{k} e_t - \frac kT\sum_{t=1}^{T} e_t\right),\ \frac{\sigma_0^2}{\sqrt{Ts_2^2}}\left(\sum_{t=1}^{k} e_t^2 - \frac kT\sum_{t=1}^{T} e_t^2\right)\right) \Rightarrow (W_1^0(x), W_2^0(x)), \tag{33}$$

where $W_1(x)$ and $W_2(x)$ are independent Wiener processes, and $W_1^0(x)$ and $W_2^0(x)$ are independent standard Brownian bridges.
Proof of Theorem 1.
By (1), (2), (4), (5) and $Ee_t = 0$ for all $t\ge1$, it is easy to check that

$$\frac{g_1(T,k)}{\sqrt{s_1^2}} = \frac{U_{k,1}}{\sqrt{Ts_1^2}} = \frac{1}{\sqrt{Ts_1^2}}\cdot\frac{k(T-k)}{T}\left(\frac1k\sum_{t=1}^k X_t - \frac{1}{T-k}\sum_{t=k+1}^T X_t\right) = \frac{1}{\sqrt{Ts_1^2}}\left(\sum_{t=1}^k X_t - \frac kT\sum_{t=1}^T X_t\right) = \frac{\sigma_0}{\sqrt{Ts_1^2}}\left(\sum_{t=1}^k e_t - \frac kT\sum_{t=1}^T e_t\right). \tag{34}$$

Meanwhile, we have

$$\begin{aligned} \frac1k\sum_{t=1}^k(X_t-\bar X)^2 - \frac{1}{T-k}\sum_{t=k+1}^T(X_t-\bar X)^2 &= \left(\frac1k\sum_{t=1}^k X_t^2 - \frac{1}{T-k}\sum_{t=k+1}^T X_t^2\right) - 2\bar X\left(\frac1k\sum_{t=1}^k X_t - \frac{1}{T-k}\sum_{t=k+1}^T X_t\right) \\ &= \left(\frac1k\sum_{t=1}^k(\mu_0+\sigma_0e_t)^2 - \frac{1}{T-k}\sum_{t=k+1}^T(\mu_0+\sigma_0e_t)^2\right) - 2\bar X\left(\frac1k\sum_{t=1}^k(\mu_0+\sigma_0e_t) - \frac{1}{T-k}\sum_{t=k+1}^T(\mu_0+\sigma_0e_t)\right) \\ &= 2\sigma_0(\mu_0-\bar X)\left(\frac1k\sum_{t=1}^k e_t - \frac{1}{T-k}\sum_{t=k+1}^T e_t\right) + \sigma_0^2\left(\frac1k\sum_{t=1}^k e_t^2 - \frac{1}{T-k}\sum_{t=k+1}^T e_t^2\right), \end{aligned}$$

where $\bar X = \frac1T\sum_{t=1}^T X_t$. Combining this with (4), we have

$$\begin{aligned} \frac{g_2(T,k)}{\sqrt{s_2^2}} = \frac{U_{k,2}}{\sqrt{Ts_2^2}} &= \frac{1}{\sqrt{Ts_2^2}}\cdot\frac{k(T-k)}{T}\left[2\sigma_0(\mu_0-\bar X)\left(\frac1k\sum_{t=1}^k e_t - \frac{1}{T-k}\sum_{t=k+1}^T e_t\right) + \sigma_0^2\left(\frac1k\sum_{t=1}^k e_t^2 - \frac{1}{T-k}\sum_{t=k+1}^T e_t^2\right)\right] \\ &= \frac{2\sigma_0(\mu_0-\bar X)}{\sqrt{Ts_2^2}}\left(\sum_{t=1}^k e_t - \frac kT\sum_{t=1}^T e_t\right) + \frac{\sigma_0^2}{\sqrt{Ts_2^2}}\left(\sum_{t=1}^k e_t^2 - \frac kT\sum_{t=1}^T e_t^2\right) := D_{T1} + D_{T2}. \end{aligned} \tag{35}$$

Since $\delta > 0$ and $\sup_{t\ge1}E|e_t|^{4+2\delta} < \infty$, we have $\sup_{t\ge1}E|e_t|^{2+\delta} < \infty$. Then, by Lemma 4 with $\sup_{t\ge1}E|e_t|^{2+\delta} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, we have

$$\mathrm{Var}(\bar X) = E(\bar X - \mu_0)^2 = \frac{\sigma_0^2}{T^2}E\left(\sum_{t=1}^T e_t\right)^2 \le \frac{C_1}{T^2}\left(1 + 16\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t)\right)\sum_{t=1}^T\left(E|e_t|^{2+\delta}\right)^{2/(2+\delta)} = O\left(\frac1T\right), \tag{36}$$

which implies

$$\bar X - \mu_0 = O_P(T^{-1/2}). \tag{37}$$

In addition, we apply (33) in Lemma 6 and obtain

$$\frac{2\sigma_0}{\sqrt{Ts_2^2}}\left(\sum_{t=1}^k e_t - \frac kT\sum_{t=1}^T e_t\right) = O_P(1).$$

Thus, $D_{T1} = O_P(T^{-1/2}) = o_P(1)$, which implies that $D_{T2}$ in (35) is the main term. Lastly, by (34), (35) and $D_{T1} = o_P(1)$, we apply Lemma 6 and obtain (12), i.e.,

$$\left(\frac{g_1(T,k)}{\sqrt{s_1^2}},\ \frac{g_2(T,k)}{\sqrt{s_2^2}}\right) \Rightarrow (W_1^0(x), W_2^0(x)),$$

where $W_1^0(x)$ and $W_2^0(x)$ are independent standard Brownian bridges. In addition, by (12) and the continuous mapping theorem, (13) is also proved. □
Proof of Lemma 1.
First, we prove that $s_1^2$ in (10) converges absolutely. Obviously, $E|e_1|^{8+4\delta} < \infty$ implies $E|e_1|^{2+\delta} < \infty$ for some $\delta > 0$. Then, by the second-order stationarity of the $\alpha$-mixing sequence $\{e_t, t\ge1\}$, we apply Lemma 3 with $Ee_1 = 0$, $Ee_1^2 = 1$, $E|e_1|^{2+\delta} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, and obtain

$$0 < s_1^2 = \gamma_1(0) + 2\sum_{h=1}^\infty\gamma_1(h) \le \mathrm{Var}(X_1) + 2\sum_{h=1}^\infty|\mathrm{Cov}(X_1, X_{1+h})| = \sigma_0^2\left[\mathrm{Var}(e_1) + 2\sum_{h=1}^\infty|\mathrm{Cov}(e_1, e_{1+h})|\right] < \infty. \tag{38}$$

By (10) and (16), we use the decomposition

$$|\hat s_{T,1}^2 - s_1^2| \le |\hat\gamma_1(0) - \gamma_1(0)| + 2\sum_{h=1}^{h_T}|\hat\gamma_1(h) - \gamma_1(h)| + 2\sum_{h=h_T+1}^\infty|\gamma_1(h)| := \sum_{i=1}^3 K_{T,i}, \tag{39}$$

where $h_T\to\infty$ as $T\to\infty$.

Obviously, by (38), (39) and $h_T\to\infty$ as $T\to\infty$, it can be checked that

$$K_{T,3} \to 0, \quad \text{as } T\to\infty. \tag{40}$$

Now, we consider the term $K_{T,1}$ in (39). By the second-order stationarity of $\{e_t\}$, (14), $Ee_1 = 0$ and $\gamma_1(0) = \sigma_0^2 Ee_1^2$, we obtain

$$\hat\gamma_1(0) - \gamma_1(0) = \frac{\sigma_0^2}{T}\sum_{t=1}^T(e_t^2 - Ee_t^2) - \sigma_0^2(\bar e)^2, \tag{41}$$

where $\bar e = \frac1T\sum_{t=1}^T e_t = (\bar X - \mu_0)/\sigma_0$. Combining this with (37), we have

$$\bar e = O_P(T^{-1/2}). \tag{42}$$

Since $E|e_1|^{8+4\delta} < \infty$, we have $E|e_1|^{2(2+\delta)} < \infty$ for some $\delta > 0$. In addition, by the second-order stationarity of the $\alpha$-mixing sequence $\{(e_t^2 - Ee_t^2), t\ge1\}$ with $E(e_1^2 - Ee_1^2) = 0$, $E|e_1^2 - Ee_1^2|^{2+\delta} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, we apply Lemma 4 and obtain

$$E\left|\sum_{t=1}^T(e_t^2 - Ee_t^2)\right|^2 \le \left(1 + 16\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t)\right)\sum_{t=1}^T\left(E|e_t^2 - Ee_t^2|^{2+\delta}\right)^{2/(2+\delta)} = O(T), \tag{43}$$

which implies

$$\left|\sum_{t=1}^T(e_t^2 - Ee_t^2)\right| = O_P(T^{1/2}). \tag{44}$$

Consequently, it follows from (41), (42), (44) and $0 < \sigma_0^2 < \infty$ that

$$|\hat\gamma_1(0) - \gamma_1(0)| \le \sigma_0^2\left|\frac1T\sum_{t=1}^T(e_t^2 - Ee_t^2)\right| + \sigma_0^2(\bar e)^2 = O_P(T^{-1/2}) + O_P(T^{-1}) = O_P(T^{-1/2}). \tag{45}$$

Thus,

$$K_{T,1} = o_P(1). \tag{46}$$
Next, we consider $K_{T,2}$. By (14), $Ee_1 = 0$, $\gamma_1(h) = \sigma_0^2 Ee_1e_{1+h}$ and

$$\hat\gamma_1(h) = \sigma_0^2\left[\frac1T\sum_{t=1}^{T-h}e_te_{t+h} - 2(\bar e)^2 + \frac{T-h}{T}(\bar e)^2 + \frac{\bar e}{T}\sum_{t=T-h+1}^T e_t + \frac{\bar e}{T}\sum_{t=1}^h e_t\right],$$

it can be seen that

$$\hat\gamma_1(h) - \gamma_1(h) = \sigma_0^2\left[\frac1T\sum_{t=1}^{T-h}(e_te_{t+h} - Ee_te_{t+h}) - \frac1T\sum_{t=T-h+1}^T Ee_te_{t+h} - \left(\frac hT + 1\right)(\bar e)^2 + \frac{\bar e}{T}\sum_{t=T-h+1}^T e_t + \frac{\bar e}{T}\sum_{t=1}^h e_t\right] := \sigma_0^2\sum_{i=1}^5 N_{h,i}. \tag{47}$$

By Lemma 2, $\{e_te_{t+h}, 1\le t\le T-h\}$ are $\alpha$-mixing random variables with mixing coefficients of the same order. Thus, by Lemma 4 with $E|e_1|^{2(2+\delta)} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, we establish that

$$E\left(\sum_{t=1}^{T-h}[e_te_{t+h} - E(e_te_{t+h})]\right)^2 \le C_1\sum_{t=1}^{T-h}\left(E|e_te_{t+h} - E(e_te_{t+h})|^{2+\delta}\right)^{2/(2+\delta)} = O(T-h),$$

which implies

$$\left|\sum_{t=1}^{T-h}[e_te_{t+h} - E(e_te_{t+h})]\right| = O_P(\sqrt{T-h}) = O_P(T^{1/2}). \tag{48}$$

By (47), (48), the fact that $h_T = O(T^\beta)$ in (11), and $\beta\in(0,1/2)$, we obtain

$$\sum_{h=1}^{h_T}|N_{h,1}| = O_P(T^{\beta-1/2}) = o_P(1). \tag{49}$$

Meanwhile, by $Ee_1^2 < \infty$, the Hölder inequality and $0 < \beta < 1/2$, we have

$$\sum_{h=1}^{h_T}|N_{h,2}| \le \frac1T\sum_{h=1}^{h_T}\sum_{t=T-h+1}^T\left(Ee_t^2\right)^{1/2}\left(Ee_{t+h}^2\right)^{1/2} \le \frac CT\sum_{h=1}^{h_T}h = O(T^{2\beta-1}) = o(1). \tag{50}$$

By (42),

$$\sum_{h=1}^{h_T}|N_{h,3}| \le \sum_{h=1}^{h_T}\left(\frac hT + 1\right)(\bar e)^2 = O_P(T^{\beta-1}) = o_P(1). \tag{51}$$

Similar to the proof of (44), we have

$$\left|\sum_{t=T-h+1}^T e_t\right| = O_P(\sqrt h). \tag{52}$$

Thus, it follows from (42), (47), (52), $h_T = O(T^\beta)$ and $\beta\in(0,1/2)$ that

$$\sum_{h=1}^{h_T}|N_{h,4}| \le \frac{|\bar e|}{T}\sum_{h=1}^{h_T}\left|\sum_{t=T-h+1}^T e_t\right| = O_P(T^{-3/2})\,O_P(h_T^{3/2}) = O_P(T^{3(\beta-1)/2}) = o_P(1). \tag{53}$$

Similarly,

$$\sum_{h=1}^{h_T}|N_{h,5}| = O_P(T^{3(\beta-1)/2}) = o_P(1). \tag{54}$$

Therefore, by (39), (47), (49)–(54) and $0 < \sigma_0^2 < \infty$, we establish

$$K_{T,2} \le 2\sigma_0^2\sum_{h=1}^{h_T}\sum_{i=1}^5|N_{h,i}| = o_P(1). \tag{55}$$

Finally, it follows from (39), (40), (46) and (55) that

$$|\hat s_{T,1}^2 - s_1^2| = o_P(1), \tag{56}$$

i.e., the first part of (17) is proved.
Next, we prove the second part of (17). Similar to (39), by (10) and (16), it follows that

$$|\hat s_{T,2}^2 - s_2^2| \le |\hat\gamma_2(0) - \gamma_2(0)| + 2\sum_{h=1}^{h_T}|\hat\gamma_2(h) - \gamma_2(h)| + 2\sum_{h=h_T+1}^\infty|\gamma_2(h)| := \sum_{i=1}^3 R_{T,i}, \tag{57}$$

where $h_T\to\infty$ as $T\to\infty$. Similar to the proof of (38), by Lemma 3 with $E|e_1|^{2(2+\delta)} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, it follows that

$$0 < s_2^2 = \gamma_2(0) + 2\sum_{h=1}^\infty\gamma_2(h) \le \mathrm{Var}\left((X_1-\mu_0)^2\right) + 2\sum_{h=1}^\infty\left|\mathrm{Cov}\left((X_1-\mu_0)^2, (X_{1+h}-\mu_0)^2\right)\right| \le \sigma_0^4\left(\mathrm{Var}(e_1^2) + 2\sum_{h=1}^\infty|\mathrm{Cov}(e_1^2, e_{1+h}^2)|\right) < \infty. \tag{58}$$

Thus, we have

$$R_{T,3} \to 0, \quad \text{as } T\to\infty,$$

since $h_T\to\infty$ as $T\to\infty$.

Now, we consider the term $R_{T,1}$ in (57). By (15) and $\gamma_2(0) = E(X_1-\mu_0)^4 - \left(E(X_1-\mu_0)^2\right)^2$, we obtain

$$\begin{aligned} \hat\gamma_2(0) - \gamma_2(0) ={}& \frac1T\sum_{t=1}^T(X_t-\mu_0)^4 + \frac{4(\mu_0-\bar X)}{T}\sum_{t=1}^T(X_t-\mu_0)^3 + \frac{8(\mu_0-\bar X)^2}{T}\sum_{t=1}^T(X_t-\mu_0)^2 + \frac{4(\mu_0-\bar X)^3}{T}\sum_{t=1}^T(X_t-\mu_0) \\ & - \left(\frac1T\sum_{t=1}^T(X_t-\mu_0)^2\right)^2 - E(X_1-\mu_0)^4 + \left(E(X_1-\mu_0)^2\right)^2 \\ ={}& \frac1T\sum_{t=1}^T\left[(X_t-\mu_0)^4 - E(X_t-\mu_0)^4\right] + \frac{4(\mu_0-\bar X)}{T}\sum_{t=1}^T\left[(X_t-\mu_0)^3 - E(X_t-\mu_0)^3\right] \\ & + \frac{8(\mu_0-\bar X)^2}{T}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] + \frac{4(\mu_0-\bar X)^3}{T}\sum_{t=1}^T\left[(X_t-\mu_0) - E(X_t-\mu_0)\right] \\ & - \left(\frac1T\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\right)^2 - \frac{2E(X_1-\mu_0)^2}{T}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & + 4(\mu_0-\bar X)E(X_1-\mu_0)^3 + 8(\mu_0-\bar X)^2E(X_1-\mu_0)^2 + 4(\mu_0-\bar X)^3E(X_1-\mu_0) \\ :={}& \sum_{i=1}^9 L_{T,i}. \end{aligned} \tag{59}$$

By the null hypothesis $H_0$ defined by (5), $X_t = \mu_0 + \sigma_0 e_t$, $t\ge1$. Then, $\{(X_t-\mu_0)^j - E(X_t-\mu_0)^j, 1\le t\le T, j = 1,2,3,4\}$ are also $\alpha$-mixing random variables with mixing coefficients of the same order. Similar to the proof of (43), by Lemma 4 with $E|e_1|^{4(2+\delta)} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, it can be obtained that

$$E\left(\sum_{t=1}^T\left[(X_t-\mu_0)^j - E(X_t-\mu_0)^j\right]\right)^2 = O(T), \quad j = 1,2,3,4, \tag{60}$$

which implies

$$\left|\sum_{t=1}^T\left[(X_t-\mu_0)^j - E(X_t-\mu_0)^j\right]\right| = O_P(T^{1/2}), \quad j = 1,2,3,4. \tag{61}$$

Thus, by (37) and (61), we can obtain that

$$|L_{T,1}| = O_P(T^{-1/2}), \quad |L_{T,2}| = O_P(T^{-1}), \quad |L_{T,3}| = O_P(T^{-3/2}), \tag{62}$$

$$|L_{T,4}| = O_P(T^{-2}), \quad |L_{T,5}| = O_P(T^{-1}), \quad |L_{T,6}| = O_P(T^{-1/2}), \tag{63}$$

$$|L_{T,7}| = O_P(T^{-1/2}), \quad |L_{T,8}| = O_P(T^{-1}), \quad |L_{T,9}| = O_P(T^{-3/2}). \tag{64}$$

By (57), (59) and (62)–(64), we have

$$R_{T,1} = |\hat\gamma_2(0) - \gamma_2(0)| \le \sum_{i=1}^9|L_{T,i}| = O_P(T^{-1/2}) = o_P(1). \tag{65}$$
It remains to consider the term $R_{T,2}$. We can check that

$$\hat\gamma_2(h) = \frac1T\sum_{t=1}^{T-h}Z_t^2Z_{t+h}^2 - \left(\frac{T+h}{T}\right)\left(\overline{Z^2}\right)^2 + \frac{\overline{Z^2}}{T}\sum_{t=T-h+1}^T Z_t^2 + \frac{\overline{Z^2}}{T}\sum_{t=1}^h Z_t^2 := \sum_{i=1}^4 H_{h,i}, \tag{66}$$

where $Z_t = X_t - \bar X$. Combining this with $\gamma_2(h) = E(X_1-\mu_0)^2(X_{1+h}-\mu_0)^2 - E(X_1-\mu_0)^2E(X_{1+h}-\mu_0)^2$, it can be checked that

$$\begin{aligned} \hat\gamma_2(h) - \gamma_2(h) ={}& \sum_{i=1}^4 H_{h,i} - E(X_1-\mu_0)^2(X_{1+h}-\mu_0)^2 + E(X_1-\mu_0)^2E(X_{1+h}-\mu_0)^2 \\ ={}& \frac1T\sum_{t=1}^{T-h}\left[(X_t-\mu_0)^2(X_{t+h}-\mu_0)^2 - E(X_t-\mu_0)^2(X_{t+h}-\mu_0)^2\right] \\ & + \frac{2(\mu_0-\bar X)}{T}\sum_{t=1}^{T-h}\left[(X_t-\mu_0)^2(X_{t+h}-\mu_0) - E(X_t-\mu_0)^2(X_{t+h}-\mu_0)\right] + \frac{(\mu_0-\bar X)^2}{T}\sum_{t=1}^{T-h}\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & + \frac{2(\mu_0-\bar X)}{T}\sum_{t=1}^{T-h}\left[(X_t-\mu_0)(X_{t+h}-\mu_0)^2 - E(X_t-\mu_0)(X_{t+h}-\mu_0)^2\right] + \frac{4(\mu_0-\bar X)^2}{T}\sum_{t=1}^{T-h}\left[(X_t-\mu_0)(X_{t+h}-\mu_0) - E(X_t-\mu_0)(X_{t+h}-\mu_0)\right] \\ & + \frac{2(\mu_0-\bar X)^3}{T}\sum_{t=1}^{T-h}(X_t-\mu_0) + \frac{(\mu_0-\bar X)^2}{T}\sum_{t=1}^{T-h}\left[(X_{t+h}-\mu_0)^2 - E(X_{t+h}-\mu_0)^2\right] + \frac{2(\mu_0-\bar X)^3}{T}\sum_{t=1}^{T-h}(X_{t+h}-\mu_0) \\ & - \frac{T+h}{T}\left(\frac1T\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\right)^2 + \frac{2\left[(\mu_0-\bar X)^2 - E(X_1-\mu_0)^2\right](T+h)}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & + \frac{1}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\sum_{t=T-h+1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] + \frac{2hE(X_1-\mu_0)^2}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & + \frac{E(X_1-\mu_0)^2}{T}\sum_{t=T-h+1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] + \frac{2(\mu_0-\bar X)}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\sum_{t=T-h+1}^T(X_t-\mu_0) \\ & + \frac{2(\mu_0-\bar X)E(X_1-\mu_0)^2}{T}\sum_{t=T-h+1}^T(X_t-\mu_0) + \frac{2(\mu_0-\bar X)^2h}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & - \frac{(\mu_0-\bar X)^2}{T}\sum_{t=T-h+1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] - \frac{2(\mu_0-\bar X)^3}{T}\sum_{t=T-h+1}^T(X_t-\mu_0) \\ & + \frac{1}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\sum_{t=1}^h\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] + \frac{E(X_1-\mu_0)^2}{T}\sum_{t=1}^h\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] \\ & + \frac{2(\mu_0-\bar X)}{T^2}\sum_{t=1}^T\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right]\sum_{t=1}^h(X_t-\mu_0) + \frac{2(\mu_0-\bar X)E(X_1-\mu_0)^2}{T}\sum_{t=1}^h(X_t-\mu_0) \\ & - \frac{(\mu_0-\bar X)^2}{T}\sum_{t=1}^h\left[(X_t-\mu_0)^2 - E(X_t-\mu_0)^2\right] - \frac{2(\mu_0-\bar X)^3}{T}\sum_{t=1}^h(X_t-\mu_0) \\ & - \frac hT E(X_1-\mu_0)^2(X_{1+h}-\mu_0)^2 + \frac{2(\mu_0-\bar X)(T-h)}{T}E(X_1-\mu_0)^2(X_{1+h}-\mu_0) + \frac{2(\mu_0-\bar X)(T-h)}{T}E(X_1-\mu_0)(X_{1+h}-\mu_0)^2 \\ & + \frac{4(\mu_0-\bar X)^2(T-h)}{T}E(X_1-\mu_0)(X_{1+h}-\mu_0) + 4(\mu_0-\bar X)^2E(X_1-\mu_0)^2 + \frac hT\left(E(X_1-\mu_0)^2\right)^2 - \frac{4h}{T}(\mu_0-\bar X)^4 \\ :={}& \sum_{i=1}^{31} G_{h,i}. \end{aligned} \tag{67}$$

According to Lemma 2, it is easy to see that $\{(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j - E(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j,\ i,j = 0,1,2,\ 1\le t\le T-h\}$ are $\alpha$-mixing random variables with mixing coefficients of the same order. Then, similar to the proof of (43), by Lemma 4 with $E|e_1|^{4(2+\delta)} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$, we obtain

$$E\left|\sum_{t=1}^{T-h}\left[(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j - E(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j\right]\right|^2 = O(T), \quad i,j = 0,1,2,$$

which implies

$$\left|\sum_{t=1}^{T-h}\left[(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j - E(X_t-\mu_0)^i(X_{t+h}-\mu_0)^j\right]\right| = O_P(T^{1/2}), \quad i,j = 0,1,2. \tag{68}$$

Thus, by (37), (67) and (68), one can obtain that

$$|G_{h,1}| = O_P(T^{-1/2}), \quad |G_{h,2}| = O_P(T^{-1}), \quad |G_{h,3}| = O_P(T^{-3/2}), \tag{69}$$

$$|G_{h,4}| = O_P(T^{-1}), \quad |G_{h,5}| = O_P(T^{-3/2}), \quad |G_{h,6}| = O_P(T^{-2}), \tag{70}$$

$$|G_{h,7}| = O_P(T^{-3/2}), \quad |G_{h,8}| = O_P(T^{-2}), \quad |G_{h,9}| = O_P(T^{-1}), \tag{71}$$

$$|G_{h,10}| = O_P(T^{-1/2}), \quad |G_{h,11}| = O_P(T^{-3/2}h^{1/2}), \quad |G_{h,12}| = O_P(T^{-3/2}h), \tag{72}$$

$$|G_{h,13}| = O_P(T^{-1}h^{1/2}), \quad |G_{h,14}| = O_P(T^{-2}h^{1/2}), \quad |G_{h,15}| = O_P(T^{-3/2}h^{1/2}), \tag{73}$$

$$|G_{h,16}| = O_P(T^{-5/2}h), \quad |G_{h,17}| = O_P(T^{-2}h^{1/2}), \quad |G_{h,18}| = O_P(T^{-5/2}h^{1/2}), \tag{74}$$

$$|G_{h,19}| = O_P(T^{-3/2}h^{1/2}), \quad |G_{h,20}| = O_P(T^{-1}h^{1/2}), \quad |G_{h,21}| = O_P(T^{-2}h^{1/2}), \tag{75}$$

$$|G_{h,22}| = O_P(T^{-3/2}h^{1/2}), \quad |G_{h,23}| = O_P(T^{-2}h^{1/2}), \quad |G_{h,24}| = O_P(T^{-5/2}h^{1/2}), \tag{76}$$

$$|G_{h,25}| = O(T^{-1}h), \quad |G_{h,26}| = O_P(T^{-1/2}), \quad |G_{h,27}| = O_P(T^{-1/2}), \tag{77}$$

$$|G_{h,28}| = |G_{h,29}| = O_P(T^{-1}), \quad |G_{h,30}| = O(T^{-1}h), \quad |G_{h,31}| = O_P(T^{-3}h). \tag{78}$$

Therefore, by (57), (67), (69)–(78), $h_T = O(T^\beta)$ and $\beta\in(0,1/2)$, we obtain

$$R_{T,2} \le 2\sum_{h=1}^{h_T}\sum_{i=1}^{31}|G_{h,i}| = O_P(T^{\beta-1/2}) = o_P(1). \tag{79}$$

Consequently, it follows from (57), (58), (65) and (79) that

$$|\hat s_{T,2}^2 - s_2^2| = o_P(1), \tag{80}$$

i.e., the second part of (17) is proved. □
Proof of Lemma 6.
By the Cramér–Wold device, it is sufficient to show that

$$a\xi_T + b\eta_T \xrightarrow{d} N(0, a^2 + b^2) \quad \text{for all } a, b\in\mathbb{R}. \tag{81}$$

We rewrite $a\xi_T + b\eta_T = \frac{1}{\sqrt T}\sum_{t=1}^T\zeta_t$, where

$$\zeta_t = \frac{a\sigma_0}{\sqrt{s_1^2}}e_t + \frac{b\sigma_0^2}{\sqrt{s_2^2}}\left(e_t^2 - Ee_t^2\right), \quad 1\le t\le T. \tag{82}$$

Obviously, $\{\zeta_t, t\ge1\}$ is also a mean zero sequence of $\alpha$-mixing random variables with mixing coefficients of the same order. By the null hypothesis $H_0$ defined by (5) and Assumptions 1 and 2, it is easy to check that

$$\begin{aligned} \lim_{T\to\infty}\frac1T\mathrm{Var}\left(\sum_{t=1}^T\zeta_t\right) &= \lim_{T\to\infty}\frac1T\mathrm{Var}\left(\sum_{t=1}^T\left(\frac{a\sigma_0}{\sqrt{s_1^2}}e_t + \frac{b\sigma_0^2}{\sqrt{s_2^2}}e_t^2\right)\right) \\ &= \lim_{T\to\infty}\frac1T\mathrm{Var}\left(\frac{a\sigma_0}{\sqrt{s_1^2}}\sum_{t=1}^T e_t\right) + \lim_{T\to\infty}\frac1T\mathrm{Var}\left(\frac{b\sigma_0^2}{\sqrt{s_2^2}}\sum_{t=1}^T e_t^2\right) + \lim_{T\to\infty}\frac{2ab\sigma_0^3}{T\sqrt{s_1^2s_2^2}}\sum_{i=1}^T\sum_{j=1}^T\mathrm{Cov}(e_i, e_j^2) \\ &= \lim_{T\to\infty}\frac{a^2}{Ts_1^2}\mathrm{Var}\left(\sum_{t=1}^T(X_t-\mu_0)\right) + \lim_{T\to\infty}\frac{b^2}{Ts_2^2}\mathrm{Var}\left(\sum_{t=1}^T(X_t-\mu_0)^2\right) + \lim_{T\to\infty}\frac{2ab}{T\sqrt{s_1^2s_2^2}}\sum_{i=1}^T\sum_{j=1}^T\mathrm{Cov}\left[(X_i-\mu_0),(X_j-\mu_0)^2\right] \\ &= a^2 + b^2. \end{aligned} \tag{83}$$

Thus, by (28) and (83), we apply Lemma 5 with $\sup_{t\ge1}E|e_t|^{4+2\delta} < \infty$ and $\sum_{t=1}^\infty\alpha^{\delta/(2+\delta)}(t) < \infty$ for some $\delta > 0$, and immediately obtain the result of (31).

Next, we prove (32). Denote $S_T^{(1)} = \sigma_0\sum_{t=1}^T e_t$, $S_T^{(2)} = \sigma_0^2\sum_{t=1}^T(e_t^2 - Ee_t^2)$, and

$$W_{T1}(x) = \frac{S_{\lfloor Tx\rfloor}^{(1)}}{\sqrt{Ts_1^2}} \quad \text{and} \quad W_{T2}(x) = \frac{S_{\lfloor Tx\rfloor}^{(2)}}{\sqrt{Ts_2^2}} \quad \text{for } x\in[0,1],$$

where $s_1^2$ and $s_2^2$ are defined by (10). Obviously, $\{e_t, t\ge1\}$ and $\{(e_t^2 - Ee_t^2), t\ge1\}$ are mean zero $\alpha$-mixing sequences with mixing coefficients of the same order. Then, by (29) and (31) and Lemma 5, we obtain (32). Combining this with (30), the proof of (33) is completed. □

Author Contributions

Supervision, W.Y.; software, M.G.; writing–original draft preparation, X.S., X.W. and W.Y. All authors have read and agreed to the published version of the manuscript.

Funding

Yang’s work was funded by NSF of Anhui Province (2008085MA14, 2108085MA06), Quality Engineering Project of Anhui University (2023xjzlgc232); Shi’s work was supported by the NSERC Discovery Grant RGPIN 2022-03264, the Interior Universities Research Coalition and the BC Ministry of Health, and the University of British Columbia Okanagan (UBC-O) Vice Principal Research in collaboration with the UBC-O Irving K. Barber Faculty of Science.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shewhart, W.A. The application of statistics as an aid in maintaining quality of a manufactured product. J. Amer. Statist. Assoc. 1925, 20, 546–548. [Google Scholar] [CrossRef]
  2. Page, E.S. Continuous inspection schemes. Biometrika 1954, 41, 100–115. [Google Scholar] [CrossRef]
  3. Antoch, J.; Hušková, M.; Veraverbeke, N. Change-point problem and bootstrap. J. Nonparametr. Stat. 1995, 5, 123–144. [Google Scholar] [CrossRef]
  4. Bai, J. Least squares estimation of a shift in linear processes. J. Time Series Anal. 1994, 15, 453–472. [Google Scholar] [CrossRef]
  5. Inclán, C.; Tiao, G. Use of cumulative sums of squares for retrospective detection of changes of variance. J. Amer. Statist. Assoc. 1994, 89, 913–923. [Google Scholar]
  6. Gombay, E.; Horváth, L.; Hušková, M. Estimators and tests for change in variances. Statist. Decis. 1996, 14, 145–159. [Google Scholar] [CrossRef]
  7. Csörgő, M.; Horváth, L. Limit Theorems in Change-Point Analysis; Wiley: Chichester, UK, 1997; pp. 170–180. [Google Scholar]
  8. Chen, J.; Gupta, A. Parametric Statistical Change Point Analysis, with Applications to Genetics, Medicine and Finance, 2nd ed.; Birkhäuser: Boston, MA, USA, 2012; pp. 1–30. [Google Scholar]
  9. Shiryaev, A. On stochastic models and optimal methods in the quickest detection problems. Theory Probab. Appl. 2009, 53, 385–401. [Google Scholar] [CrossRef]
  10. Shiryaev, A. Stochastic Disorder Problems; Springer: Berlin/Heidelberg, Germany, 2019; pp. 367–388. [Google Scholar]
  11. Rosenblatt, M. A central limit theorem and a strong mixing condition. Proc. Natl. Acad. Sci. USA 1956, 42, 43–47. [Google Scholar] [CrossRef] [PubMed]
  12. Killick, R.; Eckley, I.A. changepoint: An R Package for Changepoint Analysis. J. Stat. Softw. 2014, 58, 1–19. [Google Scholar] [CrossRef]
  13. Meier, A.; Kirch, C.; Cho, H. mosum: A Package for Moving Sums in Change-Point Analysis. J. Stat. Softw. 2021, 97, 1–42. [Google Scholar] [CrossRef]
  14. Kokoszka, P.; Leipus, R. Change-point in the mean of dependent observations. Statist. Probab. Lett. 1998, 40, 385–393. [Google Scholar] [CrossRef]
  15. Shi, X.P.; Wu, Y.H.; Miao, B.Q. Strong convergence rate of estimators of change-point and its application. Comput. Statist. Data Anal. 2009, 53, 990–998. [Google Scholar] [CrossRef]
  16. Ding, S.S.; Fang, H.Y.; Dong, X.; Yang, W.Z. The CUSUM statistics of change-point models based on dependent sequences. J. Appl. Stat. 2022, 49, 2593–2611. [Google Scholar] [CrossRef] [PubMed]
  17. Zhou, J.; Liu, S.Y. Inference for mean change-point in infinite variance AR(p) process. Stat. Probab. Lett. 2009, 79, 6–15. [Google Scholar] [CrossRef]
  18. Shao, X.; Zhang, X. Testing for change points in time series. J. Amer. Statist. Assoc. 2010, 105, 1228–1240. [Google Scholar] [CrossRef]
  19. Shao, X. Self-normalization for time series: A review of recent developments. J. Amer. Statist. Assoc. 2015, 110, 1797–1817. [Google Scholar] [CrossRef]
  20. Tsay, R. Outliers, level shifts and variance changes in time series. J. Forecast. 1988, 7, 1–20. [Google Scholar] [CrossRef]
  21. Yang, W.Z.; Liu, H.S.; Wang, Y.W.; Wang, X.J. Data-driven estimation of change-points with mean shift. J. Korean Statist. Soc. 2023, 52, 130–153. [Google Scholar] [CrossRef]
  22. Bai, J. Common breaks in means and variance for panel data. J. Econom. 2010, 157, 78–92. [Google Scholar] [CrossRef]
  23. Horváth, L.; Hušková, M. Change-point detection in panel data. J. Time Ser. Anal. 2012, 33, 631–648. [Google Scholar] [CrossRef]
  24. Cho, H. Change-point detection in panel data via double CUSUM statistic. Electron. J. Stat. 2016, 10, 2000–2038. [Google Scholar] [CrossRef]
  25. Chen, J.; Gupta, A. Testing and locating variance change points with application to stock prices. J. Amer. Statist. Assoc. 1997, 92, 739–747. [Google Scholar] [CrossRef]
  26. Lee, S.; Park, S. The cusum of squares test for scale changes in infinite order moving average processes. Scand. J. Stat. 2001, 28, 625–644. [Google Scholar] [CrossRef]
  27. Xu, M.; Wu, Y.; Jin, B. Detection of a change-point in variance by a weighted sum of powers of variances test. J. Appl. Stat. 2019, 46, 664–679. [Google Scholar] [CrossRef]
  28. Berkes, I.; Gombay, E.; Horváth, L. Testing for changes in the covariance structure of linear processes. J. Stat. Plann. Inference 2009, 139, 2044–2063. [Google Scholar] [CrossRef]
  29. Lee, S.; Ha, J.; Na, O. The cusum test for parameter change in time series models. Scand. J. Stat. 2003, 30, 781–796. [Google Scholar] [CrossRef]
  30. Vexler, A. Guaranteed testing for epidemic changes of a linear regression model. J. Stat. Plann. Inference 2006, 136, 3101–3120. [Google Scholar] [CrossRef]
  31. Jin, B.S.; Wu, Y.H.; Shi, X.P. Consistent two-stage multiple change-point detection in linear models. Canad. J. Statist. 2016, 44, 161–179. [Google Scholar] [CrossRef]
  32. Gurevich, G. Optimal properties of parametric Shiryaev-Roberts statistical control procedures. Comput. Model. New Technol. 2013, 17, 37–50. [Google Scholar]
  33. Aue, A.; Hörmann, S.; Horváth, L.; Reimherr, M. Break detection in the covariance structure of multivariate time series models. Ann. Statist. 2009, 37, 4046–4087. [Google Scholar] [CrossRef]
  34. Cho, H.; Kirch, C. Two-stage data segmentation permitting multiscale change points, heavy tails and dependence. Ann. Inst. Statist. Math. 2022, 74, 653–684. [Google Scholar] [CrossRef]
  35. Niu, Y.; Hao, N.; Zhang, H. Multiple change-point detection: A selective overview. Statist. Sci. 2016, 31, 611–623. [Google Scholar] [CrossRef]
  36. Korkas, K.; Fryzlewicz, P. Multiple change-point detection for non-stationary time series using wild binary segmentation. Statist. Sinica 2017, 27, 287–311. [Google Scholar] [CrossRef]
  37. Shi, X.P.; Wu, Y.H.; Rao, C.R. Consistent and powerful graph-based change-point test for high-dimensional data. Proc. Natl. Acad. Sci. USA 2017, 114, 3873–3878. [Google Scholar] [CrossRef] [PubMed]
  38. Shi, X.P.; Wang, X.-S.; Reid, N. A New Class of Weighted CUSUM Statistics. Entropy 2022, 24, 1652. [Google Scholar] [CrossRef]
  39. Chen, F.; Mamon, R.; Nkurunziza, S. Inference for a change-point problem under a generalised Ornstein-Uhlenbeck setting. Ann. Inst. Statist. Math. 2018, 70, 807–853. [Google Scholar] [CrossRef]
  40. Zamba, K.D.; Hawkins, D.M. A multivariate change-point model for change in mean vector and/or covariance structure. J. Qual. Technol. 2009, 41, 285–303. [Google Scholar] [CrossRef]
  41. Oh, H.; Lee, S. On score vector-and residual-based CUSUM tests in ARMA-GARCH models. Stat. Methods Appl. 2018, 27, 385–406. [Google Scholar] [CrossRef]
  42. Jäntschi, L. A test detecting the outliers for continuous distributions based on the cumulative distribution function of the data being tested. Symmetry 2019, 11, 835. [Google Scholar] [CrossRef]
  43. Kengne, W.; Ngongo, I.S. Inference for nonstationary time series of counts with application to change-point problems. Ann. Inst. Statist. Math. 2022, 74, 801–835. [Google Scholar]
  44. Arrouch, M.S.E.; Elharfaoui, E.; Ngatchou-Wandji, J. Change-Point Detection in the Volatility of Conditional Heteroscedastic Autoregressive Nonlinear Models. Mathematics 2023, 11, 4018. [Google Scholar] [CrossRef]
  45. Hall, P.; Heyde, C.C. Martingale Limit Theory and Its Application; Academic Press Inc.: New York, NY, USA, 1980. [Google Scholar]
  46. Lin, Z.Y.; Lu, C.R. Limit Theory for Mixing Dependent Random Variables; Science Press: Beijing, China, 1997. [Google Scholar]
  47. Withers, C.S. Central limit theorems for dependent variables. Z. Wahrsch. Verw. Gebiete. 1981, 57, 509–534. [Google Scholar] [CrossRef]
  48. Herrndorf, N. A Functional Central Limit Theorem for Strongly Mixing Sequences of Random Variables. Z. Wahrsch. Verw. Gebiete 1985, 69, 541–550. [Google Scholar] [CrossRef]
  49. White, H.; Domowitz, I. Nonlinear regression with dependent observations. Econometrica 1984, 52, 143–162. [Google Scholar] [CrossRef]
  50. Györfi, L.; Härdle, W.; Sarda, P.; Vieu, P. Nonparametric Curve Estimation from Time Series; Springer: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
  51. Fan, J.Q.; Yao, Q.W. Nonlinear Time Series. Nonparametric and Parametric Methods; Springer: New York, NY, USA, 2003. [Google Scholar]
  52. Yang, W.Z.; Wang, Y.W.; Hu, S.H. Some probability inequalities of least-squares estimator in non linear regression model with strong mixing errors. Comm. Statist. Theory Methods 2017, 46, 165–175. [Google Scholar] [CrossRef]
  53. Billingsley, P. Convergence of Probability Measures; John Wiley & Sons, Inc.: New York, NY, USA, 1968. [Google Scholar]
  54. Kiefer, J. K-sample analogues of the Kolmogorov-Smirnov and Cramér-v. Mises tests. Ann. Math. Statist. 1959, 30, 420–447. [Google Scholar] [CrossRef]
  55. Bolboacă, S.D.; Jäntschi, L. Predictivity approach for quantitative structure-property models. Application for blood-brain barrier permeation of diverse drug-like compounds. Int. J. Mol. Sci. 2011, 12, 4348–4364. [Google Scholar] [CrossRef]
  56. Truong, C.; Oudre, L.; Vayatis, N. Selective review of offline change-point detection methods. Signal Process. 2020, 167, 107299. [Google Scholar] [CrossRef]
  57. Balke, N. Detecting level shifts in time series. J. Bus. Econom. Statist. 1993, 11, 81–92. [Google Scholar]
  58. Zeileis, A.; Kleiber, C.; Krämer, W.; Hornik, K. Testing and dating of structural changes in practice. Comput. Statist. Data Anal. 2003, 44, 109–123. [Google Scholar] [CrossRef]
  59. Garcia, R.; Perron, P. An analysis of the real interest rate under regime shifts. Rev. Econom. Statist. 1996, 78, 111–125. [Google Scholar] [CrossRef]
  60. Zeileis, A.; Leisch, F.; Hornik, K.; Kleiber, C. strucchange: An R Package for Testing for Structural Change in Linear Regression Models. J. Stat. Softw. 2002, 7, 1–38. [Google Scholar] [CrossRef]
Figure 1. The left side is the time series of the annual flow of the River Nile at Aswan from 1871 to 1970; the right side is the sample ACF for the River Nile.
Figure 2. The left side is the time series of returns of AMD.com stock from March 2008 to December 2008; the right side is the sample ACF for these returns.
Figure 3. The left side is the quarterly US ex-post real interest rate from 1961:Q1 to 1986:Q3; the right side is the sample ACF for these interest rates.
Table 1. Empirical sizes and powers of A_{T,0}, A_{T,1} and A_{T,2} based on N_T(0, Σ_T), with bandwidth h_T = T^{1/5} and significance level α = 0.05; p_0, p_1 and p_2 denote the empirical rejection rates of A_{T,0}, A_{T,1} and A_{T,2}, respectively.

Case 1: μ_1 = μ_2 = 1, σ_1^2 = σ_2^2 = 1, k_1 = T, k_2 = T

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.0590   0.0230   0.0190
  −0.3   600   0.0430   0.0170   0.0120
  −0.3   900   0.0520   0.0220   0.0130
   0     300   0.0500   0.0230   0.0120
   0     600   0.0490   0.0170   0.0130
   0     900   0.0470   0.0140   0.0150
   0.3   300   0.0490   0.0120   0.0180
   0.3   600   0.0440   0.0190   0.0130
   0.3   900   0.0450   0.0170   0.0120

Case 2: μ_1 = 1, μ_2 = 1.5, σ_1^2 = σ_2^2 = 1, k_1 = T/4, k_2 = T

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.9320   0.9180   0.0130
  −0.3   600   1.0000   1.0000   0.0230
  −0.3   900   1.0000   1.0000   0.0270
   0     300   0.6630   0.6090   0.0170
   0     600   0.9750   0.9730   0.0230
   0     900   1.0000   1.0000   0.0190
   0.3   300   0.3820   0.2970   0.0200
   0.3   600   0.8010   0.7680   0.0200
   0.3   900   0.9550   0.9490   0.0150

Case 3: μ_1 = μ_2 = 1, σ_1^2 = 1, σ_2^2 = 2, k_1 = T, k_2 = T/2

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.7920   0.0320   0.7460
  −0.3   600   0.9910   0.0300   0.9880
  −0.3   900   1.0000   0.0250   1.0000
   0     300   0.8890   0.0230   0.8640
   0     600   0.9990   0.0200   0.9980
   0     900   1.0000   0.0170   1.0000
   0.3   300   0.8170   0.0260   0.7630
   0.3   600   0.9910   0.0230   0.9910
   0.3   900   1.0000   0.0220   1.0000

Case 4: μ_1 = 1, μ_2 = 1.5, σ_1^2 = 1, σ_2^2 = 2, k_1 = T/4, k_2 = T/2

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.9740   0.7740   0.8000
  −0.3   600   1.0000   0.9960   0.9960
  −0.3   900   1.0000   1.0000   1.0000
   0     300   0.9360   0.3930   0.8380
   0     600   1.0000   0.8660   1.0000
   0     900   1.0000   0.9820   1.0000
   0.3   300   0.9790   0.7580   0.7910
   0.3   600   1.0000   0.9950   0.9950
   0.3   900   1.0000   1.0000   1.0000
Table 2. Empirical sizes and powers of A_{T,0}, A_{T,1} and A_{T,2} based on t(0, Σ_T, 5), with bandwidth h_T = T^{1/5} and significance level α = 0.05; p_0, p_1 and p_2 denote the empirical rejection rates of A_{T,0}, A_{T,1} and A_{T,2}, respectively.

Case 1: μ_1 = μ_2 = 1, σ_1^2 = σ_2^2 = 1, k_1 = T, k_2 = T

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.0530   0.0240   0.0140
  −0.3   600   0.0470   0.0200   0.0110
  −0.3   900   0.0540   0.0220   0.0190
   0     300   0.0490   0.0160   0.0140
   0     600   0.0570   0.0190   0.0160
   0     900   0.0420   0.0120   0.0190
   0.3   300   0.0380   0.0150   0.0100
   0.3   600   0.0400   0.0140   0.0130
   0.3   900   0.0500   0.0140   0.0200

Case 2: μ_1 = 1, μ_2 = 1.5, σ_1^2 = σ_2^2 = 1, k_1 = T/4, k_2 = T

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.8130   0.7870   0.0150
  −0.3   600   0.9550   0.9470   0.0150
  −0.3   900   0.9800   0.9770   0.0250
   0     300   0.5890   0.5370   0.0130
   0     600   0.8430   0.8200   0.0220
   0     900   0.9310   0.9210   0.0210
   0.3   300   0.3590   0.2930   0.0200
   0.3   600   0.6530   0.6200   0.0180
   0.3   900   0.7920   0.7750   0.0180

Case 3: μ_1 = μ_2 = 1, σ_1^2 = 1, σ_2^2 = 2, k_1 = T, k_2 = T/2

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.8160   0.0280   0.7660
  −0.3   600   0.9960   0.0420   0.9950
  −0.3   900   1.0000   0.0350   1.0000
   0     300   0.8890   0.0270   0.8680
   0     600   0.9980   0.0290   0.9980
   0     900   1.0000   0.0260   1.0000
   0.3   300   0.8370   0.0170   0.8050
   0.3   600   0.9970   0.0160   0.9950
   0.3   900   1.0000   0.0280   1.0000

Case 4: μ_1 = 1, μ_2 = 1.5, σ_1^2 = 1, σ_2^2 = 2, k_1 = T/4, k_2 = T/2

   ξ      T     p_0      p_1      p_2
  −0.3   300   0.9370   0.6460   0.7750
  −0.3   600   1.0000   0.8900   0.9900
  −0.3   900   1.0000   0.9480   1.0000
   0     300   0.9290   0.3740   0.8390
   0     600   1.0000   0.6860   0.9980
   0     900   1.0000   0.8600   1.0000
   0.3   300   0.8170   0.1630   0.6990
   0.3   600   0.9970   0.4700   0.9900
   0.3   900   1.0000   0.6290   1.0000
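To make the simulation design in Tables 1 and 2 concrete, the following R sketch generates one replication of Case 4, with a mean shift at k_1 = T/4 and a variance change at k_2 = T/2. The AR(1) construction of the dependent errors with coefficient ξ is our illustrative reading of Σ_T, not necessarily the paper's exact specification, and the names Tn and x are ours.

```r
# One hypothetical replication of Case 4 with T = 300 and xi = -0.3.
# Assumption (ours): the strong-mixing errors follow an AR(1) scheme with
# coefficient xi, standardized to mean zero and variance one.
set.seed(1)
Tn <- 300; xi <- -0.3
e <- as.numeric(arima.sim(model = list(ar = xi), n = Tn))
e <- (e - mean(e)) / sd(e)                              # mean 0, variance 1
mu    <- c(rep(1, Tn / 4), rep(1.5, 3 * Tn / 4))        # mean shift at k1 = T/4
sigma <- c(rep(1, Tn / 2), rep(sqrt(2), Tn / 2))        # variance change at k2 = T/2
x <- mu + sigma * e
plot.ts(x, main = "Mean shift at T/4, variance change at T/2")
```

For the t(0, Σ_T, 5) setting of Table 2, the Gaussian innovations can be replaced by scaled t(5) innovations, for instance via the rand.gen argument of arima.sim().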
Table 3. Precision, Recall and F1-score of our algorithm and of the cpt.meanvar and mosum algorithms, based on N_T(0, Σ_T) with bandwidth h_T = T^{1/5}. Each cell reports Precision / Recall / F1-score.

Case 2: μ_1 = 1, μ_2 = 1.5, σ_1^2 = σ_2^2 = 1, k_1 = T/4, k_2 = T

   ξ      T    Our algorithm              cpt.meanvar                mosum
  −0.3   300   0.6563 / 0.6518 / 0.6533   0.1738 / 0.1738 / 0.1738   0.3117 / 0.3117 / 0.3117
  −0.3   600   0.8342 / 0.8242 / 0.8275   0.6823 / 0.6823 / 0.6823   0.6304 / 0.6274 / 0.6284
  −0.3   900   0.9141 / 0.9011 / 0.9054   0.9600 / 0.9595 / 0.9597   0.8272 / 0.8242 / 0.8252
   0     300   0.3656 / 0.3626 / 0.3636   0.2088 / 0.2083 / 0.2085   0.3467 / 0.3382 / 0.3410
   0     600   0.7223 / 0.7133 / 0.7163   0.6294 / 0.6284 / 0.6287   0.6364 / 0.6170 / 0.6234
   0     900   0.8102 / 0.8027 / 0.8052   0.8691 / 0.8681 / 0.8685   0.7662 / 0.7509 / 0.7559
   0.3   300   0.1469 / 0.1454 / 0.1459   0.2238 / 0.2188 / 0.2204   0.3526 / 0.3199 / 0.3306
   0.3   600   0.5015 / 0.4950 / 0.4972   0.5325 / 0.5253 / 0.5276   0.5984 / 0.5098 / 0.5380
   0.3   900   0.6783 / 0.6738 / 0.6753   0.7522 / 0.7451 / 0.7474   0.7253 / 0.6248 / 0.6562

Case 3: μ_1 = μ_2 = 1, σ_1^2 = 1, σ_2^2 = 2, k_1 = T, k_2 = T/2

  −0.3   300   0.5085 / 0.4980 / 0.5015   0.2987 / 0.2977 / 0.2980   0.0000 / 0.0000 / 0.0000
  −0.3   600   0.8122 / 0.8012 / 0.8049   0.7453 / 0.7443 / 0.7446   0.0000 / 0.0000 / 0.0000
  −0.3   900   0.9141 / 0.8966 / 0.9024   0.8981 / 0.8981 / 0.8981   0.0000 / 0.0000 / 0.0000
   0     300   0.6374 / 0.6284 / 0.6314   0.3177 / 0.3152 / 0.3160   0.0000 / 0.0000 / 0.0000
   0     600   0.8531 / 0.8442 / 0.8472   0.7642 / 0.7637 / 0.7639   0.0030 / 0.0030 / 0.0030
   0     900   0.9231 / 0.9156 / 0.9181   0.9141 / 0.9126 / 0.9131   0.0000 / 0.0000 / 0.0000
   0.3   300   0.5185 / 0.5125 / 0.5145   0.3327 / 0.3232 / 0.3263   0.0140 / 0.0123 / 0.0128
   0.3   600   0.8042 / 0.7957 / 0.7985   0.6993 / 0.6893 / 0.6926   0.0290 / 0.0230 / 0.0248
   0.3   900   0.8941 / 0.8836 / 0.8871   0.8901 / 0.8820 / 0.8846   0.0220 / 0.0154 / 0.0174

Case 4: μ_1 = 1, μ_2 = 1.5, σ_1^2 = 1, σ_2^2 = 2, k_1 = T/4, k_2 = T/2

  −0.3   300   0.5285 / 0.6369 / 0.5646   0.1828 / 0.3546 / 0.2401   0.0794 / 0.1588 / 0.1059
  −0.3   600   0.8217 / 0.8247 / 0.8227   0.4421 / 0.7794 / 0.5544   0.3152 / 0.6274 / 0.4192
  −0.3   900   0.8971 / 0.8971 / 0.8971   0.6528 / 0.8838 / 0.7297   0.4136 / 0.8242 / 0.5504
   0     300   0.3981 / 0.5879 / 0.4614   0.2043 / 0.3953 / 0.2679   0.1169 / 0.2293 / 0.1543
   0     600   0.7343 / 0.7927 / 0.7537   0.4411 / 0.7686 / 0.5500   0.3192 / 0.6180 / 0.4187
   0     900   0.8506 / 0.8591 / 0.8535   0.6389 / 0.8590 / 0.7121   0.3821 / 0.7504 / 0.5048
   0.3   300   0.5280 / 0.6449 / 0.5669   0.2223 / 0.3996 / 0.2813   0.1484 / 0.2650 / 0.1870
   0.3   600   0.8327 / 0.8367 / 0.8340   0.4291 / 0.6900 / 0.5157   0.3157 / 0.5330 / 0.3865
   0.3   900   0.9041 / 0.9041 / 0.9041   0.5934 / 0.7702 / 0.6517   0.0270 / 0.0380 / 0.0304
Table 4. Precision, Recall and F1-score of our algorithm and of the cpt.meanvar and mosum algorithms, based on t(0, Σ_T, 5) with bandwidth h_T = T^{1/5}. Each cell reports Precision / Recall / F1-score.

Case 2: μ_1 = 1, μ_2 = 1.5, σ_1^2 = σ_2^2 = 1, k_1 = T/4, k_2 = T

   ξ      T    Our algorithm              cpt.meanvar                mosum
  −0.3   300   0.5514 / 0.5470 / 0.5485   0.0649 / 0.0593 / 0.0611   0.1089 / 0.1079 / 0.1082
  −0.3   600   0.7622 / 0.7552 / 0.7576   0.2777 / 0.2626 / 0.2674   0.2577 / 0.2547 / 0.2557
  −0.3   900   0.8392 / 0.8292 / 0.8325   0.5544 / 0.5159 / 0.5275   0.3876 / 0.3856 / 0.3863
   0     300   0.3417 / 0.3402 / 0.3407   0.0839 / 0.0755 / 0.0782   0.1479 / 0.1449 / 0.1459
   0     600   0.5914 / 0.5844 / 0.5867   0.2977 / 0.2770 / 0.2834   0.3087 / 0.3022 / 0.3044
   0     900   0.7163 / 0.7098 / 0.7120   0.5355 / 0.5034 / 0.5128   0.4815 / 0.4720 / 0.4752
   0.3   300   0.1528 / 0.1503 / 0.1512   0.1269 / 0.1149 / 0.1185   0.1798 / 0.1635 / 0.1688
   0.3   600   0.4036 / 0.3981 / 0.3999   0.3137 / 0.2841 / 0.2929   0.3926 / 0.3464 / 0.3614
   0.3   900   0.5455 / 0.5380 / 0.5405   0.4865 / 0.4487 / 0.4596   0.5115 / 0.4668 / 0.4809

Case 3: μ_1 = μ_2 = 1, σ_1^2 = 1, σ_2^2 = 2, k_1 = T, k_2 = T/2

  −0.3   300   0.5185 / 0.5090 / 0.5122   0.2657 / 0.2488 / 0.2539   0.0000 / 0.0000 / 0.0000
  −0.3   600   0.8122 / 0.7937 / 0.7999   0.5345 / 0.5011 / 0.5111   0.0000 / 0.0000 / 0.0000
  −0.3   900   0.9041 / 0.8876 / 0.8931   0.6683 / 0.6324 / 0.6427   0.0000 / 0.0000 / 0.0000
   0     300   0.6104 / 0.6004 / 0.6037   0.2827 / 0.2645 / 0.2700   0.0010 / 0.0010 / 0.0010
   0     600   0.8501 / 0.8377 / 0.8418   0.5485 / 0.5225 / 0.5305   0.0020 / 0.0020 / 0.0020
   0     900   0.9281 / 0.9166 / 0.9204   0.7103 / 0.6640 / 0.6768   0.0000 / 0.0000 / 0.0000
   0.3   300   0.5375 / 0.5315 / 0.5335   0.2837 / 0.2566 / 0.2649   0.0080 / 0.0070 / 0.0073
   0.3   600   0.8482 / 0.8407 / 0.8432   0.5105 / 0.4664 / 0.4797   0.0230 / 0.0208 / 0.0215
   0.3   900   0.9041 / 0.8921 / 0.8961   0.6923 / 0.6437 / 0.6580   0.0180 / 0.0135 / 0.0148

Case 4: μ_1 = 1, μ_2 = 1.5, σ_1^2 = 1, σ_2^2 = 2, k_1 = T/4, k_2 = T/2

  −0.3   300   0.4770 / 0.6139 / 0.5226   0.1538 / 0.2837 / 0.1969   0.0260 / 0.0519 / 0.0346
  −0.3   600   0.7602 / 0.8072 / 0.7759   0.3192 / 0.5548 / 0.3961   0.1284 / 0.2532 / 0.1700
  −0.3   900   0.8546 / 0.8776 / 0.8623   0.4191 / 0.6603 / 0.4960   0.1938 / 0.3856 / 0.2577
   0     300   0.3921 / 0.5799 / 0.4547   0.1728 / 0.3042 / 0.2157   0.0400 / 0.0779 / 0.0526
   0     600   0.6753 / 0.8072 / 0.7193   0.3392 / 0.5856 / 0.4197   0.1543 / 0.3012 / 0.2033
   0     900   0.7912 / 0.8581 / 0.8135   0.4575 / 0.6846 / 0.5277   0.2408 / 0.4720 / 0.3178
   0.3   300   0.2842 / 0.4865 / 0.3516   0.1888 / 0.3281 / 0.2343   0.0699 / 0.1255 / 0.0884
   0.3   600   0.5375 / 0.7522 / 0.6091   0.3362 / 0.5177 / 0.3932   0.2078 / 0.3604 / 0.2582
   0.3   900   0.6573 / 0.8212 / 0.7120   0.4635 / 0.6581 / 0.5228   0.2647 / 0.4744 / 0.3335
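The two competing methods scored in Tables 3 and 4 are available on CRAN: cpt.meanvar() from the changepoint package [12] and mosum() from the mosum package [13]. The sketch below runs them on the simulated series x from the sketch following Table 2 and scores detections against the true change-points; the tolerance-based matching rule (tol) and the helper score() are our illustration and need not coincide with the rule used to produce these tables.

```r
# Run the baseline detectors on x (Tn = 300) and score the detections.
library(changepoint)
library(mosum)

true_cpts <- c(Tn / 4, Tn / 2)
est_cpt <- cpts(cpt.meanvar(x, method = "PELT"))  # mean-variance changes, PELT
est_mos <- mosum(x, G = 40)$cpts                  # moving-sum detector, bandwidth G

score <- function(est, truth, tol = 10) {
  # a true change-point counts as detected if some estimate lies within tol
  hits      <- sum(sapply(truth, function(k) any(abs(est - k) <= tol)))
  precision <- if (length(est) > 0) hits / length(est) else 0
  recall    <- hits / length(truth)
  f1        <- if (precision + recall > 0)
                 2 * precision * recall / (precision + recall) else 0
  c(precision = precision, recall = recall, F1 = f1)
}
score(est_cpt, true_cpts)
score(est_mos, true_cpts)
```

Note that mosum targets changes in the mean, which is consistent with its near-zero scores in the variance-only Case 3 of Tables 3 and 4.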