Abstract
We are interested in an $n \times p$ matrix $X$ whose $n$ rows are strictly stationary $\alpha$-mixing random vectors and whose $p$ columns are independent and identically distributed random vectors; $p = p(n)$ goes to infinity as $n \to \infty$, satisfying the growth condition stated in Theorem 3. We obtain a logarithmic law for $L_n = \max_{1 \le i < j \le p} |\rho_{ij}|$ using the Chen–Stein Poisson approximation method, where $\rho_{ij}$ denotes the sample correlation coefficient between the $i$th column and the $j$th column of $X$.
MSC:
60F15
1. Introduction
Random matrix theory is used in a variety of fields, from physics to various areas of mathematics. Ref. [1] tests the structure of the regression coefficient matrix under the multivariate linear regression model using the LRT statistic. The correlation coefficient matrix is a crucial statistic in multivariate analysis, and its maximum likelihood estimator is the sample correlation matrix. Consider an $n \times p$ matrix $X = (x_{ij})_{n \times p}$ representing $n$ observations from a specific multivariate distribution, with unknown mean, unknown correlation coefficient matrix, and unknown covariance matrix.
This paper shows the logarithmic law for the largest entries of the sample correlation matrix under an $\alpha$-mixing assumption. This study extends the statistical hypothesis-testing problem that [2] analyzed. Ref. [2] considered the statistical test in which the sample size $n$ and the sample dimension $p$ are both large; the null hypothesis is $H_0: R = I_p$, where $R$ is the population correlation matrix and $I_p$ is the identity matrix. The null hypothesis of [2] postulates that the components of the population are uncorrelated and, in the normal case, jointly follow a $p$-variate normal distribution. Ref. [2]’s test statistic is
$$L_n = \max_{1 \le i < j \le p} |\rho_{ij}|,$$
where
$$\rho_{ij} = \frac{\sum_{k=1}^{n}(x_{ki} - \bar{x}_i)(x_{kj} - \bar{x}_j)}{\sqrt{\sum_{k=1}^{n}(x_{ki} - \bar{x}_i)^2}\,\sqrt{\sum_{k=1}^{n}(x_{kj} - \bar{x}_j)^2}}, \qquad \bar{x}_i = \frac{1}{n}\sum_{k=1}^{n} x_{ki},$$
is the Pearson correlation coefficient between the $i$th column and the $j$th column of $X$. Then, $\Gamma_n = (\rho_{ij})_{p \times p}$ is the sample correlation matrix generated by $X$.
Let $x_i = (x_{1i}, \dots, x_{ni})'$ denote the $i$th column of $X$; let $y_i = x_i - \bar{x}_i \mathbf{1}$, $1 \le i \le p$, where $\mathbf{1} = (1, \dots, 1)' \in \mathbb{R}^n$. We have
$$\rho_{ij} = \frac{\langle y_i, y_j \rangle}{\|y_i\|\,\|y_j\|},$$
where $\langle \cdot, \cdot \rangle$ and $\|\cdot\|$ represent the inner product and the Euclidean norm in $\mathbb{R}^n$.
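As a concrete illustration (the sizes and the standard normal entries below are illustrative choices, not prescribed by the paper), the statistic $L_n$ can be computed directly from the centered columns:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 100, 20
X = rng.standard_normal((n, p))  # illustrative i.i.d. N(0, 1) entries

# Center each column; then rho_ij = <y_i, y_j> / (||y_i|| * ||y_j||).
Y = X - X.mean(axis=0)
norms = np.linalg.norm(Y, axis=0)
R = (Y.T @ Y) / np.outer(norms, norms)

# L_n is the largest off-diagonal entry of R in absolute value.
L_n = float(np.abs(R[~np.eye(p, dtype=bool)]).max())
print(round(L_n, 4))
```

The matrix `R` built this way agrees with NumPy's built-in `np.corrcoef(X, rowvar=False)`.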
Ref. [2] proved the following theorems for the test statistic $L_n$ when the entries of $X$ are i.i.d. random variables: the $n$ rows are observations from a certain multivariate distribution, and each of the $p$ columns consists of $n$ observations of one variable of the population distribution.
Theorem 1.
Suppose that the $x_{ij}$ are i.i.d. random variables satisfying the original moment condition, and let $X$ be the $n \times p$ matrix $(x_{ij})$. If $p/n \to \gamma \in (0, \infty)$, then
$$\lim_{n \to \infty} \sqrt{\frac{n}{\log n}}\, L_n = 2 \quad \text{a.s.}$$
Theorem 2.
Suppose that the $x_{ij}$ are i.i.d. random variables with $E|x_{11}|^{30+\epsilon} < \infty$ for some $\epsilon > 0$, and let $X$ be the $n \times p$ matrix $(x_{ij})$. If $p/n \to \gamma \in (0, \infty)$, then, for any $y \in \mathbb{R}$,
$$\lim_{n \to \infty} P\left(n L_n^2 - 4\log n + \log\log n \le y\right) = F(y) := \exp\left\{-\frac{\gamma^2}{\sqrt{8\pi}}\, e^{-y/2}\right\}.$$
Subsequently, many scholars have considered the limiting properties of the largest entries of sample correlation matrices under weaker moment conditions. Ref. [3] showed that the moment condition in Theorem 2 can be weakened. Ref. [4] showed that Theorem 2 holds under a further relaxed moment condition. Ref. [5] improved the moment condition and obtained strong limit theorems for $L_n$. Refs. [6,7] established the corresponding results, with $p/n$ bounded away from zero and infinity, under more relaxed moment assumptions. Meanwhile, when $p$ tends to infinity much faster than $n$, some scholars have also pursued the limiting properties of the largest entries of sample correlation matrices, focusing on the relationship between the sample dimension $p$ and the sample size $n$. Ref. [8] obtained the limit theorem in the ultra-high-dimensional setting without the Gaussian assumption. Most of this work is based on the assumption of sample independence; such an assumption is often reasonable, but it is difficult to verify in practice. It is therefore necessary to study the largest entries of sample correlation matrices under mixing assumptions. Ref. [9] showed the asymptotic distribution of $L_n$ under dependence assumptions. Ref. [10] showed the asymptotic distribution of $L_n$ under an $\alpha$-mixing assumption. In the dependent case, the logarithmic law for $L_n$ remains largely unknown.
We will establish a logarithmic law for $L_n$ under an $\alpha$-mixing assumption. Let $\{X_n, n \ge 1\}$ be a sequence of random variables on a probability space $(\Omega, \mathcal{F}, P)$. Let $\mathcal{F}_a^b$ represent the $\sigma$-field generated by the random variables $\{X_i, a \le i \le b\}$. For any two $\sigma$-fields $\mathcal{A}, \mathcal{B} \subseteq \mathcal{F}$, set $\alpha(\mathcal{A}, \mathcal{B}) = \sup_{A \in \mathcal{A},\, B \in \mathcal{B}} |P(A \cap B) - P(A)P(B)|$. The strong mixing coefficients of $\{X_n, n \ge 1\}$ are defined as $\alpha(n) = \sup_{k \ge 1} \alpha(\mathcal{F}_1^k, \mathcal{F}_{k+n}^\infty)$; if $\alpha(n) \to 0$ as $n \to \infty$, then $\{X_n, n \ge 1\}$ is called $\alpha$-mixing (see [11]).
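For intuition, a Gaussian AR(1) process is a classical example of a strictly stationary $\alpha$-mixing sequence, with geometrically decaying mixing coefficients. The sketch below (the parameter `phi` and sample sizes are illustrative choices, not from the paper) shows the corresponding geometric decay of dependence through the autocorrelations:

```python
import numpy as np

# A Gaussian AR(1) process x_t = phi * x_{t-1} + e_t with |phi| < 1 is
# strictly stationary (after burn-in) and alpha-mixing with geometrically
# decaying mixing coefficients; phi and the sample size are illustrative.
rng = np.random.default_rng(1)
phi, burn, n = 0.5, 500, 20000
x, xs = 0.0, []
for t in range(burn + n):
    x = phi * x + rng.standard_normal()
    if t >= burn:
        xs.append(x)
xs = np.asarray(xs)

# The lag-k autocorrelation of AR(1) is phi**k, mirroring the geometric
# decay of the strong mixing coefficients alpha(n).
ac1 = float(np.corrcoef(xs[:-1], xs[1:])[0, 1])
ac5 = float(np.corrcoef(xs[:-5], xs[5:])[0, 1])
print(round(ac1, 3), round(ac5, 3))
```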
The first section introduces the background of, and the motivation for, this study. In Section 2, we show the main result of this paper. In Section 3, we introduce notations and present some classical or elementary facts, which we include for easier referencing. The proof of the main theorem is discussed in Section 4 and is the main novel ingredient of the paper. In Section 5, we present the significance of the main result and its applications.
2. Main Result
Assumption 1.
Let be a random vector sequence, , and assume is a strictly stationary α-mixing random vector sequence satisfying .
Assumption 2.
Let be a sequence of random vectors, and assume is an i.i.d. random vector sequence.
Remark 1.
For , is independent. Let be a random sampling of ; it is reasonable to suppose is independent. Therefore, under Assumption 2 we can reasonably obtain the logarithmic law of $L_n$.
Theorem 3.
Under Assumptions 1 and 2, let , and define . Suppose that , and , where , , and that , for some . Then,
Corollary 1.
Suppose that are i.i.d. random variables, . Let be an matrix, let , and define . Suppose that , where , , and that , for some . Then,
Remark 2.
Theorem 3 considers the case in which $p$ grows faster than $n$, whereas Theorems 1 and 2 only considered the case in which $p$ and $n$ are of the same order. In Theorem 3, the $n$ rows of $X$ are strictly stationary α-mixing random vectors. This case is more complex than the i.i.d. case considered in Theorems 1 and 2 because of the dependence. Under the i.i.d. assumption, we obtain Corollary 1, which generalizes Theorem 1 from $p$ of the same order as $n$ to a faster-growing $p$; the moment condition of Corollary 1 is also weaker than that of Theorem 1.
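As a numerical sanity check of the i.i.d. case (an illustrative Monte Carlo simulation; the sizes and the standard normal population are our choices, not the paper's), the normalized maximum $\sqrt{n/\log n}\,L_n$ should be close to 2 for moderately large $n$ when $p$ and $n$ are of the same order:

```python
import numpy as np

rng = np.random.default_rng(2)
n = p = 400  # p and n of the same order (here equal); illustrative sizes
X = rng.standard_normal((n, p))

# Largest absolute off-diagonal sample correlation.
R = np.corrcoef(X, rowvar=False)
L_n = np.abs(R[~np.eye(p, dtype=bool)]).max()

# Logarithmic-law normalization: sqrt(n / log n) * L_n should approach 2.
ratio = float(np.sqrt(n / np.log(n)) * L_n)
print(round(ratio, 3))
```

For a single sample of this size, the ratio typically lands within a few tenths of 2; the convergence is slow because the fluctuations of $n L_n^2$ are of constant order.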
3. Preliminaries
The proofs of the main result are intricate. We gather and establish several technical tools that contribute to the proof of the main result; the subsequent lemmas play a crucial role in validating our findings. Lemma 1 can be found in [12].
Lemma 1.
Suppose X and Y are random variables taking their values in Borel spaces $B_1$ and $B_2$, respectively, and suppose U is a uniform-$[0,1]$ random variable independent of $(X, Y)$. Let N be a positive integer and let $\{A_1, \dots, A_N\}$ be a measurable partition of $B_2$. Then there exists a $B_2$-valued random variable $Y^* = f(X, Y, U)$, where f is a measurable function from $B_1 \times B_2 \times [0,1]$ into $B_2$, such that
(i) $Y^*$ is independent of X,
(ii) the probability distributions of $Y^*$ and Y on $B_2$ are identical,
(iii) the probability that $Y^*$ and Y do not lie in the same element of the partition is controlled by the strong mixing coefficient between $\sigma(X)$ and $\sigma(Y)$.
Lemma 2.
Let be a strictly stationary α-mixing sequence of random variables: . If , for some , and , for some , then there exists such that
Proof of Lemma 2.
Let , where . Then, we have . Using Theorem 1 of [13], we have
Then, we can obtain
□
Lemma 3.
Assume is a sequence of α-mixing random variables, , and , , . Then
Proof of Lemma 3.
See Corollary 6.1 in [11]. □
Lemma 4.
For any sequence of independent random variables with mean zero and finite variances, there exists a sequence of independent normal random variables with , , such that
for all and , whenever , . Here, A is a universal constant.
Proof of Lemma 4.
See [14]. □
Lemma 5.
Let be an independent symmetric random variables sequence and . Then, for each integer , there exist positive numbers and depending only on j such that, for all ,
Proof of Lemma 5.
See [15]. □
Lemma 6.
Let $X_1, \dots, X_n$ be independent random variables and $S_k = X_1 + \dots + X_k$. For any $t > 0$ and $s > 0$ with $\max_{1 \le k \le n} P(|S_n - S_k| \ge s) < 1$,
$$P\left(\max_{1 \le k \le n} |S_k| \ge t + s\right) \le \frac{P(|S_n| \ge t)}{1 - \max_{1 \le k \le n} P(|S_n - S_k| \ge s)}.$$
Proof of Lemma 6.
See [16]. □
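Lemma 6 enters later as Ottaviani's maximal inequality (it is invoked under that name in the proof of Lemma 14). The inequality can be checked exactly on a small example by enumerating all equally likely sample paths; the $\pm 1$ step law and the parameters below are illustrative choices:

```python
from itertools import product

# Exact check of Ottaviani's maximal inequality
#   P(max_k |S_k| >= t + s) <= P(|S_n| >= t) / (1 - max_k P(|S_n - S_k| >= s))
# for i.i.d. +/-1 steps, by enumerating all 2^n equally likely paths.
n, t, s = 10, 3, 3

paths = list(product((-1, 1), repeat=n))
total = len(paths)

def partial_sums(path):
    out, acc = [], 0
    for step in path:
        acc += step
        out.append(acc)
    return out

lhs = sum(max(abs(v) for v in partial_sums(path)) >= t + s
          for path in paths) / total
num = sum(abs(sum(path)) >= t for path in paths) / total

# P(|S_n - S_k| >= s) depends only on the number m = n - k of remaining steps.
def tail(m):
    return sum(abs(sum(q)) >= s for q in product((-1, 1), repeat=m)) / 2**m

den = 1 - max(tail(n - k) for k in range(1, n + 1))
assert den > 0
print(lhs <= num / den)  # the inequality holds exactly
```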
Lemma 7.
Let be an independent random variables sequence and for any , ; then, there exists a positive constant depending only on g, such that
Proof of Lemma 7.
See [17]. □
Lemma 8.
Let be an i.i.d. random variables sequence with , , , and . Then,
Proof of Lemma 8.
See [16]. □
The following is the Chen–Stein method, as shown in [18].
Lemma 9.
Let $\{\eta_\alpha, \alpha \in I\}$ be 0–1 random variables on an index set $I$, and for each $\alpha \in I$ let $B_\alpha \subseteq I$ be a neighborhood of dependence with $\alpha \in B_\alpha$. Set $p_\alpha = E\eta_\alpha$, $p_{\alpha\beta} = E(\eta_\alpha \eta_\beta)$, $W = \sum_{\alpha \in I} \eta_\alpha$ and $\lambda = EW$. Then,
$$\left|P(W = 0) - e^{-\lambda}\right| \le \left(b_1 + b_2 + b_3\right) \frac{1 - e^{-\lambda}}{\lambda},$$
where
$$b_1 = \sum_{\alpha \in I} \sum_{\beta \in B_\alpha} p_\alpha p_\beta, \quad b_2 = \sum_{\alpha \in I} \sum_{\alpha \neq \beta \in B_\alpha} p_{\alpha\beta}, \quad b_3 = \sum_{\alpha \in I} E\left|E\left\{\eta_\alpha - p_\alpha \mid \sigma(\eta_\beta, \beta \notin B_\alpha)\right\}\right|,$$
and $\sigma(\eta_\beta, \beta \notin B_\alpha)$ is the $\sigma$-algebra generated by $\{\eta_\beta, \beta \notin B_\alpha\}$. If $\eta_\alpha$ is independent of $\{\eta_\beta, \beta \notin B_\alpha\}$ for each $\alpha$, then $b_3$ vanishes.
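To illustrate the Chen–Stein bound in the simplest possible setting (an illustrative example of our own: independent Bernoulli indicators with singleton neighborhoods, so that $b_2$ and $b_3$ vanish), the approximation error of $P(W=0)$ by $e^{-\lambda}$ can be compared against the bound exactly:

```python
import math

# Chen-Stein Poisson approximation with independent Bernoulli indicators and
# singleton neighborhoods B_a = {a}: then b2 = b3 = 0, b1 = sum of p_a^2, and
# |P(W = 0) - exp(-lambda)| <= b1 * (1 - exp(-lambda)) / lambda.
# The success probabilities below are illustrative.
p = [0.01, 0.02, 0.03, 0.04, 0.05]

lam = sum(p)
p_w0 = math.prod(1 - q for q in p)   # exact P(W = 0), by independence
approx = math.exp(-lam)              # Poisson approximation of P(W = 0)
b1 = sum(q * q for q in p)
bound = b1 * (1 - math.exp(-lam)) / lam

print(abs(p_w0 - approx) <= bound)  # True: the error respects the bound
```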
Now we define
Define for any square matrix .
Lemma 10.
Proof of Lemma 10.
See [19]. □
4. Proofs
The next lemma refers to Lemma 2 in [20]; we generalize its result to the α-mixing condition.
Lemma 11.
Under Assumptions 1 and 2, let , and be constants.
is a sufficient condition for
Proof of Lemma 11.
Without loss of generality, suppose . Note that, for and ,
where . To conclude that the probability on the left-hand side of this inequality is equal to zero, it is sufficient to show that
Let and . Then and . Let g be an integer such that . It is easy to see
We have that
Then, note that is a sequence of α-mixing random variables by Assumption 1. Using Lemma 2, we obtain
It is easy to see that , ,
where and C are constants that may change from line to line. We can obtain that
We could estimate for a large k. We have
for all . Hence,
Finally, since , we have
Hence,
Lemma 12.
Under Assumptions 1 and 2, , . Suppose that , where , . If, for some , , then
as .
Proof of Lemma 12.
It is easy to see that . For any , we know that . We obtain
Note that . By Lemma 11, when , the first and second maxima presented above tend to zero; this holds under the assumptions of Theorem 3. Therefore, the first limit is proved. The second limit follows from the first. Since , we have
the almost sure limit is proved by the relationship between and the rightmost term in (12). □
Let , where is sufficiently small for .
Let , move close to , . moves close to 0, such that . Let , , , . Let
let , , ,.
Lemma 13.
Under the conditions of Theorem 3, let be i.i.d. normal random variables with variance , . Then
Proof of Lemma 13.
We have , where we can write . Since , for some , it follows that
as . Therefore,
as . By Lemma 3, it follows that
as , where , , . We have
as , where , , . We have that , by Lemma 3, we can obtain
as , where . Hence,
as . Recall that , Conclusively,
where . □
Lemma 14.
Let be i.i.d. normal random variables with variance . Then,
Proof of Lemma 14.
Choose and set ; we can suppose that . Using Lemma 13, we have , as , where . Then, we can obtain
as n is large, where we use
as (shown in [16]). We define , for any integer ,
where
By (14),
Since , using the Borel–Cantelli lemma,
Now, let us estimate as in (17).
Define the partial sums and . Observe that the distribution of is equal to that of for all . Thus, by Lemma 6, we have
as n is sufficiently large, since , where Ottaviani’s inequality in Lemma 6 is used in the last inequality. Note that, for fixed g and t, as n is sufficiently large. From (15), we have
Therefore,
where , since g is chosen such that . Using the Borel–Cantelli lemma again, we obtain
Lemma 15.
Let be i.i.d. normal random variables with variance . Then,
Proof of Lemma 15.
We first prove that
as , for , depending only on t and the distribution of . For any , set , and ; we can suppose . Take an integer g satisfying . Then, we have . Since , using the Borel–Cantelli lemma, we obtain
for any . We can see the definition of in (17), and obtain
Now, we prove (22) using Lemma 9.
Let . Set , , , and . Using Lemma 9,
Evidently,
Recall that is a sum of i.i.d. normal random variables. Recall (15). We have
as . Provided that and ,
The two events in (27) are conditionally independent given the ’s; and represent the conditional probability and expectation given , respectively. Then, the probability in (27) is
Set
for and . Choose and . Let for . Then, . Using the Chebyshev inequality and Lemma 8,
as , where
Let be an independent copy of . Using (29), for any n that is large enough, we have
by repeating (29). Choose an integer , set . Lemma 5 implies that there are positive constants and , satisfying
Since , . From the equality in (31), we have that
Lemma 16.
Under the conditions of Theorem 3, take ; then,
Proof of Lemma 16.
Using the Markov inequality and Lemma 2, for , we can obtain
for and sufficiently large q, using the Borel–Cantelli lemma, we have
and using the Markov inequality and Lemma 2, for ,
for sufficiently large q, using the Borel–Cantelli lemma, we can obtain
Hence, we only need to prove
By Lemma 1, there exists an independent random variables sequence such that and have the same distribution, and we can obtain that . We can prove that
Using the Borel–Cantelli lemma, we only need to prove
Let , , where , and is an independent copy of . Thus, we only need to prove
Let be an independent normal random variables sequence with variance . By Lemma 4, we have that
for sufficiently large q. Using the Borel–Cantelli lemma,
Thus, with Lemmas 14 and 15 and the Borel–Cantelli lemma, we obtain the result. □
Proof of Theorem 3.
We can find in (4). Let , since . Using the triangle inequality and Lemmas 10 and 12, we have that
as n is large enough. Hence,
If , a.s. then , a.s. Hence, in order to prove Theorem 3, we need to show that
Take . We have that
Using Lemma 2, let for any , since , where ; for some , we have
Using the Borel–Cantelli lemma,
To prove , we need to show that
5. Examples
In certain applications, such as the construction of compressed sensing matrices, the means and are given and one is interested in
The corresponding coherence is defined by
$$\mu_n = \max_{1 \le i < j \le p} \frac{|\langle x_i, x_j \rangle|}{\|x_i\|\,\|x_j\|},$$
where $x_i$ denotes the $i$th column of the measurement matrix.
Compressed sensing is a rapidly evolving field, aiming to construct measurement matrices that enable the exact recovery of any k-sparse signal from linear measurements using computationally efficient recovery algorithms.
Two commonly employed conditions in compressed sensing are the Restricted Isometry Property (RIP) and the Mutual Incoherence Property (MIP). In this paper, the derived limiting laws can be utilized to assess the likelihood of a random matrix satisfying the MIP condition, as demonstrated by Cai and Jiang [19].
Example 1.
The MIP condition, which is frequently utilized, requires the pairwise correlations among the column vectors of the measurement matrix, denoted by $\mu_n$, to be small. It has been established that the condition
$$(2k - 1)\,\mu_n < 1$$
guarantees the exact recovery of a $k$-sparse signal $\beta$ in the noiseless case $y = X\beta$, and enables the stable recovery of a sparse signal in the noisy case $y = X\beta + z$. Here, $z$ represents an error vector that may not necessarily be random.
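The sketch below (the Gaussian ensemble and the sizes are illustrative choices of ours, not prescribed by the paper) computes the coherence of a column-normalized random matrix and the largest sparsity level $k$ compatible with the MIP condition $(2k-1)\mu_n < 1$:

```python
import math
import numpy as np

# Coherence mu of a column-normalized Gaussian matrix, and the largest k
# satisfying the MIP condition (2k - 1) * mu < 1. Sizes are illustrative.
rng = np.random.default_rng(3)
n, p = 256, 64
X = rng.standard_normal((n, p))
X /= np.linalg.norm(X, axis=0)   # rescale to unit-norm columns

G = X.T @ X                      # off-diagonal entries: column inner products
mu = float(np.abs(G[~np.eye(p, dtype=bool)]).max())

# (2k - 1) * mu < 1  <=>  k < (1/mu + 1) / 2; take the largest such integer.
k_max = math.ceil((1.0 / mu + 1.0) / 2.0) - 1
print(round(mu, 3), k_max)
```

The limiting laws for $L_n$ then quantify how large this admissible $k$ can be for random measurement matrices of given shape.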
Author Contributions
Writing–original draft, H.Z.; Writing–review & editing, Y.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by National Natural Science Foundation of China under Grant [No. 11771178, 12171198]; the Science and Technology Development Program of Jilin Province under Grant [No. 20210101467JC]; and Fundamental Research Funds for the Central Universities.
Data Availability Statement
Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Bai, Y.; Zhang, Y.; Liu, C. Moderate deviation principle for likelihood ratio test in multivariate linear regression model. J. Multivar. Anal. 2023, 194, 105139. [Google Scholar] [CrossRef]
- Jiang, T. The asymptotic distributions of the largest entries of sample correlation matrices. Ann. Appl. Probab. 2004, 14, 865–880. [Google Scholar] [CrossRef]
- Zhou, W. Asymptotic distribution of the largest off-diagonal entry of correlation matrices. Trans. Am. Math. Soc. 2007, 359, 5345–5363. [Google Scholar] [CrossRef]
- Liu, W.; Lin, Z.; Shao, Q. The asymptotic distribution and Berry-Esseen bound of a new test for independence in high dimension with an application to stochastic optimization. Ann. Appl. Probab. 2008, 18, 2337–2366. [Google Scholar] [CrossRef]
- Li, D.; Rosalsky, A. Some strong limit theorems for the largest entries of sample correlation matrices. Ann. Appl. Probab. 2006, 16, 423–447. [Google Scholar] [CrossRef]
- Li, D.; Liu, W.; Rosalsky, A. Necessary and sufficient conditions for the asymptotic distribution of the largest entry of a sample correlation matrix. Probab. Theory Relat. Fields 2010, 148, 5–35. [Google Scholar] [CrossRef]
- Li, D.; Qi, Y.; Rosalsky, A. On Jiang’s asymptotic distribution of the largest entry of a sample correlation matrix. J. Multivar. Anal. 2012, 111, 256–270. [Google Scholar] [CrossRef]
- Shao, Q.; Zhou, W. Necessary and sufficient conditions for the asymptotic distributions of coherence of ultra-high dimensional random matrices. Ann. Probab. 2014, 42, 623–648. [Google Scholar] [CrossRef]
- Liu, W.; Lin, Z. Asymptotic distributions of the largest entries of sample correlation matrices under dependence assumptions. Chin. Ann. Math. Ser. 2008, 29, 543–556. [Google Scholar]
- Zhao, H.; Zhang, Y. The asymptotic distributions of the largest entries of sample correlation matrices under an α-mixing assumption. Acta. Math. Sin.-Engl. Ser. 2022, 38, 2039–2056. [Google Scholar] [CrossRef]
- Lin, Z.; Lu, C. Limit Theory on Mixing Dependent Random Variables; Kluwer Academic Publishers: Dordrecht, The Netherland, 1997. [Google Scholar]
- Bradley, R. Approximation theorems for strongly mixing random variables. Mich. Math. J. 1983, 30, 69–81. [Google Scholar] [CrossRef]
- Kim, T. A note on moment bounds for strong mixing sequences. Stat. Probab. Lett. 1993, 16, 163–168. [Google Scholar] [CrossRef]
- Sakhanenko, A. On the accuracy of normal approximation in the invariance principle. Sib. Adv. Math. 1991, 1, 58–91. [Google Scholar]
- Li, D.; Rao, M.; Jiang, T.; Wang, X. Complete convergence and almost sure convergence of weighted sums of random variables. J. Theoret. Probab. 1995, 8, 49–76. [Google Scholar] [CrossRef]
- Chow, Y.; Teicher, H. Probability Theory, Independence, Interchangeability, Martingales, 2nd ed.; Springer: New York, NY, USA, 1988. [Google Scholar]
- Allan, G. Probability: A Graduate Course; Springer: New York, NY, USA, 2005. [Google Scholar]
- Arratia, R.; Goldstein, L.; Gordon, L. Two moments suffice for Poisson approximations: The Chen-Stein method. Ann. Probab. 1989, 17, 9–25. [Google Scholar] [CrossRef]
- Cai, T.; Jiang, T. Limiting laws of coherence of random matrices with applications to testing covariance structure and construction of compressed sensing matrices. Ann. Stat. 2011, 39, 1496–1525. [Google Scholar] [CrossRef]
- Bai, Z.; Yin, Y. Limit of the smallest eigenvalue of a large-dimensional sample covariance matrix. Ann. Probab. 1993, 21, 1275–1294. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).