Applications of the Periodogram Method for Perturbed Block Toeplitz Matrices in Statistical Signal Processing

Jesús Gutiérrez-Gutiérrez; Xabier Insausti; Marta Zárraga-Rodríguez

doi:10.3390/math8040582

Department of Biomedical Engineering and Sciences, Tecnun, University of Navarra, Paseo Manuel Lardizábal 13, 20018 San Sebastián, Spain

* Author to whom correspondence should be addressed.

Mathematics 2020, 8(4), 582; https://doi.org/10.3390/math8040582

This article belongs to the Special Issue Matrix Structures: Numerical Methods and Applications

Abstract

In this paper, we combine the periodogram method for perturbed block Toeplitz matrices with the Cholesky decomposition to give a parameter estimation method for any perturbed vector autoregressive (VAR) or vector moving average (VMA) process, when we only know a perturbed version of the sequence of correlation matrices of the process. In order to combine the periodogram method for perturbed block Toeplitz matrices with the Cholesky decomposition, we first need to generalize a known result on the Cholesky decomposition of Toeplitz matrices to perturbed block Toeplitz matrices.

Keywords:

parameter estimation; periodogram method for perturbed block Toeplitz matrices; the Cholesky decomposition; vector autoregressive (VAR) processes; vector moving average (VMA) processes

1. Introduction

The Cholesky decomposition has been widely used in statistical signal processing. For instance, it has been used for parameter estimation of vector autoregressive (VAR) processes and for parameter estimation of vector moving average (VMA) processes. Specifically, the parameters of a VAR process can be directly obtained from the Cholesky decomposition of the inverses of its correlation matrices, and the parameters of a VMA process can be directly obtained from the Cholesky decomposition of its correlation matrices. However, when real-world problems are considered, what we usually know is a perturbed version of the sequence of correlation matrices of the process involved.

In this paper, we use the Cholesky decomposition to give a parameter estimation method for any perturbed VAR or VMA process, whenever the sequence of correlation matrices of the perturbed process is asymptotically equivalent to the sequence of correlation matrices of the original process in the Gray sense [1]. Specifically, our parameter estimation method combines the Cholesky decomposition with the periodogram method for perturbed block Toeplitz matrices presented in [2]. In order to combine them, we first need to generalize a result given in [3] on the Cholesky decomposition of Toeplitz matrices to perturbed block Toeplitz matrices.

The paper is organized as follows. In Section 2, we set up notation and review the periodogram method for perturbed block Toeplitz matrices presented in [2]. In Section 3, we generalize a result given in [3] on the Cholesky decomposition of Toeplitz matrices to perturbed block Toeplitz matrices. In Section 4, we give a parameter estimation method for perturbed VAR and VMA processes. In that section, our parameter estimation method for perturbed VMA processes is also applied to another statistical signal processing problem, namely, multiple-input multiple-output (MIMO) channel identification.

2. Preliminaries

In this section, we set up notation and we review the periodogram method for perturbed block Toeplitz matrices presented in [2].

2.1. Notation

In this paper,

N

,

Z

,

R

, and

C

denote the set of natural numbers (that is, the set of positive integers), the set of integer numbers, the set of real numbers, and the set of complex numbers, respectively.

C^{M \times N}

is the set of all

M \times N

complex matrices,

I_{N}

stands for the

N \times N

identity matrix,

0_{M \times N}

denotes the

M \times N

zero matrix, and

V_{n}

is the

n \times n

Fourier unitary matrix, i.e.,

{[V_{n}]}_{j, k} : = \frac{1}{\sqrt{n}} e^{- \frac{2 π (j - 1) (k - 1)}{n} i}, j, k \in {1, \dots, n},

with

i

being the imaginary unit. We denote by

λ_{1} (A), \dots, λ_{n} (A)

the eigenvalues of an

n \times n

Hermitian matrix A arranged in decreasing order, * denotes conjugate transpose, ⊗ is the Kronecker product, E stands for expectation, and

χ_{S}

denotes the characteristic (or indicator) function of

S \subseteq R

, that is,

χ_{S} (ω) : = \{\begin{matrix} 1 & if ω \in S, \\ 0 & otherwise . \end{matrix}
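As a quick numerical check of the definition of the Fourier unitary matrix, the following sketch builds V_n with NumPy and measures how far V_n^* V_n is from the identity. The function name `fourier_unitary` and the size n = 8 are illustrative choices that do not appear in the paper.

```python
import numpy as np

def fourier_unitary(n):
    """n x n Fourier unitary matrix: [V_n]_{j,k} = exp(-2*pi*(j-1)*(k-1)*i/n) / sqrt(n)."""
    idx = np.arange(n)  # zero-based, so idx plays the role of j-1 and k-1
    return np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)

n = 8  # illustrative size
V = fourier_unitary(n)
# V_n is unitary, so V_n^* V_n should equal I_n up to rounding error.
unitarity_error = np.linalg.norm(V.conj().T @ V - np.eye(n))
print(unitarity_error)
```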

If

x_{k} \in C^{N \times 1}

for all

k \in {1, \dots, n}

, then

x_{n : 1}

is the

n N

-dimensional vector given by

x_{n : 1} = (\begin{matrix} x_{n} \\ x_{n - 1} \\ ⋮ \\ x_{1} \end{matrix}) .

If

x_{n}

is a (complex) random N-dimensional vector for all

n \in N

,

{x_{n}}

denotes the corresponding (complex) random N-dimensional vector process.

Let

A_{n}

and

B_{n}

be

n M \times n N

matrices for all

n \in N

. We write

{A_{n}} \sim {B_{n}}

if the sequences

{A_{n}}

and

{B_{n}}

are asymptotically equivalent (i.e.,

{∥ A_{n} ∥_{2}}

and

{∥ B_{n} ∥_{2}}

are bounded and

\lim_{n \to \infty} \frac{∥ A_{n} - B_{n} ∥_{F}}{\sqrt{n}} = 0

with

{∥ \cdot ∥}_{2}

and

{∥ \cdot ∥}_{F}

being the spectral norm and the Frobenius norm, respectively). We recall that the concept of asymptotically equivalent sequences of matrices was introduced by Gray in [1] for the case in which

M = N = 1

.
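To make the normalized Frobenius condition concrete, the sketch below evaluates ∥A_n - B_n∥_F / √n in the scalar case M = N = 1 for a tridiagonal Toeplitz matrix B_n and a copy A_n in which a single entry has been corrupted. The matrices, the perturbation, and the function names are hypothetical and serve only as an illustration.

```python
import numpy as np

def normalized_gap(A, B):
    """Return ||A - B||_F / sqrt(n) for two n x n matrices (case M = N = 1)."""
    n = A.shape[0]
    return np.linalg.norm(A - B, "fro") / np.sqrt(n)

def perturbed_pair(n):
    """Tridiagonal Toeplitz matrix B_n and a copy A_n with one corrupted entry."""
    B = 2.0 * np.eye(n) + np.eye(n, k=1) + np.eye(n, k=-1)
    A = B.copy()
    A[-1, -1] += 1.0  # hypothetical perturbation of a single entry
    return A, B

# Here ||A_n - B_n||_F = 1 for every n, so the gap equals 1/sqrt(n).
gaps = [normalized_gap(*perturbed_pair(n)) for n in (10, 100, 1000)]
print(gaps)
```

In this toy case both spectral norms stay bounded and the gap tends to zero, so the two sequences are asymptotically equivalent even though A_n never equals B_n.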

If

F : R \to C^{M \times N}

is a continuous

2 π

-periodic function, we denote by

T_{n} (F)

the block Toeplitz matrix generated by F whose blocks are given by

{[T_{n} (F)]}_{j, k} : = F_{j - k}, n \in N, j, k \in {1, \dots, n},

where

{F_{k}}_{k \in Z}

is the sequence of Fourier coefficients of F, that is,

F_{k} : = \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} F (ω) d ω \forall k \in Z .
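The block structure [T_n(F)]_{j,k} = F_{j-k} can be sketched in a few lines of NumPy. The helper `block_toeplitz` and the three Fourier coefficients below are assumptions made for illustration; coefficients that are not listed are treated as zero blocks.

```python
import numpy as np

def block_toeplitz(coeffs, n):
    """Build T_n(F) from a dict {k: F_k}; missing k are treated as zero blocks."""
    M, N = next(iter(coeffs.values())).shape
    T = np.zeros((n * M, n * N), dtype=complex)
    for j in range(n):
        for k in range(n):
            if (j - k) in coeffs:
                # Block (j+1, k+1) in the paper's one-based indexing is F_{j-k}.
                T[j*M:(j+1)*M, k*N:(k+1)*N] = coeffs[j - k]
    return T

# Hypothetical coefficients of F(w) = F_{-1} e^{-wi} + F_0 + F_1 e^{wi}, with M = N = 2.
F = {
    -1: np.array([[0.0, 1.0], [0.0, 0.0]]),
    0: np.array([[2.0, 0.0], [0.0, 3.0]]),
    1: np.array([[0.5, 0.0], [1.0, 0.5]]),
}
T3 = block_toeplitz(F, 3)
print(T3.real)
```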

2.2. The Periodogram Method for Perturbed Block Toeplitz Matrices

The following theorem, which was given in ([2], Theorem 4), provides a method to estimate the generating function F when we only know a perturbed version of the sequence of block Toeplitz matrices

{T_{n} (F)}

, namely, we only know a sequence of matrices

{A_{n}}

which is asymptotically equivalent to

{T_{n} (F)}

.

Theorem 1.

Let

A_{n}

be an

n M \times n N

matrix for all

n \in N

. Suppose that there exists a continuous

2 π

-periodic function

F : R \to C^{M \times N}

such that

\lim_{n \to \infty} \frac{∥ A_{n} - T_{n} {(F) ∥}_{F}}{\sqrt{n}} = 0

. Then

\lim_{n \to \infty} \frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{A_{n}} (ω) - F (ω) ∥}_{F}^{2} d ω = 0,

(1)

where

{\hat{P}}_{A_{n}} : R \to C^{M \times N}

is the

2 π

-periodic step function given by

{\hat{P}}_{A_{n}} (ω) : = \sum_{k = 1}^{n} χ_{[\frac{2 π (k - 1)}{n}, \frac{2 π k}{n})} (ω) {[{(V_{n} \otimes I_{M})}^{*} A_{n} (V_{n} \otimes I_{N})]}_{k, k} \forall ω \in [0, 2 π) .

Moreover, if F is a trigonometric polynomial there exists

K \in [0, \infty)

such that

\sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{A_{n}} (ω) - F (ω) ∥}_{F}^{2} d ω} \leq \frac{∥ A_{n} - T_{n} {(F) ∥}_{F}}{\sqrt{n}} + \frac{K}{\sqrt{n}} \forall n \in N .

The estimation method of the generating function F provided in Theorem 1 consists of the sequence of functions

{{\hat{P}}_{A_{n}}}

. Observe that from Equation (1) the squared error made, when F is estimated (approximated) by

{\hat{P}}_{A_{n}}

, tends to zero as n grows.
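A minimal numerical sketch of the estimator in Theorem 1 follows. It assumes a scalar generating function F(ω) = 2 + 2 cos ω and a perturbation of one entry of T_n(F), both chosen only for illustration, and it approximates the integral in Equation (1) with a midpoint rule inside each step of the estimator.

```python
import numpy as np

def periodogram_blocks(A, n, M, N):
    """Diagonal blocks of (V_n kron I_M)^* A_n (V_n kron I_N).

    Block h (zero-based) is the value of the step function on
    [2*pi*h/n, 2*pi*(h+1)/n).
    """
    idx = np.arange(n)
    V = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
    B = np.kron(V, np.eye(M)).conj().T @ A @ np.kron(V, np.eye(N))
    return np.array([B[h*M:(h+1)*M, h*N:(h+1)*N] for h in range(n)])

def mean_squared_error(n, samples_per_step=20):
    """Approximate (1/2pi) * integral of |P_hat - F|^2 for a scalar test case."""
    # Hypothetical generating function F(w) = 2 + 2 cos(w): F_0 = 2, F_1 = F_{-1} = 1.
    T = 2.0 * np.eye(n) + np.eye(n, k=1) + np.eye(n, k=-1)
    A = T.copy()
    A[-1, -1] += 1.0  # hypothetical perturbation
    P = periodogram_blocks(A, n, 1, 1)[:, 0, 0]
    total = 0.0
    for h in range(n):
        # Midpoint samples inside the h-th step of the estimator.
        w = 2 * np.pi * (h + (np.arange(samples_per_step) + 0.5) / samples_per_step) / n
        total += np.mean(np.abs(P[h] - (2.0 + 2.0 * np.cos(w))) ** 2)
    return total / n  # each step has length 2*pi/n, so (1/2pi) * (2*pi/n) = 1/n

errors = {n: mean_squared_error(n) for n in (16, 128)}
print(errors)
```

Under these assumptions the approximated error for n = 128 is smaller than for n = 16, which is the behaviour Equation (1) predicts.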

The correlation matrix of a random vector is a positive semidefinite matrix. Furthermore, if A is a positive semidefinite matrix, then there exists a zero-mean random vector whose correlation matrix is A. Therefore,

{T_{n} (F)}

is a sequence of positive semidefinite matrices if and only if

{T_{n} (F)}

is the sequence of correlation matrices of a certain wide-sense stationary (WSS) N-dimensional vector process (we recall that a random vector process

{x_{n}}

is said to be WSS if its correlation matrices

E (x_{n : 1} x_{n : 1}^{*})

are block Toeplitz and its random vectors

x_{n}

have the same mean). If

{T_{n} (F)}

is the sequence of correlation matrices of a WSS vector process, the generating function F is called the power spectral density (PSD) of the process. Therefore, Theorem 1 provides a method to estimate the PSD (a spectral estimation method) of any WSS vector process, when we only know a perturbed version of its sequence of correlation matrices. This spectral estimation method is a modified version of the (averaged) periodogram method, because if

N = 1

then

\begin{matrix} {\hat{P}}_{T_{n}} (\frac{2 π (h - 1)}{n}) = {[V_{n}^{*} T_{n} V_{n}]}_{h, h} = \sum_{k = 1}^{n} {[V_{n}^{*} T_{n}]}_{h, k} {[V_{n}]}_{k, h} = \sum_{k = 1}^{n} {[V_{n}]}_{k, h} \sum_{j = 1}^{n} {[V_{n}^{*}]}_{h, j} {[T_{n}]}_{j, k} \\ = \sum_{j, k = 1}^{n} {[V_{n}]}_{k, h} \bar{{[V_{n}]}_{j, h}} {[T_{n}]}_{j, k} = \frac{1}{n} \sum_{j, k = 1}^{n} e^{\frac{2 π (j - k) (h - 1)}{n} i} {[T_{n}]}_{j, k} = P_{T_{n}} (\frac{2 π (h - 1)}{n}), n \in N, h \in {1, \dots, n}, \end{matrix}

where

{P_{T_{n}}}

is the conventional spectral estimator, which is also known as the method of (averaged) periodogram or as the Bartlett method (see, e.g., [4]), defined as

P_{T_{n}} (ω) : = \frac{1}{n} \sum_{j, k = 1}^{n} e^{(j - k) ω i} {[T_{n}]}_{j, k}, n \in N, ω \in R .
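The identity between the modified estimator and the conventional one at the frequencies 2π(h-1)/n can be checked numerically. The sketch below uses a randomly generated real symmetric Toeplitz matrix; the seed, the size, and the function name `bartlett` are arbitrary illustrative choices.

```python
import numpy as np

def bartlett(T, omega):
    """Conventional estimator P_{T_n}(w) = (1/n) sum_{j,k} exp((j-k) w i) [T_n]_{j,k}."""
    n = T.shape[0]
    idx = np.arange(n)
    phase = np.exp(1j * omega * (idx[:, None] - idx[None, :]))  # entry (j, k) is exp((j-k) w i)
    return np.sum(phase * T) / n

n = 6
rng = np.random.default_rng(0)  # arbitrary seed, used only for reproducibility
c = rng.standard_normal(n)
T = np.array([[c[abs(j - k)] for k in range(n)] for j in range(n)])  # real symmetric Toeplitz

idx = np.arange(n)
V = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
modified = np.diag(V.conj().T @ T @ V)  # values of P_hat at 2*pi*(h-1)/n, h = 1, ..., n
conventional = np.array([bartlett(T, 2 * np.pi * h / n) for h in range(n)])
max_difference = np.max(np.abs(modified - conventional))
print(max_difference)
```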

3. A Note on the Cholesky Decomposition of Perturbed Block Toeplitz Matrices

We recall that if A is an

n \times n

positive definite matrix, then there exists a unique

n \times n

lower triangular matrix L with

{[L]}_{j, j} > 0

for all

j \in {1, \dots, n}

satisfying that

A = L L^{*}

. This decomposition of A (

A = L L^{*}

) is called the Cholesky decomposition of A. In ([3], Section 6.3) Gray gave a result on the Cholesky decomposition of Toeplitz matrices. The following theorem generalizes this result to perturbed block Toeplitz matrices. Furthermore, unlike in ([3], Section 6.3) we also give the convergence speed of our result.

Theorem 2.

Consider a continuous

2 π

-periodic function

F : R \to C^{N \times N}

whose sequence of Fourier coefficients

{F_{k}}_{k \in Z}

satisfies that

F_{0}

is lower triangular with

{[F_{0}]}_{j, j} > 0

for all

j \in {1, \dots, N}

and

F_{- k} = 0_{N \times N}

for all

k \in N

. Suppose that

A_{n}

is an

n N \times n N

positive definite matrix for all

n \in N

with

{A_{n}} \sim {T_{n} (F) {(T_{n} (F))}^{*}}

(or equivalently,

{A_{n}} \sim {T_{n} (F F^{*})}

, where

F F^{*} (ω) = F (ω) {(F (ω))}^{*}

,

ω \in R

). Let

A_{n} = L_{n} L_{n}^{*}

be the Cholesky decomposition of

A_{n}

for all

n \in N

. If

{L_{n}}

and

{T_{n} (F)}

are stable (that is,

{∥ L_{n}^{- 1} ∥_{2}}

and

{∥ {(T_{n} (F))}^{- 1} ∥_{2}}

are bounded) then

{L_{n}} \sim {T_{n} (F)} .

(2)

Moreover, there exists

K \in [0, \infty)

such that

\frac{∥ L_{n} - T_{n} {(F) ∥}_{F}}{\sqrt{n}} \leq K \frac{∥ A_{n} - T_{n} (F) {(T_{n} (F))}^{*} ∥_{F}}{\sqrt{n}} \forall n \in N .

(3)

Proof.

Applying ([5], Lemma 4.2) and ([5], Theorem 6.2) yields

{T_{n} (F) {(T_{n} (F))}^{*}} = {T_{n} (F) T_{n} (F^{*})} \sim {T_{n} (F F^{*})}

(we recall that ([5], Theorem 6.2) was previously given for Hermitian generating functions (see, e.g., [6,7], or ([8], Theorem 2))). Hence, since the relation ∼ is symmetric and transitive (see ([5], Lemma 3.1)),

{A_{n}} \sim {T_{n} (F) {(T_{n} (F))}^{*}}

if and only if

{A_{n}} \sim {T_{n} (F F^{*})}

.

The sequence

{∥ T_{n} (F) ∥_{2}}

is bounded (see, e.g., ([5], Theorem 4.3) or ([9], Corollary 4.2)). As

{∥ A_{n} ∥_{2}}

is bounded and

{∥ L_{n} ∥_{2}} = {∥ L_{n}^{*} ∥_{2}} = \{\sqrt{λ_{1} (L_{n} L_{n}^{*})}\} = \{\sqrt{∥ L_{n} L_{n}^{*} ∥_{2}}\} = \{\sqrt{∥ A_{n} ∥_{2}}\},

{∥ L_{n} ∥_{2}}

is also bounded. Consequently, to finish the proof we only need to show Equation (3), or equivalently, we only need to show that there exists

K \in [0, \infty)

such that

∥ L_{n} - T_{n} {(F) ∥}_{F} \leq K {∥ A_{n} - T_{n} (F) {(T_{n} (F))}^{*} ∥}_{F} \forall n \in N .

(4)

We have

\begin{matrix} ∥ L_{n} - T_{n} {(F) ∥}_{F} & = ∥ T_{n} (F) {(T_{n} (F))}^{- 1} L_{n} - T_{n} {(F) ∥}_{F} \\ \leq ∥ T_{n} {(F) ∥}_{2} ∥ {(T_{n} (F))}^{- 1} L_{n} - I_{n N} ∥_{F} \leq ∥ T_{n} {(F) ∥}_{2} (∥ {(T_{n} (F))}^{- 1} L_{n} - D_{n} ∥_{F} + ∥ D_{n} - I_{n N} ∥_{F}), \end{matrix}

(5)

where

D_{n}

denotes the

n N \times n N

diagonal matrix satisfying that

{[D_{n}]}_{j, j} = {[{(T_{n} (F))}^{- 1} L_{n}]}_{j, j}

for all

j \in {1, \dots, n N}

and

n \in N

. Since

T_{n} (F)

is lower triangular for all

n \in N

,

{(T_{n} (F))}^{- 1}

is lower triangular for all

n \in N

(see, e.g., ([10], p. 44)), and therefore,

1 = {[I_{n N}]}_{j, j} = {[T_{n} (F) {(T_{n} (F))}^{- 1}]}_{j, j} = \sum_{k = 1}^{n N} {[T_{n} (F)]}_{j, k} {[{(T_{n} (F))}^{- 1}]}_{k, j} = {[T_{n} (F)]}_{j, j} {[{(T_{n} (F))}^{- 1}]}_{j, j}

for all

j \in {1, \dots, n N}

and

n \in N

. Thus,

{[D_{n}]}_{j, j} = {[{(T_{n} (F))}^{- 1} L_{n}]}_{j, j} = \sum_{k = 1}^{n N} {[{(T_{n} (F))}^{- 1}]}_{j, k} {[L_{n}]}_{k, j} = {[{(T_{n} (F))}^{- 1}]}_{j, j} {[L_{n}]}_{j, j} = \frac{{[L_{n}]}_{j, j}}{{[T_{n} (F)]}_{j, j}} > 0

for all

j \in {1, \dots, n N}

and

n \in N

, and hence,

\begin{matrix} ∥ D_{n} - I_{n N} ∥_{F} \\ = \sqrt{\sum_{j = 1}^{n N} {| {[D_{n} - I_{n N}]}_{j, j} |}^{2}} = \sqrt{\sum_{j = 1}^{n N} {| {[D_{n}]}_{j, j} - 1 |}^{2}} = \sqrt{\sum_{j = 1}^{n N} {|\frac{{({[D_{n}]}_{j, j})}^{2} - 1}{{[D_{n}]}_{j, j} + 1}|}^{2}} = \sqrt{\sum_{j = 1}^{n N} \frac{| {[D_{n} D_{n}]}_{j, j} {- 1 |}^{2}}{{({[D_{n}]}_{j, j} + 1)}^{2}}} \\ \leq \sqrt{\sum_{j = 1}^{n N} {| {[D_{n} D_{n}]}_{j, j} - 1 |}^{2}} = ∥ D_{n} D_{n} - I_{n N} ∥_{F} = ∥ D_{n} D_{n}^{*} - I_{n N} ∥_{F} = {∥ D_{n} D_{n}^{*} - {(T_{n} (F))}^{- 1} L_{n} {(L_{n})}^{- 1} T_{n} (F) ∥}_{F} \\ \leq ∥ D_{n} D_{n}^{*} - {(T_{n} (F))}^{- 1} L_{n} D_{n}^{*} ∥_{F} + {∥ {(T_{n} (F))}^{- 1} L_{n} D_{n}^{*} - {(T_{n} (F))}^{- 1} L_{n} {(L_{n})}^{- 1} T_{n} (F) ∥}_{F} \\ \leq ∥ D_{n} - {(T_{n} (F))}^{- 1} L_{n} ∥_{F} ∥ D_{n}^{*} ∥_{2} + ∥ {(T_{n} (F))}^{- 1} L_{n} ∥_{2} {∥ D_{n}^{*} - {(L_{n})}^{- 1} T_{n} (F) ∥}_{F} \\ = ∥ D_{n} - {(T_{n} (F))}^{- 1} L_{n} ∥_{F} ∥ D_{n} ∥_{2} + ∥ {(T_{n} (F))}^{- 1} L_{n} ∥_{2} {∥ D_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥}_{F} \\ \leq ∥ D_{n} - {(T_{n} (F))}^{- 1} L_{n} ∥_{F} {∥ D_{n} ∥}_{2} \\ + ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} ∥_{2} (∥ D_{n} - {(T_{n} (F))}^{- 1} L_{n} ∥_{F} + ∥ {(T_{n} (F))}^{- 1} L_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥_{F}) \forall n \in N . \end{matrix}

(6)

As

T_{n} (F)

and

L_{n}

are lower triangular for all

n \in N

,

{(T_{n} (F))}^{- 1} L_{n}

and

L_{n}^{- 1} T_{n} (F)

are lower triangular for all

n \in N

(see, e.g., ([11], p. 240)). Consequently,

{(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1}

is upper triangular for all

n \in N

, and therefore,

\begin{matrix} ∥ {(T_{n} (F))}^{- 1} L_{n} - D_{n} ∥_{F} = \sqrt{∥ {(T_{n} (F))}^{- 1} L_{n} - D_{n} ∥_{F}^{2}} \leq \sqrt{∥ {(T_{n} (F))}^{- 1} L_{n} - D_{n} ∥_{F}^{2} + {∥ D_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥}_{F}^{2}} \\ = \sqrt{∥ {(T_{n} (F))}^{- 1} L_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥_{F}^{2}} = {∥ {(T_{n} (F))}^{- 1} L_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥}_{F} \forall n \in N . \end{matrix}

(7)

Combining Equations (5), (6), and (7) yields

\begin{matrix} ∥ L_{n} - T_{n} {(F) ∥}_{F} \\ \leq ∥ T_{n} {(F) ∥}_{2} (1 + ∥ D_{n} ∥_{2} + 2 ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} ∥_{2}) ∥ {(T_{n} (F))}^{- 1} L_{n} - {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥_{F} \\ \leq ∥ T_{n} {(F) ∥}_{2} (1 + ∥ D_{n} ∥_{2} + 2 ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} ∥_{2}) ∥ {(T_{n} (F))}^{- 1} ∥_{2} {∥ L_{n} - T_{n} (F) {(T_{n} (F))}^{*} {(L_{n}^{*})}^{- 1} ∥}_{F} \\ \leq ∥ T_{n} {(F) ∥}_{2} (1 + ∥ D_{n} ∥_{2} + 2 ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} ∥_{2}) ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} L_{n}^{*} - T_{n} (F) {(T_{n} (F))}^{*} ∥_{F} {∥ {(L_{n}^{*})}^{- 1} ∥}_{2} \\ = K_{n} {∥ A_{n} - T_{n} (F) {(T_{n} (F))}^{*} ∥}_{F} \end{matrix}

with

K_{n} = ∥ T_{n} {(F) ∥}_{2} (1 + ∥ D_{n} ∥_{2} + 2 ∥ {(T_{n} (F))}^{- 1} ∥_{2} ∥ L_{n} ∥_{2}) ∥ {(T_{n} (F))}^{- 1} ∥_{2} {∥ L_{n}^{- 1} ∥}_{2}

for all

n \in N

. To prove Equation (4) we only need to show that

{K_{n}}

is bounded, or equivalently, we only need to show that

{∥ D_{n} ∥_{2}}

is bounded. For every

n \in N

there exists

j_{0} \in {1, \dots, n N}

such that

∥ D_{n} ∥_{2} = λ_{1} (D_{n}) = {[D_{n}]}_{j_{0}, j_{0}} = \frac{∥ D_{n} e_{j_{0}} ∥_{F}}{∥ e_{j_{0}} ∥_{F}} \leq \frac{∥ {(T_{n} (F))}^{- 1} L_{n} e_{j_{0}} ∥_{F}}{∥ e_{j_{0}} ∥_{F}} \leq ∥ {(T_{n} (F))}^{- 1} L_{n} ∥_{2} \leq ∥ {(T_{n} (F))}^{- 1} ∥_{2} {∥ L_{n} ∥}_{2},

where

e_{j_{0}}

is the

n N

-dimensional (column) vector whose entries are given by

{[e_{j_{0}}]}_{j, 1} = δ_{j, j_{0}}

,

j \in {1, \dots, n N}

, with

δ

being the Kronecker delta. Thus,

{∥ D_{n} ∥_{2}}

is bounded. □

Observe that Equation (3) shows that the sequence

\{\frac{∥ L_{n} - T_{n} {(F) ∥}_{F}}{\sqrt{n}}\}

converges to zero at least as fast as the sequence

\{\frac{∥ A_{n} - T_{n} (F) {(T_{n} (F))}^{*} ∥_{F}}{\sqrt{n}}\}

does.
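The behaviour described by Equations (2) and (3) can be observed on a small example. The sketch below assumes a real generating function with F_0 lower triangular with positive diagonal and a single nonzero coefficient F_1, and it perturbs the last diagonal block of T_n(F)(T_n(F))^*; all numerical values and function names are hypothetical.

```python
import numpy as np

def block_toeplitz(coeffs, n):
    """T_n(F) from a dict {k: F_k}; missing coefficients are zero blocks."""
    N = next(iter(coeffs.values())).shape[0]
    T = np.zeros((n * N, n * N))
    for j in range(n):
        for k in range(n):
            if (j - k) in coeffs:
                T[j*N:(j+1)*N, k*N:(k+1)*N] = coeffs[j - k]
    return T

# Hypothetical F with F_0 lower triangular (positive diagonal) and F_{-k} = 0 for k >= 1.
F = {0: np.array([[2.0, 0.0], [1.0, 1.0]]),
     1: np.array([[0.3, -0.3], [0.15, 0.3]])}

def cholesky_gaps(n):
    """Return (||L_n - T_n(F)||_F, ||A_n - T_n(F) T_n(F)^*||_F), both divided by sqrt(n)."""
    T = block_toeplitz(F, n)          # real matrix, so T.T is its conjugate transpose
    A = T @ T.T
    A[-2:, -2:] += np.eye(2)          # hypothetical perturbation of the last diagonal block
    L = np.linalg.cholesky(A)         # lower triangular factor with A = L L^*
    return (np.linalg.norm(L - T) / np.sqrt(n),
            np.linalg.norm(A - T @ T.T) / np.sqrt(n))

for n in (8, 64):
    print(n, cholesky_gaps(n))
```

In this example both normalized quantities shrink as n grows, in line with the statement that the left-hand side of Equation (3) converges to zero at least as fast as the right-hand side.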

Equation (2) generalizes ([3], Section 6.3). Specifically, in ([3], Section 6.3) Gray proved Equation (2), but only for the special case in which

N = 1

, F is in the Wiener class, and

{A_{n}} = {T_{n} (F F^{*})}

(or equivalently,

{A_{n}} = {T_{n} ({| F |}^{2})}

). It should be mentioned that unlike here, the convergence speed of

\{\frac{∥ L_{n} - T_{n} {(F) ∥}_{F}}{\sqrt{n}}\}

was not given in ([3], Section 6.3) for the special case there studied.

4. Applications of the Periodogram Method in Parameter Estimation

Using Theorems 1 and 2 we give in this section a parameter estimation method for perturbed VAR processes and another for perturbed VMA processes. These methods can be applied in any real-world problem where the random process involved is modeled as a VAR process or as a VMA process, e.g., in damage detection for aeronautical structures or in MIMO channel identification.

4.1. Parameter Estimation Method for Perturbed VAR Processes

We begin by reviewing the concept of VAR process.

Definition 1.

A zero-mean random N-dimensional vector process

{x_{n}}

is said to be a VAR process if

x_{n} = w_{n} - \sum_{k = 1}^{n - 1} F_{- k} x_{n - k} \forall n \in N,

(8)

where

F_{- k} \in C^{N \times N}

for all

k \in N

, and

{w_{n}}

is a zero-mean random N-dimensional vector process whose sequence of correlation matrices is given by

{E (w_{n : 1} w_{n : 1}^{*})} = {T_{n} (Λ)}

with Λ being an

N \times N

positive definite matrix. If there exists

p \in N

such that

F_{- k} = 0_{N \times N}

for all

k > p

, then

{x_{n}}

is called a VAR process of (finite) order p or a VAR

(p)

process.

Let

{x_{n}}

be as in Definition 1. Assume that

{F_{k}}_{k \in Z}

, with

F_{0} = I_{N}

and

F_{k} = 0_{N \times N}

for all

k \in N

, is the sequence of Fourier coefficients of a continuous

2 π

-periodic function

F : R \to C^{N \times N}

. From Equation (8) we can obtain (see, e.g., ([12], Equation (20)))

{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} = T_{n} (F^{*}) T_{n} (Λ^{- 1}) T_{n} (F) \forall n \in N .

If

Λ^{- 1} = L_{Λ^{- 1}} L_{Λ^{- 1}}^{*}

is the Cholesky decomposition of the positive definite matrix

Λ^{- 1}

, then

{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} = T_{n} (F^{*} L_{Λ^{- 1}}) {(T_{n} (F^{*} L_{Λ^{- 1}}))}^{*}

(9)

is the Cholesky decomposition of the positive definite matrix

{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1}

for all

n \in N

, since

\begin{matrix} {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} & = T_{n} (F^{*}) T_{n} (L_{Λ^{- 1}} L_{Λ^{- 1}}^{*}) T_{n} (F) = T_{n} (F^{*}) T_{n} (L_{Λ^{- 1}}) T_{n} (L_{Λ^{- 1}}^{*}) T_{n} (F) \\ = T_{n} (F^{*} L_{Λ^{- 1}}) T_{n} (L_{Λ^{- 1}}^{*} F) = T_{n} (F^{*} L_{Λ^{- 1}}) T_{n} ({(F^{*} L_{Λ^{- 1}})}^{*}) \forall n \in N . \end{matrix}

Observe that if we know the correlation matrix

E (x_{n : 1} x_{n : 1}^{*})

for a certain

n \in N

, then the Cholesky decomposition of

{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1}

provides

Λ

and the parameters

F_{- 1}, \dots, F_{1 - n}

of the VAR process, because

T_{n} (F^{*} L_{Λ^{- 1}}) = (\begin{matrix} L_{Λ^{- 1}} & 0_{N \times N} & 0_{N \times N} & \dots & 0_{N \times N} \\ F_{- 1}^{*} L_{Λ^{- 1}} & L_{Λ^{- 1}} & 0_{N \times N} & \dots & 0_{N \times N} \\ F_{- 2}^{*} L_{Λ^{- 1}} & F_{- 1}^{*} L_{Λ^{- 1}} & L_{Λ^{- 1}} & \dots & 0_{N \times N} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ F_{1 - n}^{*} L_{Λ^{- 1}} & F_{2 - n}^{*} L_{Λ^{- 1}} & F_{3 - n}^{*} L_{Λ^{- 1}} & \dots & L_{Λ^{- 1}} \end{matrix}) .

(10)
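The recovery of Λ and F_{-1} from the first block column of the matrix in Equation (10) can be sketched as follows for an unperturbed VAR(1) process. The numerical values are those of Example 1; the variable names and the choice n = 5 are illustrative assumptions.

```python
import numpy as np

N, n = 2, 5
Lam = np.array([[4.0, 1.0], [1.0, 2.0]])        # Lambda of Example 1
F_m1 = np.array([[-0.8, -0.7], [0.4, -0.6]])    # F_{-1} of Example 1

# T_n(F): identity blocks on the diagonal and F_{-1} on the first block superdiagonal,
# because block (j, k) equals F_{j-k} and F_k = 0 for k >= 1.
T = np.kron(np.eye(n), np.eye(N)) + np.kron(np.eye(n, k=1), F_m1)

# Inverse correlation matrix T_n(F^*) T_n(Lam^{-1}) T_n(F); T is real, so T.T = T^*.
inv_corr = T.T @ np.kron(np.eye(n), np.linalg.inv(Lam)) @ T

L = np.linalg.cholesky(inv_corr)   # lower triangular Cholesky factor
L00 = L[:N, :N]                    # block (1,1) of Equation (10): L_{Lam^{-1}}
L10 = L[N:2*N, :N]                 # block (2,1) of Equation (10): F_{-1}^* L_{Lam^{-1}}

Lam_rec = np.linalg.inv(L00 @ L00.T)        # (L L^*)^{-1} = Lam
F_m1_rec = (L10 @ np.linalg.inv(L00)).T     # (F_{-1}^*)^* = F_{-1}
print(Lam_rec)
print(F_m1_rec)
```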

However, in practice what we usually know is a perturbed version

{A_{n}}

of the sequence of correlation matrices

{E (x_{n : 1} x_{n : 1}^{*})}

of the process. The following theorem allows us to estimate

Λ

and the parameters

{F_{- k}}_{k \in N}

of the VAR process from the Cholesky decomposition of the matrices of the sequence

{A_{n}^{- 1}}

, when

{A_{n}} \sim {E (x_{n : 1} x_{n : 1}^{*})}

.

Theorem 3.

Let

{x_{n}}

be as in Definition 1. Assume that

{F_{k}}_{k \in Z}

, with

F_{0} = I_{N}

and

F_{k} = 0_{N \times N}

for all

k \in N

, is the sequence of Fourier coefficients of a continuous

2 π

-periodic function

F : R \to C^{N \times N}

. Suppose that

A_{n}

is an

n N \times n N

positive definite matrix for all

n \in N

satisfying that

{A_{n}}

is stable and

{A_{n}} \sim {E (x_{n : 1} x_{n : 1}^{*})}

. Let

A_{n}^{- 1} = L_{n} L_{n}^{*}

be the Cholesky decomposition of

A_{n}^{- 1}

for all

n \in N

. Then

\lim_{n \to \infty} \frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}} ∥}_{F}^{2} d ω = 0

(11)

and

{∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - F_{- k}^{*} L_{Λ^{- 1}}∥}_{F}^{2} \leq \frac{1}{2 π} \int_{0}^{2 π} {∥{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}∥}_{F}^{2} d ω

for all

n \in N

and

k \in {0, 1, \dots, n - 1}

, where

Λ^{- 1} = L_{Λ^{- 1}} L_{Λ^{- 1}}^{*}

is the Cholesky decomposition of

Λ^{- 1}

. Moreover, if

{x_{n}}

is of finite order there exist

K_{1}, K_{2} \in [0, \infty)

such that

\sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}} ∥}_{F}^{2} d ω} \leq K_{1} \frac{∥ A_{n} - E (x_{n : 1} x_{n : 1}^{*}) ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \forall n \in N .

Proof.

Since

A_{n}

is a positive definite matrix for all

n \in N

,

A_{n}^{- 1}

is a positive definite matrix for all

n \in N

. From ([12], Equation (20)) and ([5], Lemma 4.2) we have

\begin{matrix} {∥{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1}∥}_{2} & = ∥ T_{n} (F^{*}) T_{n} (Λ^{- 1}) T_{n} (F) ∥_{2} \leq ∥ T_{n} (F^{*}) ∥_{2} ∥ T_{n} (Λ^{- 1}) ∥_{2} {∥ T_{n} (F) ∥}_{2} \\ = ∥ {(T_{n} (F))}^{*} ∥_{2} ∥ Λ^{- 1} ∥_{2} ∥ T_{n} {(F) ∥}_{2} = ∥ Λ^{- 1} ∥_{2} {∥ T_{n} (F) ∥}_{2}^{2} \forall n \in N . \end{matrix}

Hence, as

{∥ T_{n} (F) ∥_{2}}

is bounded (see, e.g., ([5], Theorem 4.3) or ([9], Corollary 4.2)),

\{{∥{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1}∥}_{2}\}

is also bounded. Consequently, applying ([13], Lemma A1) and Equation (9) yields

\{A_{n}^{- 1}\} \sim \{{(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1}\} = \{T_{n} (F^{*} L_{Λ^{- 1}}) {(T_{n} (F^{*} L_{Λ^{- 1}}))}^{*}\} .

As

{∥ A_{n} ∥_{2}}

and

{∥ E (x_{n : 1} x_{n : 1}^{*}) ∥_{2}}

are bounded, the sequences

{∥ L_{n}^{- 1} ∥_{2}} = \{\sqrt{λ_{1} ({(L_{n}^{- 1})}^{*} L_{n}^{- 1})}\} = \{\sqrt{{∥{(L_{n}^{- 1})}^{*} L_{n}^{- 1}∥}_{2}}\} = \{\sqrt{{∥{(L_{n} L_{n}^{*})}^{- 1}∥}_{2}}\} = \{\sqrt{{∥A_{n}∥}_{2}}\}

and

{∥ {(T_{n} (F^{*} L_{Λ^{- 1}}))}^{- 1} ∥_{2}} = \{\sqrt{{∥{(T_{n} (F^{*} L_{Λ^{- 1}}) {(T_{n} (F^{*} L_{Λ^{- 1}}))}^{*})}^{- 1}∥}_{2}}\} = \{\sqrt{{∥E (x_{n : 1} x_{n : 1}^{*})∥}_{2}}\}

are also bounded. Thus, from Theorem 2 we have that

{L_{n}} \sim {T_{n} (F^{*} L_{Λ^{- 1}})}

and that there exists

K \in [0, \infty)

such that

\frac{∥ L_{n} - T_{n} (F^{*} L_{Λ^{- 1}}) ∥_{F}}{\sqrt{n}} \leq K \frac{∥ A_{n}^{- 1} - T_{n} (F^{*} L_{Λ^{- 1}}) {(T_{n} (F^{*} L_{Λ^{- 1}}))}^{*} ∥_{F}}{\sqrt{n}} = K \frac{∥ A_{n}^{- 1} - {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} ∥_{F}}{\sqrt{n}}

for all

n \in N

. Hence, applying Theorem 1 we conclude that Equation (11) holds.

Applying the Schwarz inequality (see, e.g., ([14], p. 139)) yields

\begin{matrix} {∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - F_{- k}^{*} L_{Λ^{- 1}}∥}_{F} & = {∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {(F (ω))}^{*} d ω L_{Λ^{- 1}}∥}_{F} \\ = \frac{1}{2 π} {∥\int_{0}^{2 π} e^{- k ω i} ({\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}) d ω∥}_{F} \\ = \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} {|{[\int_{0}^{2 π} e^{- k ω i} ({\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}) d ω]}_{r, s}|}^{2}} \\ = \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} {|\int_{0}^{2 π} e^{- k ω i} {[{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}]}_{r, s} d ω|}^{2}} \\ \leq \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} 2 π \int_{0}^{2 π} {|e^{- k ω i} {[{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}]}_{r, s}|}^{2} d ω} \\ = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} \sum_{r, s = 1}^{N} {|e^{- k ω i}|}^{2} {|{[{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}]}_{r, s}|}^{2} d ω} \\ = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} \sum_{r, s = 1}^{N} {|{[{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}]}_{r, s}|}^{2} d ω} \\ = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥{\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}}∥}_{F}^{2} d ω} \end{matrix}

for all

n \in N

and

k \in Z

.

Moreover, if

{x_{n}}

is of finite order, then from Theorem 1 there exists

K_{2} \in [0, \infty)

such that

\begin{matrix} \sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - {(F (ω))}^{*} L_{Λ^{- 1}} ∥}_{F}^{2} d ω} \\ \leq \frac{∥ L_{n} - T_{n} (F^{*} L_{Λ^{- 1}}) ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \\ \leq K \frac{∥ A_{n}^{- 1} - {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \\ = K \frac{∥ {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} - A_{n}^{- 1} ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \\ = K \frac{∥ {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} (A_{n} - E (x_{n : 1} x_{n : 1}^{*})) A_{n}^{- 1} ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \\ \leq K ∥ {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} ∥_{2} \frac{∥ (A_{n} - E (x_{n : 1} x_{n : 1}^{*})) A_{n}^{- 1} ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \\ \leq K ∥ {(E (x_{n : 1} x_{n : 1}^{*}))}^{- 1} ∥_{2} {∥ A_{n}^{- 1} ∥}_{2} \frac{∥ A_{n} - E (x_{n : 1} x_{n : 1}^{*}) ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \forall n \in N . \end{matrix}

□

If we know

A_{n}

for a certain

n \in N

, Theorem 3 provides an estimate of the block entry

F_{- k}^{*} L_{Λ^{- 1}}

of the matrix

T_{n} (F^{*} L_{Λ^{- 1}})

in Equation (10) given by

\begin{matrix} \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω \\ = \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} \sum_{h = 1}^{n} χ_{[\frac{2 π (h - 1)}{n}, \frac{2 π h}{n})} (ω) {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} d ω \\ = \frac{1}{2 π} \sum_{h = 1}^{n} \int_{0}^{2 π} χ_{[\frac{2 π (h - 1)}{n}, \frac{2 π h}{n})} (ω) e^{- k ω i} d ω {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} \\ = \frac{1}{2 π} \sum_{h = 1}^{n} \int_{\frac{2 π (h - 1)}{n}}^{\frac{2 π h}{n}} e^{- k ω i} d ω {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} \\ = \{\begin{matrix} \frac{1}{n} \sum_{h = 1}^{n} {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} & if k = 0, \\ \frac{i}{2 π k} \sum_{h = 1}^{n} (e^{- k \frac{2 π h}{n} i} - e^{- k \frac{2 π (h - 1)}{n} i}) {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} & if k \in {1, \dots, n - 1} . \end{matrix} \end{matrix}
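The closed form above is straightforward to evaluate. The sketch below implements it for an arbitrary list of diagonal blocks; the function name `step_fourier_coeff` and the toy 1 × 1 blocks are hypothetical and only serve to exercise the two cases k = 0 and k ≠ 0.

```python
import numpy as np

def step_fourier_coeff(blocks, k):
    """k-th Fourier coefficient of the step function that takes the value
    blocks[h-1] on [2*pi*(h-1)/n, 2*pi*h/n), h = 1, ..., n."""
    blocks = np.asarray(blocks, dtype=complex)
    n = blocks.shape[0]
    if k == 0:
        return blocks.mean(axis=0)  # (1/n) * sum of the diagonal blocks
    h = np.arange(1, n + 1)
    weights = 1j / (2 * np.pi * k) * (np.exp(-2j * np.pi * k * h / n)
                                      - np.exp(-2j * np.pi * k * (h - 1) / n))
    return np.tensordot(weights, blocks, axes=1)  # sum_h weights[h] * blocks[h]

# Toy input: n = 4 scalar "blocks" with values 1, 2, 3, 4.
blocks = np.array([1.0, 2.0, 3.0, 4.0]).reshape(4, 1, 1)
c0 = step_fourier_coeff(blocks, 0)
c1 = step_fourier_coeff(blocks, 1)
print(c0, c1)
```

For a constant step function the weights sum to zero, so every coefficient with k ≠ 0 vanishes, as expected for a constant function.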

Therefore, if we know

A_{n}

for a certain

n \in N

, Theorem 3 allows us to estimate

Λ

and the parameters

F_{- 1}, \dots, F_{1 - n}

of the VAR process as follows

\hat{Λ} (n) = {((\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω) {(\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω)}^{*})}^{- 1}

and

{\hat{F}}_{- k} (n) = {((\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω) {(\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω)}^{- 1})}^{*} \forall k \in {1, \dots, n - 1} .

Example 1.

We consider the zero-mean 2-dimensional VAR

(1)

process

{x_{n}}

in ([15], Example 2.3), where

Λ = (\begin{matrix} 4 & 1 \\ 1 & 2 \end{matrix})

and

F_{- 1} = (\begin{matrix} - 0.8 & - 0.7 \\ 0.4 & - 0.6 \end{matrix}) .

Figure 1 shows the squared error made when Λ and

F_{- 1}

are estimated from the perturbed VAR

(1)

process whose sequence of correlation matrices is

{A_{n}} = \{E (x_{n : 1} x_{n : 1}^{*}) + (\begin{matrix} 0_{(2 n - 2) \times (2 n - 2)} & 0_{(2 n - 2) \times 2} \\ 0_{2 \times (2 n - 2)} & I_{2} \end{matrix})\} .

Observe that this perturbed process has been generated by corrupting the VAR

(1)

process in ([15], Example 2.3) by an impulse at

n = 1

.

Figure 1. Squared error made when

Λ

and

F_{- 1}

are estimated by

\hat{Λ} (n)

and

{\hat{F}}_{- 1} (n)

, respectively.
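The estimation procedure for the perturbed process of Example 1 can be sketched end to end as follows. This is an illustrative reimplementation under stated assumptions, not the code used to produce Figure 1: the function names are hypothetical, the exact correlation matrix is obtained by inverting the product in ([12], Equation (20)), and the squared error is taken here to be the squared Frobenius norm.

```python
import numpy as np

N = 2
Lam = np.array([[4.0, 1.0], [1.0, 2.0]])        # Lambda of Example 1
F_m1 = np.array([[-0.8, -0.7], [0.4, -0.6]])    # F_{-1} of Example 1

def step_fourier_coeff(blocks, k):
    """k-th Fourier coefficient of the step function built from the diagonal blocks."""
    n = blocks.shape[0]
    if k == 0:
        return blocks.mean(axis=0)
    h = np.arange(1, n + 1)
    weights = 1j / (2 * np.pi * k) * (np.exp(-2j * np.pi * k * h / n)
                                      - np.exp(-2j * np.pi * k * (h - 1) / n))
    return np.tensordot(weights, blocks, axes=1)

def estimate(n):
    """Return (Lam_hat(n), F_hat_{-1}(n)) computed from the perturbed matrix A_n."""
    # T_n(F): identity blocks on the diagonal, F_{-1} on the first block superdiagonal.
    T = np.kron(np.eye(n), np.eye(N)) + np.kron(np.eye(n, k=1), F_m1)
    corr = np.linalg.inv(T.T @ np.kron(np.eye(n), np.linalg.inv(Lam)) @ T)
    A = corr.copy()
    A[-N:, -N:] += np.eye(N)                   # perturbation of Example 1
    L = np.linalg.cholesky(np.linalg.inv(A))   # A_n^{-1} = L_n L_n^*
    idx = np.arange(n)
    W = np.kron(np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n), np.eye(N))
    B = W.conj().T @ L @ W
    blocks = np.array([B[h*N:(h+1)*N, h*N:(h+1)*N] for h in range(n)])
    c0 = step_fourier_coeff(blocks, 0)         # estimate of L_{Lam^{-1}}
    c1 = step_fourier_coeff(blocks, 1)         # estimate of F_{-1}^* L_{Lam^{-1}}
    Lam_hat = np.linalg.inv(c0 @ c0.conj().T)
    F_hat = (c1 @ np.linalg.inv(c0)).conj().T
    return Lam_hat, F_hat

def squared_errors(n):
    """Squared Frobenius errors of Lam_hat(n) and F_hat_{-1}(n)."""
    Lam_hat, F_hat = estimate(n)
    return np.linalg.norm(Lam_hat - Lam) ** 2, np.linalg.norm(F_hat - F_m1) ** 2

for n in (8, 64):
    print(n, squared_errors(n))
```

Note that the estimate of F_{-1} is complex in general, because the Fourier coefficients of the step function are complex even when L_n is real.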

4.2. Parameter Estimation Method for Perturbed VMA Processes

We begin by reviewing the concept of VMA process.

Definition 2.

A zero-mean random N-dimensional vector process

{x_{n}}

is said to be a VMA process if

x_{n} = w_{n} + \sum_{k = 1}^{n - 1} G_{k} w_{n - k} \forall n \in N,

(12)

where

G_{k} \in C^{N \times N}

for all

k \in N

, and

{w_{n}}

is a zero-mean random N-dimensional vector process whose sequence of correlation matrices is given by

{E (w_{1 : n} w_{1 : n}^{*})} = {T_{n} (Λ)}

with Λ being an

N \times N

positive definite matrix and

w_{1 : n} = (\begin{matrix} w_{1} \\ ⋮ \\ w_{n} \end{matrix}) \forall n \in N .

If there exists

q \in N

such that

G_{k} = 0_{N \times N}

for all

k > q

, then

{x_{n}}

is called a VMA process of (finite) order q or a VMA

(q)

process.

Let

{x_{n}}

be as in Definition 2. Assume that

{G_{k}}_{k \in Z}

, with

G_{0} = I_{N}

and

G_{- k} = 0_{N \times N}

for all

k \in N

, is the sequence of Fourier coefficients of a continuous

2 π

-periodic function

G : R \to C^{N \times N}

. Since Equation (12) can be rewritten as

x_{n} = (\begin{matrix} G_{n - 1} & \dots & G_{1} & I_{N} \end{matrix}) w_{1 : n} \forall n \in N,

we have

x_{1 : n} = (\begin{matrix} I_{N} & 0_{N \times N} & 0_{N \times N} & \dots & 0_{N \times N} \\ G_{1} & I_{N} & 0_{N \times N} & \dots & 0_{N \times N} \\ G_{2} & G_{1} & I_{N} & \dots & 0_{N \times N} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ G_{n - 1} & G_{n - 2} & G_{n - 3} & \dots & I_{N} \end{matrix}) w_{1 : n} = T_{n} (G) w_{1 : n} \forall n \in N,

and consequently,

\begin{matrix} \{E (x_{1 : n} x_{1 : n}^{*})\} & = \{E (T_{n} (G) w_{1 : n} w_{1 : n}^{*} {(T_{n} (G))}^{*})\} \\ = \{T_{n} (G) E (w_{1 : n} w_{1 : n}^{*}) {(T_{n} (G))}^{*}\} = {T_{n} (G) T_{n} (Λ) {(T_{n} (G))}^{*}} . \end{matrix}

(13)

If

Λ = L_{Λ} L_{Λ}^{*}

is the Cholesky decomposition of

Λ

, then

E (x_{1 : n} x_{1 : n}^{*}) = T_{n} (G L_{Λ}) {(T_{n} (G L_{Λ}))}^{*}

(14)

is the Cholesky decomposition of the positive definite matrix

E (x_{1 : n} x_{1 : n}^{*})

for all

n \in N

, because

\begin{matrix} E (x_{1 : n} x_{1 : n}^{*}) & = T_{n} (G) T_{n} (L_{Λ} L_{Λ}^{*}) {(T_{n} (G))}^{*} = T_{n} (G) T_{n} (L_{Λ}) T_{n} (L_{Λ}^{*}) {(T_{n} (G))}^{*} \\ = T_{n} (G L_{Λ}) {(T_{n} (L_{Λ}))}^{*} {(T_{n} (G))}^{*} = T_{n} (G L_{Λ}) {(T_{n} (G) T_{n} (L_{Λ}))}^{*} \forall n \in N . \end{matrix}

Observe that if we know the correlation matrix

E (x_{1 : n} x_{1 : n}^{*})

for a certain

n \in N

, then its Cholesky decomposition provides

Λ

and the parameters

G_{1}, \dots, G_{n - 1}

of the VMA process, since

T_{n} (G L_{Λ}) = (\begin{matrix} L_{Λ} & 0_{N \times N} & 0_{N \times N} & \dots & 0_{N \times N} \\ G_{1} L_{Λ} & L_{Λ} & 0_{N \times N} & \dots & 0_{N \times N} \\ G_{2} L_{Λ} & G_{1} L_{Λ} & L_{Λ} & \dots & 0_{N \times N} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ G_{n - 1} L_{Λ} & G_{n - 2} L_{Λ} & G_{n - 3} L_{Λ} & \dots & L_{Λ} \end{matrix}) .

(15)
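As with the VAR case, the recovery of Λ and G_1 from the first block column of the matrix in Equation (15) can be sketched for an unperturbed VMA(1) process. The values of Λ and G_1, the variable names, and the choice n = 5 are hypothetical.

```python
import numpy as np

N, n = 2, 5
Lam = np.array([[2.0, 0.5], [0.5, 1.0]])      # hypothetical Lambda
G1 = np.array([[0.5, 0.2], [-0.3, 0.4]])      # hypothetical G_1 of a VMA(1) process

# T_n(G): identity blocks on the diagonal and G_1 on the first block subdiagonal,
# because block (j, k) equals G_{j-k} and G_{-k} = 0 for k >= 1.
T = np.kron(np.eye(n), np.eye(N)) + np.kron(np.eye(n, k=-1), G1)

# Correlation matrix T_n(G) T_n(Lam) T_n(G)^* of Equation (13); T is real, so T.T = T^*.
corr = T @ np.kron(np.eye(n), Lam) @ T.T

L = np.linalg.cholesky(corr)   # lower triangular Cholesky factor
L00 = L[:N, :N]                # block (1,1) of Equation (15): L_Lam
L10 = L[N:2*N, :N]             # block (2,1) of Equation (15): G_1 L_Lam

Lam_rec = L00 @ L00.T
G1_rec = L10 @ np.linalg.inv(L00)
print(Lam_rec)
print(G1_rec)
```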

However, in practice what we usually know is a perturbed version

{A_{n}}

of the sequence of correlation matrices

{E (x_{1 : n} x_{1 : n}^{*})}

of the process. The following theorem allows us to estimate

Λ

and the parameters

{G_{k}}_{k \in N}

of the VMA process from the Cholesky decomposition of the matrices of the sequence

{A_{n}}

, when

{A_{n}} \sim {E (x_{1 : n} x_{1 : n}^{*})}

.

Theorem 4.

Let

{x_{n}}

be as in Definition 2. Assume that

{G_{k}}_{k \in Z}

, with

G_{0} = I_{N}

and

G_{- k} = 0_{N \times N}

for all

k \in N

, is the sequence of Fourier coefficients of a continuous

2 π

-periodic function

G : R \to C^{N \times N}

. Suppose that

A_{n}

is an

n N \times n N

positive definite matrix for all

n \in N

satisfying that

{A_{n}}

is stable and

{A_{n}} \sim {E (x_{1 : n} x_{1 : n}^{*})}

. Let

A_{n} = L_{n} L_{n}^{*}

be the Cholesky decomposition of

A_{n}

for all

n \in N

. If

{E (x_{1 : n} x_{1 : n}^{*})}

is stable, then

\lim_{n \to \infty} \frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ} ∥}_{F}^{2} d ω = 0,

(16)

and

{∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - G_{k} L_{Λ}∥}_{F}^{2} \leq \frac{1}{2 π} \int_{0}^{2 π} {∥{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}∥}_{F}^{2} d ω

for all

n \in N

and

k \in \{0, 1, \dots, n - 1\}

, where

Λ = L_{Λ} L_{Λ}^{*}

is the Cholesky decomposition of Λ. Moreover, if

{x_{n}}

is of finite order, there exist

K_{1}, K_{2} \in [0, \infty)

such that

\sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ} ∥}_{F}^{2} d ω} \leq K_{1} \frac{∥ A_{n} - E (x_{1 : n} x_{1 : n}^{*}) ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \forall n \in N .

Proof.

From Equation (14) we have

{A_{n}} \sim {E (x_{1 : n} x_{1 : n}^{*})} = {T_{n} (G L_{Λ}) {(T_{n} (G L_{Λ}))}^{*}}

. As

{∥ A_{n}^{- 1} ∥_{2}}

and

{∥ {(E (x_{1 : n} x_{1 : n}^{*}))}^{- 1} ∥_{2}}

are bounded, the sequences

{∥ L_{n}^{- 1} ∥_{2}} = \{\sqrt{λ_{1} ({(L_{n}^{- 1})}^{*} L_{n}^{- 1})}\} = \{\sqrt{{∥{(L_{n}^{- 1})}^{*} L_{n}^{- 1}∥}_{2}}\} = \{\sqrt{{∥{(L_{n} L_{n}^{*})}^{- 1}∥}_{2}}\} = \{\sqrt{{∥A_{n}^{- 1}∥}_{2}}\}

and

{∥ {(T_{n} (G L_{Λ}))}^{- 1} ∥_{2}} = \{\sqrt{{∥{(T_{n} (G L_{Λ}) {(T_{n} (G L_{Λ}))}^{*})}^{- 1}∥}_{2}}\} = \{\sqrt{{∥{(E (x_{1 : n} x_{1 : n}^{*}))}^{- 1}∥}_{2}}\}

are also bounded. Consequently, from Theorem 2 we have that

{L_{n}} \sim {T_{n} (G L_{Λ})}

and that there exists

K_{1} \in [0, \infty)

such that

\frac{∥ L_{n} - T_{n} (G L_{Λ}) ∥_{F}}{\sqrt{n}} \leq K_{1} \frac{{∥A_{n} - T_{n} (G L_{Λ}) {(T_{n} (G L_{Λ}))}^{*}∥}_{F}}{\sqrt{n}} = K_{1} \frac{{∥A_{n} - E (x_{1 : n} x_{1 : n}^{*})∥}_{F}}{\sqrt{n}} \forall n \in N .

Therefore, applying Theorem 1 we conclude that Equation (16) holds.

Applying the Schwarz inequality (see, e.g., ([14], p. 139)) yields

\begin{matrix} {∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - G_{k} L_{Λ}∥}_{F} & = {∥\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω - \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} G (ω) d ω L_{Λ}∥}_{F} \\ & = \frac{1}{2 π} {∥\int_{0}^{2 π} e^{- k ω i} ({\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}) d ω∥}_{F} \\ & = \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} {|{[\int_{0}^{2 π} e^{- k ω i} ({\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}) d ω]}_{r, s}|}^{2}} \\ & = \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} {|\int_{0}^{2 π} e^{- k ω i} {[{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}]}_{r, s} d ω|}^{2}} \\ & \leq \frac{1}{2 π} \sqrt{\sum_{r, s = 1}^{N} 2 π \int_{0}^{2 π} {|e^{- k ω i} {[{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}]}_{r, s}|}^{2} d ω} \\ & = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} \sum_{r, s = 1}^{N} {|e^{- k ω i}|}^{2} {|{[{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}]}_{r, s}|}^{2} d ω} \\ & = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} \sum_{r, s = 1}^{N} {|{[{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}]}_{r, s}|}^{2} d ω} \\ & = \sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥{\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ}∥}_{F}^{2} d ω} \end{matrix}

for all

n \in N

and

k \in Z

.

Moreover, if

{x_{n}}

is of finite order, from Theorem 1 there exists

K_{2} \in [0, \infty)

such that

\begin{matrix} \sqrt{\frac{1}{2 π} \int_{0}^{2 π} {∥ {\hat{P}}_{L_{n}} (ω) - G (ω) L_{Λ} ∥}_{F}^{2} d ω} \\ \leq \frac{∥ L_{n} - T_{n} (G L_{Λ}) ∥_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \leq K_{1} \frac{{∥A_{n} - E (x_{1 : n} x_{1 : n}^{*})∥}_{F}}{\sqrt{n}} + \frac{K_{2}}{\sqrt{n}} \forall n \in N . \end{matrix}

□
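
The norm identity used at the start of the proof, ∥L_n^{-1}∥_2 = \sqrt{∥A_n^{-1}∥_2}, can be checked numerically. The following is a small sketch on a randomly generated Hermitian positive definite matrix; the matrix size, the random seed, and the variable names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((8, 8)) + 1j * rng.standard_normal((8, 8))
A = M @ M.conj().T + 8 * np.eye(8)    # Hermitian positive definite
L = np.linalg.cholesky(A)             # A = L L^*

# Spectral norm (largest singular value) of L^{-1}
lhs = np.linalg.norm(np.linalg.inv(L), 2)
# Square root of the spectral norm of A^{-1}
rhs = np.sqrt(np.linalg.norm(np.linalg.inv(A), 2))
print(abs(lhs - rhs))
```

The printed difference should be at the level of floating-point rounding, since A^{-1} = (L^{-1})^{*} L^{-1}.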

If we know

A_{n}

for a certain

n \in N

, Theorem 4 provides an estimate of the block entry

G_{k} L_{Λ}

of the matrix

T_{n} (G L_{Λ})

in Equation (15) given by

\begin{matrix} \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω \\ = \left\{\begin{matrix} \frac{1}{n} \sum_{h = 1}^{n} {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} & \text{if } k = 0, \\ \frac{i}{2 π k} \sum_{h = 1}^{n} (e^{- k \frac{2 π h}{n} i} - e^{- k \frac{2 π (h - 1)}{n} i}) {[{(V_{n} \otimes I_{N})}^{*} L_{n} (V_{n} \otimes I_{N})]}_{h, h} & \text{if } k \in \{1, \dots, n - 1\} . \end{matrix}\right. \end{matrix}
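
The closed form above is what one obtains if \hat{P}_{L_n} is a step function that is constant on each interval [2π(h-1)/n, 2πh/n), with value the h-th N × N diagonal block of (V_n ⊗ I_N)^* L_n (V_n ⊗ I_N). This reading of the definition in Section 2.2 is an assumption here, stated only to make the step explicit. Under it, the integral splits into n elementary integrals:

```latex
\frac{1}{2\pi}\int_{0}^{2\pi} e^{-k\omega i}\,\hat{P}_{L_n}(\omega)\,d\omega
  = \frac{1}{2\pi}\sum_{h=1}^{n}
    \left(\int_{2\pi(h-1)/n}^{2\pi h/n} e^{-k\omega i}\,d\omega\right)
    \left[(V_n\otimes I_N)^{*} L_n (V_n\otimes I_N)\right]_{h,h},
\qquad
\int_{2\pi(h-1)/n}^{2\pi h/n} e^{-k\omega i}\,d\omega =
\begin{cases}
  \dfrac{2\pi}{n} & \text{if } k = 0,\\[1ex]
  \dfrac{i}{k}\left(e^{-k\frac{2\pi h}{n} i} - e^{-k\frac{2\pi (h-1)}{n} i}\right) & \text{if } k \neq 0 .
\end{cases}
```

Multiplying the elementary integrals by 1/(2π) gives the weights 1/n and i/(2πk) that appear in the two cases.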

Therefore, if we know

A_{n}

for a certain

n \in N

, Theorem 4 allows us to estimate

Λ

and the parameters

G_{1}, \dots, G_{n - 1}

of the VMA process as follows:

\hat{Λ} (n) = (\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω) {(\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω)}^{*}

and

{\hat{G}}_{k} (n) = (\frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} {\hat{P}}_{L_{n}} (ω) d ω) {(\frac{1}{2 π} \int_{0}^{2 π} {\hat{P}}_{L_{n}} (ω) d ω)}^{- 1} \forall k \in \{1, \dots, n - 1\} .

Example 2.

We consider the zero-mean 2-dimensional VMA(1) process

{x_{n}}

in ([15], Example 2.1), where

Λ = (\begin{matrix} 4 & 1 \\ 1 & 2 \end{matrix})

and

G_{1} = (\begin{matrix} - 0.8 & - 0.7 \\ 0.4 & - 0.6 \end{matrix}) .

Figure 2 shows the squared error made when Λ and

G_{1}

are estimated from the perturbed VMA(1) process whose sequence of correlation matrices is

{A_{n}} = \{E (x_{1 : n} x_{1 : n}^{*}) + (\begin{matrix} I_{2} & 0_{2 \times (2 n - 2)} \\ 0_{(2 n - 2) \times 2} & 0_{(2 n - 2) \times (2 n - 2)} \end{matrix})\} .

Observe that this perturbed process has been generated by corrupting the VMA(1) process in ([15], Example 2.1) by an impulse at

n = 1

.

Figure 2. Squared error made when

Λ

and

G_{1}

are estimated by

\hat{Λ} (n)

and

{\hat{G}}_{1} (n)

, respectively.
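
The estimators \hat{Λ}(n) and \hat{G}_1(n) can be sketched numerically for the perturbed VMA(1) process of Example 2. The following Python/NumPy snippet is a minimal sketch rather than the authors' code. It assumes that V_n is the n × n Fourier unitary matrix with entries e^{-2π(j-1)(k-1)i/n}/\sqrt{n} and that the squared error is measured in the Frobenius norm; the function names (`vma1_correlation`, `diagonal_blocks`, `fourier_coefficient`, `estimate_vma`, `squared_errors`) are hypothetical.

```python
import numpy as np

def vma1_correlation(G1, Lam, n):
    """E(x_{1:n} x_{1:n}^*) = T_n(G) T_n(Lam) T_n(G)^* for a VMA(1) process."""
    N = Lam.shape[0]
    T_G = np.eye(n * N) + np.kron(np.eye(n, k=-1), G1)
    return T_G @ np.kron(np.eye(n), Lam) @ T_G.conj().T

def diagonal_blocks(L_n, N):
    """N x N diagonal blocks of (V_n kron I_N)^* L_n (V_n kron I_N)."""
    n = L_n.shape[0] // N
    idx = np.arange(n)
    # Assumed convention: [V_n]_{j,k} = exp(-2*pi*(j-1)*(k-1)*i/n) / sqrt(n)
    V = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
    W = np.kron(V, np.eye(N))
    D = W.conj().T @ L_n @ W
    return np.stack([D[h * N:(h + 1) * N, h * N:(h + 1) * N] for h in range(n)])

def fourier_coefficient(blocks, k):
    """(1/(2 pi)) * integral of exp(-k w i) P_hat(w) dw, via the closed form."""
    n = blocks.shape[0]
    if k == 0:
        return blocks.mean(axis=0)
    h = np.arange(1, n + 1)
    weights = (np.exp(-2j * np.pi * k * h / n)
               - np.exp(-2j * np.pi * k * (h - 1) / n))
    return 1j / (2 * np.pi * k) * np.einsum("h,hrs->rs", weights, blocks)

def estimate_vma(A_n, N, order):
    """Return (Lam_hat, [G_hat_1, ..., G_hat_order]) computed from A_n."""
    blocks = diagonal_blocks(np.linalg.cholesky(A_n), N)
    c0 = fourier_coefficient(blocks, 0)
    c0_inv = np.linalg.inv(c0)
    Lam_hat = c0 @ c0.conj().T
    G_hat = [fourier_coefficient(blocks, k) @ c0_inv
             for k in range(1, order + 1)]
    return Lam_hat, G_hat

def squared_errors(n, Lam, G1):
    """Squared Frobenius errors of Lam_hat(n) and G_hat_1(n)."""
    N = Lam.shape[0]
    A_n = vma1_correlation(G1, Lam, n)
    A_n[:N, :N] += np.eye(N)   # impulse perturbation in the first block
    Lam_hat, G_hat = estimate_vma(A_n, N, order=1)
    return (np.linalg.norm(Lam_hat - Lam, "fro") ** 2,
            np.linalg.norm(G_hat[0] - G1, "fro") ** 2)

Lam = np.array([[4.0, 1.0], [1.0, 2.0]])
G1 = np.array([[-0.8, -0.7], [0.4, -0.6]])
for n in (10, 50, 100):
    print(n, *squared_errors(n, Lam, G1))
```

Under these assumptions, the printed squared errors should decrease as n grows, in line with the bound of Theorem 4. Note that \hat{G}_1(n) is complex-valued for finite n even when G_1 is real, because the closed-form weights are complex.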

In [2], the periodogram method for perturbed block Toeplitz matrices was applied in spectral estimation. In Theorems 3 and 4, it has also been applied in parameter estimation for perturbed VAR processes and in parameter estimation for perturbed VMA processes, respectively. We finish the paper by showing that the periodogram method for perturbed block Toeplitz matrices can be applied in a fourth statistical signal processing problem, namely, in MIMO channel identification with perturbed additive WSS noise.

In [16], an asymptotic result on block Toeplitz matrices was applied in single-input multiple-output (SIMO) channel identification. Here, we show that Theorem 4 can be applied in MIMO channel identification when the number of channel inputs and the number of channel outputs are equal.

We consider a MIMO channel with a discrete-time causal infinite impulse response (IIR) filter and additive noise. Thus, the channel output process

{y_{n}}

is of the form

y_{n} = x_{n} + ϵ_{n} = \sum_{k = 0}^{n - 1} G_{k} w_{n - k} + ϵ_{n} \forall n \in N .

We assume that the filter taps satisfy

G_{k} \in C^{N \times N}

for all

k \in N

and

G_{0} = I_{N}

. We also assume that

{G_{k}}_{k \in Z}

, with

G_{- k} = 0_{N \times N}

for all

k \in N

, is the sequence of Fourier coefficients of a continuous

2 π

-periodic function

G : R \to C^{N \times N}

. We consider that the input process

{w_{n}}

is a zero-mean WSS N-dimensional vector process with

{E (w_{1 : n} w_{1 : n}^{*})} = {T_{n} (Λ)}

, where

Λ

is an

N \times N

positive definite matrix. We assume that the noise process

{ϵ_{n}}

is a zero-mean random N-dimensional vector process satisfying that there exists a continuous

2 π

-periodic function

Υ : R \to C^{N \times N}

such that

{E (ϵ_{1 : n} ϵ_{1 : n}^{*})} \sim {T_{n} (Υ)}

. We also assume that the noise process is uncorrelated with the input process.

Suppose that

{E (x_{1 : n} x_{1 : n}^{*})}

is stable and

{A_{n}} = {E (y_{1 : n} y_{1 : n}^{*}) - T_{n} (Υ)}

is a stable sequence of positive definite matrices. To show that Theorem 4 can be applied here, we only need to prove that

{A_{n}} \sim {E (x_{1 : n} x_{1 : n}^{*})}

.

From Equation (13) we obtain

{∥E (x_{1 : n} x_{1 : n}^{*})∥}_{2} = {∥T_{n} (G) T_{n} (Λ) {(T_{n} (G))}^{*}∥}_{2} \leq {∥T_{n} (G)∥}_{2} {∥T_{n} (Λ)∥}_{2} {∥{(T_{n} (G))}^{*}∥}_{2} = {∥Λ∥}_{2} {∥T_{n} (G)∥}_{2}^{2}

for all

n \in N

. Hence, as

{∥ T_{n} (G) ∥_{2}}

is bounded (see, e.g., ([5], Theorem 4.3) or ([9], Corollary 4.2)),

{∥ E (x_{1 : n} x_{1 : n}^{*}) ∥_{2}}

is also bounded and

{E (x_{1 : n} x_{1 : n}^{*})} \sim {E (x_{1 : n} x_{1 : n}^{*})}

. Since

{∥ - T_{n} (Υ) ∥_{2}} = {∥ T_{n} (Υ) ∥_{2}}

is bounded,

{- T_{n} (Υ)} \sim {- T_{n} (Υ)}

, and consequently, applying ([5], Lemma 3.1) yields

{E (ϵ_{1 : n} ϵ_{1 : n}^{*}) - T_{n} (Υ)} \sim {0_{n N \times n N}}

. Therefore, from ([5], Lemma 3.1) we conclude that

\begin{matrix} {A_{n}} & = {E ((x_{1 : n} + ϵ_{1 : n}) {(x_{1 : n} + ϵ_{1 : n})}^{*}) - T_{n} (Υ)} \\ & = {E (x_{1 : n} x_{1 : n}^{*}) + E (x_{1 : n} ϵ_{1 : n}^{*}) + E (ϵ_{1 : n} x_{1 : n}^{*}) + E (ϵ_{1 : n} ϵ_{1 : n}^{*}) - T_{n} (Υ)} \\ & = {E (x_{1 : n} x_{1 : n}^{*}) + E (x_{1 : n}) {(E (ϵ_{1 : n}))}^{*} + E (ϵ_{1 : n}) {(E (x_{1 : n}))}^{*} + E (ϵ_{1 : n} ϵ_{1 : n}^{*}) - T_{n} (Υ)} \\ & = {E (x_{1 : n} x_{1 : n}^{*}) + E (ϵ_{1 : n} ϵ_{1 : n}^{*}) - T_{n} (Υ)} \sim {E (x_{1 : n} x_{1 : n}^{*})} . \end{matrix}

Thus, Theorem 4 can be applied in the considered MIMO channel identification problem, that is, it can be used to identify

Λ

and the filter taps

{G_{k}}_{k \in N}

.
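
The identification procedure can be illustrated on a toy channel. The sketch below is built on several assumptions that are not in the paper: a 2 × 2 channel with a single filter tap G_1 (a finite impulse response special case), a noise process ϵ_n = v_n + H_1 v_{n-1} with v_0 = 0 and white v_n of correlation Σ (so that the noise correlation matrices are asymptotically equivalent to T_n(Υ) without being Toeplitz), exact rather than estimated correlation matrices, and the Fourier unitary matrix convention [V_n]_{j,k} = e^{-2π(j-1)(k-1)i/n}/\sqrt{n}. All function and variable names are hypothetical.

```python
import numpy as np

def lower_block_toeplitz(blocks, n):
    """n x n lower triangular block Toeplitz matrix with first block column `blocks`."""
    return sum(np.kron(np.eye(n, k=-k), B) for k, B in enumerate(blocks[:n]))

def theorem4_estimates(A_n, N, order):
    """Estimates (Lam_hat, [G_hat_1, ..., G_hat_order]) from a positive definite A_n."""
    n = A_n.shape[0] // N
    idx = np.arange(n)
    # Assumed convention for the Fourier unitary matrix V_n
    V = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
    W = np.kron(V, np.eye(N))
    D = W.conj().T @ np.linalg.cholesky(A_n) @ W
    blocks = np.stack([D[h * N:(h + 1) * N, h * N:(h + 1) * N] for h in range(n)])
    c0 = blocks.mean(axis=0)                  # coefficient for k = 0
    c0_inv = np.linalg.inv(c0)
    h = np.arange(1, n + 1)
    G_hat = []
    for k in range(1, order + 1):
        w = np.exp(-2j * np.pi * k * h / n) - np.exp(-2j * np.pi * k * (h - 1) / n)
        c_k = 1j / (2 * np.pi * k) * np.einsum("h,hrs->rs", w, blocks)
        G_hat.append(c_k @ c0_inv)
    return c0 @ c0.conj().T, G_hat

def identification_errors(n):
    """Squared Frobenius errors of Lam_hat and G_hat_1 for the toy channel."""
    N = 2
    Lam = np.array([[4.0, 1.0], [1.0, 2.0]])       # input correlation (toy value)
    G1 = np.array([[-0.8, -0.7], [0.4, -0.6]])     # single filter tap (toy value)
    Sigma, H1 = 0.1 * np.eye(N), 0.5 * np.eye(N)   # hypothetical noise model
    T_G = lower_block_toeplitz([np.eye(N), G1], n)
    R_x = T_G @ np.kron(np.eye(n), Lam) @ T_G.conj().T
    # Noise eps_n = v_n + H1 v_{n-1} with v_0 = 0 (asymptotically WSS)
    T_H = lower_block_toeplitz([np.eye(N), H1], n)
    R_eps = T_H @ np.kron(np.eye(n), Sigma) @ T_H.conj().T
    # T_n(Upsilon) for Upsilon(w) = (I + H1 e^{wi}) Sigma (I + H1 e^{wi})^*
    T_Ups = (np.kron(np.eye(n), Sigma + H1 @ Sigma @ H1.conj().T)
             + np.kron(np.eye(n, k=-1), H1 @ Sigma)
             + np.kron(np.eye(n, k=1), Sigma @ H1.conj().T))
    R_y = R_x + R_eps          # noise is uncorrelated with the input
    A_n = R_y - T_Ups          # differs from R_x only in the first block
    Lam_hat, G_hat = theorem4_estimates(A_n, N, order=1)
    return (np.linalg.norm(Lam_hat - Lam, "fro") ** 2,
            np.linalg.norm(G_hat[0] - G1, "fro") ** 2)

for n in (10, 40, 160):
    print(n, identification_errors(n))
```

In this toy setting A_n differs from E(x_{1:n} x_{1:n}^*) only in its first diagonal block, so the two sequences are asymptotically equivalent and the errors should shrink as n grows. In practice E(y_{1:n} y_{1:n}^*) would itself have to be estimated from channel outputs, which this sketch does not attempt.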

5. Conclusions

In ([2], Theorem 4), the (averaged) periodogram method for positive semidefinite Toeplitz matrices was generalized to perturbed block Toeplitz matrices. Moreover, ([2], Theorem 4) was applied there to perturbed positive semidefinite block Toeplitz matrices to solve a statistical signal processing problem: spectral estimation for perturbed WSS vector processes.

In the present paper, ([2], Theorem 4) (Theorem 1) has been applied to perturbed lower triangular block Toeplitz matrices to solve three statistical signal processing problems: parameter estimation for perturbed VAR processes, parameter estimation for perturbed VMA processes, and MIMO channel identification with perturbed additive WSS noise. To solve those problems we have first generalized a result given in [3] on the Cholesky decomposition of Toeplitz matrices to perturbed block Toeplitz matrices.

Author Contributions

Authors are listed in order of their degree of involvement in the work, with the most active contributors listed first. J.G.-G. conceived the research question. All authors were involved in the research and wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Basque Government through the research project “Advanced distributed control for safety and energy efficiency of air transport (CODISAVA)” (KK-2018/00082).

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Gray, R.M. On the asymptotic eigenvalue distribution of Toeplitz matrices. IEEE Trans. Inf. Theory 1972, IT-18, 725–730.
2. Gutiérrez-Gutiérrez, J. A modified version of the Pisarenko method to estimate the power spectral density of any asymptotically wide sense stationary vector process. Appl. Math. Comput. 2019, 362, 124526.
3. Gray, R.M. Toeplitz and circulant matrices: A review. Found. Trends Commun. Inf. Theory 2006, 2, 155–239.
4. Pisarenko, V.F. On the estimation of spectra by means of non-linear functions of the covariance matrix. Geophys. J. R. Astron. Soc. 1972, 28, 511–531.
5. Gutiérrez-Gutiérrez, J.; Crespo, P.M. Block Toeplitz matrices: Asymptotic results and applications. Found. Trends Commun. Inf. Theory 2011, 8, 179–257.
6. Serra, S. Asymptotic results on the spectra of block Toeplitz preconditioned matrices. SIAM J. Matrix Anal. Appl. 1998, 20, 31–44.
7. Miranda, M.; Tilli, P. Asymptotic spectra of Hermitian block Toeplitz matrices and preconditioning results. SIAM J. Matrix Anal. Appl. 2000, 21, 867–881.
8. Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and Hermitian block Toeplitz matrices with continuous symbols: Applications to MIMO systems. IEEE Trans. Inf. Theory 2008, 54, 5671–5680.
9. Tilli, P. Singular values and eigenvalues of non-Hermitian block Toeplitz matrices. Linear Algebra Appl. 1998, 272, 59–89.
10. Lancaster, P.; Tismenetsky, M. The Theory of Matrices; Academic Press: Cambridge, MA, USA, 1985.
11. Bernstein, D.S. Matrix Mathematics: Theory, Facts, and Formulas; Princeton University Press: Princeton, NJ, USA, 2009.
12. Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and multivariate ARMA processes. IEEE Trans. Inf. Theory 2011, 57, 5444–5454.
13. Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Crespo, P.M.; Insausti, X. Rate distortion function of Gaussian asymptotically WSS vector processes. Entropy 2018, 20, 719.
14. Rudin, W. Principles of Mathematical Analysis; McGraw-Hill: New York, NY, USA, 1976.
15. Reinsel, G.C. Elements of Multivariate Time Series Analysis; Springer: Berlin, Germany, 1993.
16. Gazzah, H.; Regalia, P.A.; Delmas, J.P. Asymptotic eigenvalue distribution of block Toeplitz matrices and application to blind SIMO channel identification. IEEE Trans. Inf. Theory 2001, 47, 1243–1251.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
