Abstract
In this paper, we study the asymptotic optimality of a low-complexity coding strategy for Gaussian vector sources. Specifically, we study the convergence speed of the rate of such a coding strategy when it is used to encode the most relevant vector sources, namely wide sense stationary (WSS), moving average (MA), and autoregressive (AR) vector sources. We also study how the coding strategy considered performs when it is used to encode perturbed versions of those relevant sources. More precisely, we give a sufficient condition for such perturbed versions so that the convergence speed of the rate remains unaltered.
1. Introduction
In [1], Kolmogorov gave a formula for the rate distortion function (RDF) of Gaussian vectors and for the RDF of Gaussian wide sense stationary (WSS) sources. In [2], Pearl presented an upper bound for the RDF of finite-length data blocks of any Gaussian WSS source and proved that such a bound tends to the RDF of the source as the length of the data block grows. However, he did not propose a coding strategy to achieve his bound for a given block length. In [3], we presented a tighter upper bound for the RDF of finite-length data blocks of any Gaussian WSS source, and we proposed a low-complexity coding strategy to achieve our bound. Since such a bound is tighter than the one given by Pearl, it also tends to the RDF of the source as the length of the data block grows. In [4], we generalized our low-complexity coding strategy to encode (compress) finite-length data blocks of any Gaussian vector source. In [4], we also gave a sufficient condition on the vector source under which such a coding strategy is asymptotically optimal. We recall that a coding strategy is asymptotically optimal if its rate tends to the RDF of the source as the length of the data block grows. That sufficient condition requires the Gaussian vector source to be asymptotically WSS (AWSS). The definition of an AWSS process was first introduced in [5], Section 6, and extended to vector processes in [6], Definition 7.1. However, the convergence speed of the rate of the coding strategy considered (i.e., how fast the rate of the coding strategy tends to the RDF of the AWSS vector source) was not studied in [4].
In this paper, we present a less restrictive sufficient condition on the vector source under which the coding strategy considered is asymptotically optimal. Moreover, we study the convergence speed of the rate of such a coding strategy when it is used to encode the most relevant vector sources, namely, WSS, moving average (MA), and autoregressive (AR) vector sources. We also study how the coding strategy considered performs when it is used to encode perturbed versions of those sources. Specifically, we give a sufficient condition on such perturbed versions under which the convergence speed of the rate remains unaltered.
Studying the convergence speed in any information-theoretic problem is not an easy task. To study the aforementioned convergence speed, we first need to derive new mathematical results on block Toeplitz matrices and on the correlation matrices of WSS, MA, and AR vector processes. These new mathematical results are useful not only for studying the convergence speed in the information-theoretic problem considered here, but also in other problems. As an example, in Appendix H, we use them to study the convergence speed in a statistical signal processing problem on filtering WSS vector processes.
The paper is organized as follows. In Section 2, we give several new mathematical results on block Toeplitz matrices. In Section 3, using the results obtained in Section 2, we give several new mathematical results on the correlation matrices of WSS, MA, and AR vector processes. In Section 4, we recall the low-complexity coding strategy presented in [4], and using the results obtained in Section 3, we study the asymptotic optimality of such a coding strategy when it is used to encode WSS, MA, and AR vector sources. In Section 4, we also study how the coding strategy considered performs when it is used to encode perturbed versions of those sources. Finally, in Section 5, some conclusions are presented.
2. Several New Results on Block Toeplitz Matrices
In this section, we present new results on the product of block Toeplitz matrices, on the inverse of a block Toeplitz matrix, and on block circulant matrices. These results will be used in Section 3. We begin by introducing some notation.
2.1. Notation
In this paper, $\mathbb{N}$, $\mathbb{Z}$, $\mathbb{R}$, and $\mathbb{C}$ denote the set of natural numbers (that is, the set of positive integers), the set of integer numbers, the set of real numbers, and the set of complex numbers, respectively. $\mathbb{C}^{m\times n}$ is the set of all $m\times n$ complex matrices. $I_n$ stands for the $n\times n$ identity matrix. $0_{m\times n}$ denotes the $m\times n$ zero matrix. $V_n$ is the $n\times n$ Fourier unitary matrix, i.e.,

$$[V_n]_{j,k}=\frac{1}{\sqrt{n}}\,e^{-\frac{2\pi(j-1)(k-1)i}{n}},\qquad j,k\in\{1,\ldots,n\},$$

with $i$ being the imaginary unit. We denote by $\lambda_1(A)\geq\ldots\geq\lambda_n(A)$ the eigenvalues of an $n\times n$ Hermitian matrix A arranged in decreasing order. $*$ denotes the conjugate transpose. ⊗ is the Kronecker product. $\|\cdot\|_2$ and $\|\cdot\|_F$ are the spectral norm and the Frobenius norm, respectively.
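As a quick illustration, the following minimal numpy sketch builds $V_n$ (assuming the standard DFT sign convention; the function name is ours) and verifies its unitarity:

```python
import numpy as np

def fourier_unitary_matrix(n: int) -> np.ndarray:
    """n x n Fourier unitary matrix: [V_n]_{j,k} = e^{-2*pi*(j-1)*(k-1)*i/n} / sqrt(n)."""
    j, k = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    return np.exp(-2j * np.pi * j * k / n) / np.sqrt(n)

V = fourier_unitary_matrix(8)
assert np.allclose(V @ V.conj().T, np.eye(8))  # V is unitary: V V* = I_8
```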
If $n\in\mathbb{N}$ and $A_k\in\mathbb{C}^{N\times N}$ for all $k\in\{1,\ldots,n\}$, then $\operatorname{diag}_{1\leq k\leq n}(A_k)$ is the $nN\times nN$ block diagonal matrix whose $N\times N$ blocks are given by:

$$\left[\operatorname{diag}_{1\leq k\leq n}(A_k)\right]_{j,k}=\delta_{j,k}A_k,\qquad j,k\in\{1,\ldots,n\},$$

where $\delta$ is the Kronecker delta. We also denote by $\operatorname{diag}_n(A)$ the matrix $\operatorname{diag}_{1\leq k\leq n}(A)$.
If $n\in\mathbb{N}$ and $F:\mathbb{R}\to\mathbb{C}^{N\times N}$ is a continuous $2\pi$-periodic function, $T_n(F)$ stands for the $nN\times nN$ block Toeplitz matrix generated by F, whose $N\times N$ blocks are given by:

$$[T_n(F)]_{j,k}=\widehat{F}_{j-k},\qquad j,k\in\{1,\ldots,n\},$$

where $\{\widehat{F}_k\}_{k\in\mathbb{Z}}$ is the sequence of Fourier coefficients of F, that is,

$$\widehat{F}_k=\frac{1}{2\pi}\int_{0}^{2\pi}e^{-k\omega i}F(\omega)\,d\omega,\qquad k\in\mathbb{Z}.$$
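For illustration, the following sketch (with hypothetical helper names fourier_coefficient and block_toeplitz) assembles $T_n(F)$ by approximating the defining integral with the rectangle rule:

```python
import numpy as np

def fourier_coefficient(F, k: int, num_points: int = 4096) -> np.ndarray:
    """Rectangle-rule approximation of the k-th Fourier coefficient of a
    continuous 2*pi-periodic matrix-valued function F."""
    w = np.linspace(0.0, 2.0 * np.pi, num_points, endpoint=False)
    return np.stack([np.exp(-1j * k * wi) * F(wi) for wi in w]).mean(axis=0)

def block_toeplitz(F, n: int, N: int) -> np.ndarray:
    """nN x nN block Toeplitz matrix T_n(F): the (j, k) block is the (j - k)-th
    Fourier coefficient of F."""
    coeffs = {m: fourier_coefficient(F, m) for m in range(-(n - 1), n)}
    T = np.zeros((n * N, n * N), dtype=complex)
    for j in range(n):
        for k in range(n):
            T[j * N:(j + 1) * N, k * N:(k + 1) * N] = coeffs[j - k]
    return T

# Example: a 2x2 trigonometric polynomial of degree 1.
F = lambda w: np.eye(2) + 0.5 * np.exp(1j * w) * np.array([[0.0, 1.0], [0.0, 0.0]])
T3 = block_toeplitz(F, n=3, N=2)
```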
We denote by $C_n(F)$ the $nN\times nN$ block circulant matrix with $N\times N$ blocks defined as:

$$C_n(F)=(V_n\otimes I_N)\operatorname{diag}_{1\leq k\leq n}\!\left(F\!\left(\frac{2\pi(k-1)}{n}\right)\right)(V_n\otimes I_N)^{*}.$$
If , then is the block circulant matrix with the blocks given by:
We denote by the block circulant matrix with the blocks defined as .
If is Hermitian for all (or equivalently, is Hermitian for all ; see, e.g., [6], Theorem 4.4), then denotes . We recall that (see [7], Proposition 3):
2.2. Product of Block Toeplitz Matrices
We begin this subsection with a result on the entries of the block Toeplitz matrices generated by the product of two functions, which is a direct consequence of the Parseval theorem.
Lemma 1.
Consider two continuous -periodic functions and . Let and be the sequences of Fourier coefficients of F and G, respectively. Then:
for all and .
Proof.
See Appendix A. □
We can now give a result on the product of two block Toeplitz matrices when one of them is generated by a trigonometric polynomial. We recall that an $N\times N$ trigonometric polynomial of degree p is a function of the form:

$$F(\omega)=\sum_{k=-p}^{p}A_k e^{k\omega i},\qquad \omega\in\mathbb{R},\tag{2}$$

where $A_k\in\mathbb{C}^{N\times N}$ for all $k\in\{-p,\ldots,p\}$ with $A_{-p}\neq 0_{N\times N}$ or $A_p\neq 0_{N\times N}$. It can be easily proven (see, e.g., [6], Example 4.3) that the sequence of Fourier coefficients of the continuous $2\pi$-periodic function F in Equation (2) is given by:

$$\widehat{F}_k=\begin{cases}A_k & \text{if }|k|\leq p,\\ 0_{N\times N} & \text{otherwise.}\end{cases}$$
Lemma 2.
Let F, G, , and be as in Lemma 1.
- 1.
- If F is a trigonometric polynomial of degree p, then:and:for all and .
- 2.
- If G is a trigonometric polynomial of degree q, then:and:for all and .
- 3.
- If F is a trigonometric polynomial of degree p and G is a trigonometric polynomial of degree q, then:and:for all , where are given by:and:for all and .
Proof.
See Appendix B. □
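A numerical illustration in the spirit of Lemma 2, for two made-up degree-1 trigonometric polynomials: the product $T_n(F)T_n(G)$ differs from $T_n(FG)$ only in corner blocks of fixed size, so the Frobenius norm of the difference does not grow with n. The helper name banded_block_toeplitz and the coefficient values are ours:

```python
import numpy as np

def banded_block_toeplitz(coeffs, n, N):
    """nN x nN block Toeplitz matrix whose (j, k) block is coeffs.get(j - k),
    where coeffs maps offsets to N x N Fourier coefficients."""
    T = np.zeros((n * N, n * N))
    for j in range(n):
        for k in range(n):
            C = coeffs.get(j - k)
            if C is not None:
                T[j * N:(j + 1) * N, k * N:(k + 1) * N] = C
    return T

rng = np.random.default_rng(0)
N = 2
A, B = rng.standard_normal((N, N)), rng.standard_normal((N, N))
cF = {0: np.eye(N), 1: A}                  # F(w) = I + A e^{iw}
cG = {0: np.eye(N), -1: B}                 # G(w) = I + B e^{-iw}
cFG = {0: np.eye(N) + A @ B, 1: A, -1: B}  # (FG)(w) = I + AB + A e^{iw} + B e^{-iw}

for n in (4, 8, 16, 32):
    E = (banded_block_toeplitz(cF, n, N) @ banded_block_toeplitz(cG, n, N)
         - banded_block_toeplitz(cFG, n, N))
    print(n, np.linalg.norm(E))            # constant in n: the error is corner-supported
```

Running this prints the same norm for every n, consistent with the corner-support observation above.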
2.3. Inverse of a Block Toeplitz Matrix
Lemma 3.
Let be a trigonometric polynomial of degree p.
- 1.
- If is invertible for all and is stable (i.e., is invertible for all and is bounded), then:for all .
- 2.
- If is positive definite for all , then:for all .
Proof.
See Appendix C. □
2.4. Block Circulant Matrices
Lemma 4.
Consider . Then:
and:
Moreover, if is an block circulant matrix with blocks, then:
and:
Proof.
See Appendix D. □
Lemma 5.
Let be a trigonometric polynomial of degree p. Then:
for all . Furthermore,
Proof.
See Appendix E. □
3. Several New Results on the Correlation Matrices of Certain Random Vector Processes
Let be a (complex) random N-dimensional vector process, that is, is a (complex) random N-dimensional (column) vector for all . In this section, we study the boundedness of the sequence when is a WSS, MA, or AR vector process, where:
and E denotes expectation.
3.1. WSS Vector Processes
In this subsection, we review the concept of the WSS vector process, and we prove that the sequence is bounded when is a WSS vector process whose power spectral density (PSD) is a trigonometric polynomial.
Definition 1.
Let be continuous and -periodic. A random N-dimensional vector process is said to be WSS with PSD X if it has constant mean (i.e., for all ) and .
Lemma 6.
If is a WSS vector process whose PSD is a trigonometric polynomial, then is bounded.
Proof.
This is a direct consequence of Lemma 5. □
3.2. VMA Processes
In this subsection, we review the concept of the MA vector (VMA) process, and we prove that the sequence is bounded when is a VMA process of finite order.
Definition 2.
A zero-mean random N-dimensional vector process is said to be a VMA process if:
where for all and is a zero-mean WSS N-dimensional vector process whose PSD is an $N\times N$ positive semidefinite matrix Λ. If there exists such that for all , then is called a VMA process of (finite) order q or a VMA(q) process.
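Definition 2 can be made concrete with a short simulation. The sketch below assumes the common VMA convention $x_k=w_k+\sum_{j=1}^{q}\Theta_j w_{k-j}$ (the exact indexing convention of Definition 2 may differ) and uses made-up coefficients; simulate_vma is our name:

```python
import numpy as np

def simulate_vma(Theta, Lam, n, rng):
    """Simulate n samples of x_k = w_k + sum_{j=1}^{q} Theta[j-1] w_{k-j},
    where {w_k} is zero-mean white noise with covariance Lam."""
    q, N = len(Theta), Lam.shape[0]
    L = np.linalg.cholesky(Lam)
    w = (L @ rng.standard_normal((N, n + q))).T  # noise samples with covariance Lam
    x = np.empty((n, N))
    for k in range(n):
        x[k] = w[k + q]
        for j in range(q):
            x[k] += Theta[j] @ w[k + q - 1 - j]
    return x

rng = np.random.default_rng(0)
Theta1 = np.array([[0.5, 0.1], [0.0, 0.4]])      # made-up VMA(1) coefficient
Lam = np.array([[1.0, 0.2], [0.2, 1.0]])
x = simulate_vma([Theta1], Lam, n=1000, rng=rng)
```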
Lemma 7.
If is a VMA process as in Definition 2, then is bounded.
Proof.
See Appendix F. □
3.3. VAR Processes
In this subsection, we review the concept of the AR vector (VAR) process, and we study the boundedness of the sequence when is a VAR process of finite order.
Definition 3.
A zero-mean random N-dimensional vector process is said to be a VAR process if:
where for all and is a zero-mean WSS N-dimensional vector process whose PSD is an $N\times N$ positive definite matrix Λ. If there exists such that for all , then is called a VAR process of (finite) order p or a VAR(p) process.
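As with the VMA case, a short simulation makes Definition 3 concrete. The sketch assumes the common VAR convention $x_k=\sum_{j=1}^{p}\Phi_j x_{k-j}+w_k$ and a stable coefficient matrix, and discards a burn-in period so the output is approximately stationary; simulate_var and the coefficient values are ours:

```python
import numpy as np

def simulate_var(Phi, Lam, n, burn_in=500, rng=None):
    """Simulate n samples of x_k = sum_{j=1}^{p} Phi[j-1] x_{k-j} + w_k,
    where {w_k} is zero-mean white noise with covariance Lam. A burn-in
    period is discarded so the output is approximately stationary."""
    rng = rng or np.random.default_rng(0)
    p, N = len(Phi), Lam.shape[0]
    L = np.linalg.cholesky(Lam)
    x = np.zeros((p + burn_in + n, N))
    for k in range(p, p + burn_in + n):
        x[k] = L @ rng.standard_normal(N)
        for j in range(p):
            x[k] += Phi[j] @ x[k - 1 - j]
    return x[p + burn_in:]

Phi1 = np.array([[0.6, 0.1], [0.0, 0.5]])  # made-up stable VAR(1) coefficient
x = simulate_var([Phi1], np.eye(2), n=1000)
```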
Lemma 8.
Let be a VAR process as in Definition 3. Suppose that is invertible for all and is bounded. Then, is bounded.
Proof.
See Appendix G. □
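The invertibility-and-boundedness hypothesis of Lemma 8 is the usual VAR stability condition. Under the standard convention that $\det\left(I_N-\sum_{j=1}^{p}\Phi_j z^j\right)\neq 0$ for $|z|\leq 1$, it can be checked numerically through the companion matrix; var_is_stable is our name and the coefficient below is made up:

```python
import numpy as np

def var_is_stable(Phi):
    """Return True if all eigenvalues of the VAR(p) companion matrix lie
    strictly inside the unit circle (the usual stability criterion)."""
    p, N = len(Phi), Phi[0].shape[0]
    companion = np.zeros((p * N, p * N))
    companion[:N, :] = np.hstack(Phi)
    if p > 1:
        companion[N:, :-N] = np.eye((p - 1) * N)
    return np.max(np.abs(np.linalg.eigvals(companion))) < 1.0

print(var_is_stable([np.array([[0.6, 0.1], [0.0, 0.5]])]))  # True
```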
4. On the Asymptotic Optimality of a Low-Complexity Coding Strategy for Gaussian Vector Sources
4.1. Low-Complexity Coding Strategy Considered
In [1], Kolmogorov gave a formula for the RDF of a real zero-mean Gaussian N-dimensional vector $\vec{x}$ with a positive definite correlation matrix $E(\vec{x}\vec{x}^{\top})$, namely,

$$R_{\vec{x}}(D)=\frac{1}{N}\sum_{k=1}^{N}\max\left(0,\frac{1}{2}\ln\frac{\lambda_k\left(E(\vec{x}\vec{x}^{\top})\right)}{\theta}\right),$$

where ⊤ stands for the transpose and θ is a real number satisfying:

$$D=\frac{1}{N}\sum_{k=1}^{N}\min\left(\theta,\lambda_k\left(E(\vec{x}\vec{x}^{\top})\right)\right).$$

We recall that $R_{\vec{x}}(D)$ can be thought of as the minimum rate (measured in nats) at which $\vec{x}$ can be encoded (compressed) in order to be able to recover it with a mean squared error (MSE) per dimension no larger than a given distortion D, that is:

$$\frac{1}{N}\operatorname{tr}\left(E\left((\vec{x}-\widehat{\vec{x}})(\vec{x}-\widehat{\vec{x}})^{\top}\right)\right)\leq D,$$

where tr denotes the trace and $\widehat{\vec{x}}$ denotes the estimation of $\vec{x}$.
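The reverse water-filling recipe above can be evaluated numerically. The following sketch (gaussian_vector_rdf is our name) finds θ by bisection and returns the RDF in nats per dimension, assuming D > 0 and a positive definite R:

```python
import numpy as np

def gaussian_vector_rdf(R, D):
    """RDF (nats per dimension) of a real zero-mean Gaussian vector with
    positive definite correlation matrix R, via reverse water-filling:
    bisect for theta with mean(min(theta, lambda_k)) = D, then return
    mean(max(0, 0.5 * ln(lambda_k / theta)))."""
    lam = np.linalg.eigvalsh(R)
    lo, hi = 0.0, float(lam.max())
    for _ in range(200):  # bisection on the water level theta
        theta = 0.5 * (lo + hi)
        if np.minimum(theta, lam).mean() < D:
            lo = theta
        else:
            hi = theta
    return float(np.maximum(0.0, 0.5 * np.log(lam / theta)).mean())

R = np.array([[2.0, 0.5], [0.5, 1.0]])
print(gaussian_vector_rdf(R, D=0.1))
```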
If $0<D\leq\lambda_N\left(E(\vec{x}\vec{x}^{\top})\right)$, an optimal coding strategy to achieve $R_{\vec{x}}(D)$ is to encode the components of $\vec{z}$ separately, where $\vec{z}=U^{\top}\vec{x}$ with U being a real orthogonal eigenvector matrix of $E(\vec{x}\vec{x}^{\top})$ (see [8], Corollary 1). Observe that in order to obtain U, we need to know the correlation matrix $E(\vec{x}\vec{x}^{\top})$. This coding strategy also requires an optimal coding method for real Gaussian random variables.
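A minimal sketch of the decorrelation step of this eigenvector-based strategy, with a made-up correlation matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
R = np.array([[2.0, 0.7], [0.7, 1.0]])      # correlation matrix of x
lam, U = np.linalg.eigh(R)                  # R = U diag(lam) U^T, U orthogonal
x = np.linalg.cholesky(R) @ rng.standard_normal((2, 10000))
z = U.T @ x                                 # decorrelated components
print(np.cov(z))                            # approx diag(lam): encode each row separately
```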
In [4], Theorem 3, we gave a low-complexity coding strategy for any Gaussian N-dimensional vector source . According to that strategy, to encode a finite-length data block of such a source, we first compute the block discrete Fourier transform (DFT) of :
and then, we encode separately (i.e., if n is even, we encode separately, and if n is odd, we encode separately) with:
and:
where denotes the smallest integer greater than or equal to and:
with and being the real part and the imaginary part of a complex number, respectively.
As our coding strategy requires the computation of the block DFT, its computational complexity is $O(nN\log n)$ whenever the fast Fourier transform (FFT) algorithm is used. We recall that the computational complexity of the optimal coding strategy is $O(n^2N^2)$, since it requires multiplying the data block by the transpose of a real orthogonal eigenvector matrix of its correlation matrix. Observe that such an eigenvector matrix also needs to be computed, which further increases the complexity. Hence, the main advantage of our coding strategy is that it notably reduces the computational complexity of coding the data block. Moreover, our coding strategy does not require the knowledge of the whole correlation matrix of the data block; it only requires the knowledge of the second-order moments of the DFT components being encoded.
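A minimal sketch of the first step of our strategy, assuming the block DFT is the unitary DFT applied across the block index (block_dft is our name; the sign convention is an assumption):

```python
import numpy as np

def block_dft(x_blocks):
    """Block DFT of a data block: the unitary DFT applied across the block
    index for each vector component, i.e., y = (V_n Kronecker I_N) x up to
    the DFT sign convention. Cost O(n N log n) via the FFT."""
    n = x_blocks.shape[0]
    return np.fft.fft(x_blocks, axis=0) / np.sqrt(n)

rng = np.random.default_rng(0)
x = rng.standard_normal((1024, 2))  # n = 1024 samples of a 2-dimensional source
y = block_dft(x)
# For real x, the DFT components are conjugate-symmetric, so only about half
# of them need to be encoded, as in the strategy described above.
assert np.allclose(y[1:], np.conj(y[::-1][:-1]))
```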
We finish this subsection by reviewing a result that provides an upper bound for the distance between the RDF of the data block and the rate of our coding strategy (see [4], Theorem 3).
Theorem 1.
Consider . Let be a random N-dimensional vector for all . Suppose that is a real zero-mean Gaussian vector with a positive definite correlation matrix (or equivalently, ). Let be the random vector given by Equation (14). If , then:
where:
4.2. On the Asymptotic Optimality of the Low-Complexity Coding Strategy Considered
In this subsection, we study the asymptotic optimality of our coding strategy for Gaussian vector sources. We begin by presenting a new result that provides a sufficient condition for the source to make such a coding strategy asymptotically optimal.
Theorem 2.
Let be a real zero-mean Gaussian N-dimensional vector process. Suppose that and . If , then:
Hence, if is convergent, then:
Proof.
From Equation (15), we have:
and therefore, Theorem 2 is proven. □
We recall that is the RDF of the source .
In [4], Theorem 4, we gave a more restrictive sufficient condition on the source under which the coding strategy considered is asymptotically optimal. Specifically, in [4], Theorem 4, we proved that Equation (16) holds if is AWSS. However, the convergence speed of the rate of the coding strategy considered (i.e., how fast the rate of the coding strategy tends to the RDF of the AWSS vector source) was not studied in [4]. We now study the convergence speed of the rate of such a coding strategy when it is used to encode the most relevant vector sources, namely WSS vector sources, VMA sources, and VAR sources. It should be mentioned that this convergence speed depends on the sequence whose boundedness is studied in Section 3 for these three types of vector sources.
Theorem 3.
Let be a real zero-mean Gaussian WSS N-dimensional vector process whose PSD X is a trigonometric polynomial. Suppose that (or equivalently, for all ). If , there exists such that:
Proof.
Theorem 1 and Lemma 6 prove Theorem 3. □
Theorem 4.
Let be a VMA(q) process as in Definition 2. Suppose that and is bounded with for all . If is real and Gaussian, and , there exists such that:
Proof.
Since for all , from Equation (A3), we have:
Hence, . Theorem 1 and Lemma 7 prove Theorem 4. □
Theorem 5.
Let be a VAR(p) process as in Definition 3. Suppose that is invertible for all and is bounded. If is real and Gaussian and , there exists such that:
Proof.
As for all , applying Equation (A4) and [6], Theorem 4.3, yields:
Thus, . Theorem 1 and Lemma 8 prove Theorem 5. □
4.3. On How the Low-Complexity Coding Strategy Considered Performs under Perturbations
In this subsection, we study how the low-complexity coding strategy considered performs when it is used to encode a perturbed version, , of a WSS, MA, or AR vector source . Observe that if is bounded, from Equation (9), we conclude that our coding strategy can also be used to optimally encode , and the convergence speed of the rate remains unaltered.
We now present three numerical examples that show how the coding strategy considered performs in the presence of a perturbation. In all of them, and:
Obviously, is bounded since for all . The three vector sources considered in our numerical examples are the zero-mean WSS vector source in [9], Section 4, the VMA(1) source in [10], Example 2.1, and the VAR(1) source in [10], Example 2.3. In [9], Section 4, the Fourier coefficients of the PSD X are:
and with . In [10], Example 2.1, and are given by:
and:
respectively. In [10], Example 2.3, and are given by Equations (18) and (19), respectively.
Figure 1a, Figure 2a and Figure 3a show the rate of the coding strategy considered and the corresponding RDF for the three vector sources, assuming that they are Gaussian. Figure 1b, Figure 2b and Figure 3b show the same quantities for the perturbed versions of these three vector sources. In Figure 1, Figure 2 and Figure 3, and . The figures show that the rate of the low-complexity coding strategy considered tends to the RDF of the source even in the presence of a perturbation.
Figure 1.
Rates for the considered wide sense stationary (WSS) vector source: (a) without perturbation and (b) with perturbation.
Figure 2.
Rates for the considered VMA(1) source: (a) without perturbation and (b) with perturbation.
Figure 3.
Rates for the considered VAR(1) source: (a) without perturbation and (b) with perturbation.
5. Conclusions
In [4], we proposed a low-complexity coding strategy to encode finite-length data blocks of any Gaussian vector source. In this paper, we characterized the convergence speed of the rate of our coding strategy when it is used to encode the most relevant vector sources, namely WSS, MA, and AR vector sources. This means that the rate of our coding strategy is close to the RDF of the source even if the length n of the data blocks is relatively small. Therefore, we conclude that our coding strategy is not only low-complexity and asymptotically optimal, but also low-latency. These three features make our coding strategy very useful in practical coding applications.
Author Contributions
Authors are listed in order of their degree of involvement in the work, with the most active contributors listed first. J.G.-G. conceived the research question. All authors were involved in the research and wrote the paper. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported in part by the Spanish Ministry of Science and Innovation through the ADELE project (PID2019-104958RB-C44).
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proof of Lemma 1
Proof.
Fix and . As and are continuous and -periodic, is also continuous and -periodic. Applying the Parseval theorem (see, e.g., [11], p. 191) yields:
for all and . □
Appendix B. Proof of Lemma 2
Proof.
(1) Fix . As with , from Lemma 1, we have:
and consequently, Equation (3) holds for all . Applying Equation (3), the Schwarz inequality (see, e.g., [11], p. 15), the Parseval theorem for continuous matrix-valued functions (see, e.g., [6], p. 208), and the well-known formula for the partial sums of the arithmetic series yields:
and therefore, Equation (4) is proven.
(2) Fix . As with , from Lemma 1, we obtain:
and hence, Equation (5) holds for all . Since G is a trigonometric polynomial of degree q, is also a trigonometric polynomial of degree q, where for all . Applying [6], Lemma 4.2, and Equation (4) yields:
and thus, Equation (6) is proven.
(3) Fix . As with and with , from Lemma 1, we obtain:
for all . Observe that:
for all and . □
Appendix C. Proof of Lemma 3
Proof.
(1) Since is continuous and -periodic, is also continuous and -periodic, where for all . As:
for all , Equation (4) proves Assertion 1 of Lemma 3.
(2) Since is positive definite for all (or equivalently, is Hermitian and for all ), is invertible for all (or equivalently, is non-zero for all ), is Hermitian, and for all (see Equation (1)). As is positive definite for all , is also positive definite for all . Therefore,
for all . Assertion 2 of Lemma 3 can now be obtained from Assertion 1 of Lemma 3. □
Appendix D. Proof of Lemma 4
Proof.
Consider . As is unitary, is also unitary for all . Consequently, since the Frobenius norm is unitarily invariant, we have
and:
If is an block circulant matrix with blocks, then (see, e.g., [6], Lemma 5.1, or [12], Lemma 3) there exist such that:
Therefore,
and combining Equations (9) and (10), we obtain Equation (11). □
Appendix E. Proof of Lemma 5
Appendix F. Proof of Lemma 7
Proof.
From Equation (12), we obtain:
with for all . Therefore, applying [6], Lemma 4.2, yields:
Hence, using Equation (9), we have:
for all . Thus, to finish the proof, we only need to show that and are bounded. As and are trigonometric polynomials, from Equation (7), we obtain that is bounded. Since is a trigonometric polynomial, applying Lemma 5, we conclude that is bounded. □
Appendix G. Proof of Lemma 8
Proof.
As is positive definite, if and , then:
whenever , or equivalently, . Since is positive definite for all , we have:
From Equation (13), we obtain:
Consequently,
Therefore, as for all , we have:
Hence, applying Equation (11) and [6], Lemma 4.2, yields:
for all . Thus, to finish the proof, we only need to show that , , , , , and are bounded. From [6], Theorem 4.3, we obtain that and are bounded. Applying [6], Lemma 5.2, we have that is bounded. Since and are trigonometric polynomials, from Equation (7), we obtain that is bounded. As is a trigonometric polynomial, applying Lemma 5, we have that is bounded. Since is positive definite for all , is also positive definite for all , and consequently, from Equation (8), we conclude that is bounded. □
Appendix H. A Statistical Signal Processing Application on Filtering WSS Vector Processes
Consider a zero-mean WSS M-dimensional vector process . Let Y be the PSD of a zero-mean WSS N-dimensional vector process with . Assume that those two processes are jointly WSS with joint PSD Z, that is is a continuous -periodic function satisfying that .
For every , if is an estimation of from of the form:
with , the MSE per sample is:
and the minimum MSE (MMSE) is given by , where is the Wiener filter, i.e.,
In [13], Equation (6), it was shown that there is no difference in the MSE per sample for large enough n if we substitute the optimal filter by , where , that is,
Obviously, the computational complexity of the operation (A5) is notably reduced when this substitution is applied and the FFT algorithm is used. Specifically, for fixed M and N, the computational complexity is reduced from $O(n^2)$ to $O(n\log n)$.
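To see where this complexity reduction comes from, the sketch below applies a circulant filter both as a dense matrix and through its FFT diagonalization; it is a scalar-block illustration (M = N = 1) with made-up data, not the Wiener filter of [13]:

```python
import numpy as np

n = 1024
rng = np.random.default_rng(0)
c = rng.standard_normal(n)  # first column of a circulant filter matrix C
y = rng.standard_normal(n)  # observed samples

# Direct application: build C explicitly and multiply, O(n^2).
C = np.stack([np.roll(c, k) for k in range(n)], axis=1)
x_direct = C @ y

# FFT application: C = F* diag(fft(c)) F, so C y = ifft(fft(c) * fft(y)), O(n log n).
x_fft = np.real(np.fft.ifft(np.fft.fft(c) * np.fft.fft(y)))
assert np.allclose(x_direct, x_fft)
```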
Here, we study the convergence speed of the sequence (i.e., how fast this sequence tends to zero) by assuming that Y and Z are trigonometric polynomials. Applying [13], p. 11, and Lemma 5, we conclude that there exists such that:
where and . Therefore,
Equation (A6) was proven in [2] for the case .
References
- Kolmogorov, A.N. On the Shannon theory of information transmission in the case of continuous signals. IRE Trans. Inf. Theory 1956, IT-2, 102–108. [Google Scholar] [CrossRef]
- Pearl, J. On coding and filtering stationary signals by discrete Fourier transforms. IEEE Trans. Inf. Theory 1973, 19, 229–232. [Google Scholar] [CrossRef]
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Insausti, X. Upper bounds for the rate distortion function of finite-length data blocks of Gaussian WSS sources. Entropy 2017, 19, 554. [Google Scholar] [CrossRef]
- Zárraga-Rodríguez, M.; Gutiérrez-Gutiérrez, J.; Insausti, X. A low-complexity and asymptotically optimal coding strategy for Gaussian vector sources. Entropy 2019, 21, 965. [Google Scholar] [CrossRef]
- Gray, R.M. Toeplitz and circulant matrices: A review. Found. Trends Commun. Inf. Theory 2006, 2, 155–239. [Google Scholar] [CrossRef]
- Gutiérrez-Gutiérrez, J.; Crespo, P.M. Block Toeplitz matrices: Asymptotic results and applications. Found. Trends Commun. Inf. Theory 2011, 8, 179–257. [Google Scholar] [CrossRef]
- Gutiérrez-Gutiérrez, J. A modified version of the Pisarenko method to estimate the power spectral density of any asymptotically wide sense stationary vector process. Appl. Math. Comput. 2019, 362, 124526. [Google Scholar] [CrossRef]
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Villar-Rosety, F.M.; Insausti, X. Rate-distortion function upper bounds for Gaussian vectors and their applications in coding AR sources. Entropy 2018, 20, 399. [Google Scholar] [CrossRef] [PubMed]
- Gutiérrez-Gutiérrez, J.; Iglesias, I.; Podhorski, A. Geometric MMSE for one-sided and two-sided vector linear predictors: From the finite-length case to the infinite-length case. Signal Process. 2011, 91, 2237–2245. [Google Scholar] [CrossRef]
- Reinsel, G.C. Elements of Multivariate Time Series Analysis; Springer: Berlin/Heidelberg, Germany, 1993. [Google Scholar]
- Rudin, W. Principles of Mathematical Analysis; McGraw-Hill: New York, NY, USA, 1976. [Google Scholar]
- Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and multivariate ARMA processes. IEEE Trans. Inf. Theory 2011, 57, 5444–5454. [Google Scholar] [CrossRef]
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Insausti, X.; Hogstad, B.O. On the complexity reduction of coding WSS vector processes by using a sequence of block circulant matrices. Entropy 2017, 19, 95. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).