On the Complexity Reduction of Coding WSS Vector Processes by Using a Sequence of Block Circulant Matrices

Jesús Gutiérrez-Gutiérrez; Marta Zárraga-Rodríguez; Xabier Insausti; Bjørn O. Hogstad

doi:10.3390/e19030095

,

and

¹

CEIT, Manuel Lardizábal 15, 20018 San Sebastián, Spain

²

TECNUN, University of Navarra, Manuel Lardizábal 13, 20018 San Sebastián, Spain

³

Department of Mathematical Sciences, Norwegian University of Science and Technology, Teknologiveien 22, 2815 Gjøvik, Norway

^*

Author to whom correspondence should be addressed.

Entropy2017, 19(3), 95;https://doi.org/10.3390/e19030095

This article belongs to the Section Information Theory, Probability and Statistics

Version Notes

Order Reprints

Abstract

In the present paper, we obtain a result on the rate-distortion function (RDF) of wide sense stationary (WSS) vector processes that allows us to reduce the complexity of coding those processes. To achieve this result, we propose a sequence of block circulant matrices. In addition, we use the proposed sequence to reduce the complexity of filtering WSS vector processes.

Keywords:

rate-distortion function; source coding; WSS vector process; block circulant matrix; filtering

1. Introduction

Applications where Toeplitz matrices arise commonly involve computation of inverses, products, and eigenvalues. Although inverses and products of Toeplitz matrices are not in general Toeplitz, inverses and products of circulant matrices are circulant. Moreover, unlike for Toeplitz matrices, an eigenvalue decomposition is known for any circulant matrix. Hence, the substitution of Toeplitz matrices by circulant matrices leads to a notable reduction of the computational complexity in those applications.

A sequence of Toeplitz matrices

{T_{n} (f)}

generated by a function f can be found in any information-theoretic or statistical signal processing application in which a wide sense stationary (WSS) process appears. In those applications,

{T_{n} (f)}

is the sequence of correlation matrices of the WSS process and the function f is its power spectral density (PSD).

The substitution of the sequence

{T_{n} (f)}

by a sequence of circulant matrices can be done when both sequences are asymptotically equivalent. In [1], Gray gave an example of such a sequence of circulant matrices

{C_{n} (f)}

, where each circulant matrix

C_{n} (f)

depends on the entire sequence

{T_{n} (f)}

. However, in practical situations, what we usually know is

{T_{n} (f)}_{n \leq n_{0}}

for certain natural number

n_{0}

. For those practical situations, Pearl presented in [2] a different sequence of circulant matrices

{{\hat{C}}_{n} (f)}

, which is also asymptotically equivalent to the sequence of Toeplitz matrices

{T_{n} (f)}

, but this satisfies the condition that each circulant matrix

{\hat{C}}_{n} (f)

only depends on the corresponding Toeplitz matrix

T_{n} (f)

.

In the present paper, the sequence

{{\hat{C}}_{n} (f)}

is generalized to block circulant matrices. This sequence of block circulant matrices is used to obtain a result on the rate-distortion function (RDF) of WSS vector processes that allows us to reduce the complexity of coding those processes. In addition, we use that sequence to reduce the complexity of filtering WSS vector processes.

The paper is organized as follows. In Section 2, we set up notation and review the mathematical definitions and results used in the rest of the paper. In Section 3, we define the sequence of block circulant matrices considered in this paper and show its main properties. Finally, in Section 4 and Section 5, we use this sequence of block circulant matrices to reduce the complexity of coding and filtering WSS vector processes, respectively.

2. Preliminaries

2.1. Notation

In this paper,

N

,

Z

,

R

, and

C

denote the set of natural numbers (i.e., the set of positive integers), the set of integer numbers, the set of (finite) real numbers, and the set of (finite) complex numbers, respectively. If

m, n \in N

, then

C^{m \times n}

,

0_{m \times n}

, and

I_{n}

are the set of all

m \times n

complex matrices, the

m \times n

zero matrix, and the

n \times n

identity matrix, respectively. The symbol ∗ denotes conjugate transpose, E stands for expectation,

i

is the imaginary unit,

tr

denotes trace, δ stands for the Kronecker delta, ⊗ is the Kronecker product,

λ_{k} (A)

,

k \in {1, \dots, n}

, are the eigenvalues of an

n \times n

Hermitian matrix A arranged in decreasing order, and

σ_{k} (B)

,

k \in {1, \dots, n}

, are the singular values of an

n \times n

matrix B arranged in decreasing order.

Let

{x_{n} : n \in N}

be a random N-dimensional vector process, i.e.,

x_{n}

is a random (column) vector of dimension N for all

n \in N

. We denote by

x_{n : 1}

the random vector of dimension

n N

given by

x_{n : 1} : = (\begin{matrix} x_{n} \\ x_{n - 1} \\ x_{n - 2} \\ ⋮ \\ x_{1} \end{matrix}), n \in N .

2.2. Block Toeplitz Matrices

We first review the concept of block Toeplitz matrix.

Definition 1.

An

m \times n

block Toeplitz matrix with

M \times N

blocks is an

m M \times n N

matrix of the form

(\begin{matrix} F_{0} & F_{- 1} & F_{- 2} & \dots & F_{1 - n} \\ F_{1} & F_{0} & F_{- 1} & \dots & F_{2 - n} \\ F_{2} & F_{1} & F_{0} & \dots & F_{3 - n} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ F_{m - 1} & F_{m - 2} & F_{m - 3} & \dots & F_{0} \end{matrix}),

where

F_{k} \in C^{M \times N}

with

k \in {1 - n, \dots, m - 1}

.

Consider a matrix-valued function of a real variable

F : R \to C^{M \times N}

, which is continuous and

2 π

-periodic. For every

n \in N

, we denote by

T_{n} (F)

the

n \times n

block Toeplitz matrix with

M \times N

blocks given by

T_{n} (F) : = {(F_{j - k})}_{j, k = 1}^{n},

where

{F_{k}}_{k \in Z}

is the sequence of Fourier coefficients of F:

F_{k} = \frac{1}{2 π} \int_{0}^{2 π} e^{- k ω i} F (ω) d ω \forall k \in Z .

It should be mentioned that

T_{n} (F)

is Hermitian for all

n \in N

if and only if

F (ω)

is Hermitian for all

ω \in R

(see [3] (Theorem 4.4.1)). Furthermore, in this case, from [3] (Theorem 4.4.2) (it was previously given in [4] (p. 5674) but without a proof) and [5] (Corollary VI.1.6), we obtain

min_{ω \in [0, 2 π]} λ_{N} (F (ω)) = inf (F) \leq λ_{n N} (T_{n} (F)) \leq λ_{1} (T_{n} (F)) \leq sup (F) = max_{ω \in [0, 2 π]} λ_{1} (F (ω)) \forall n \in N .

(1)

2.3. Block Circulant Matrices

We first review the concept of a block circulant matrix.

Definition 2.

An

n \times n

block circulant matrix with

M \times N

blocks is an

n \times n

block Toeplitz matrix with

M \times N

blocks of the form

(\begin{matrix} C_{0} & C_{- 1} & C_{- 2} & \dots & C_{1 - n} \\ C_{1 - n} & C_{0} & C_{- 1} & \dots & C_{2 - n} \\ C_{2 - n} & C_{1 - n} & C_{0} & \dots & C_{3 - n} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ C_{- 1} & C_{- 2} & C_{- 3} & \dots & C_{0} \end{matrix}),

where

C_{k} \in C^{M \times N}

with

k \in {1 - n, \dots, 0}

.

The next result [6] (Lemma 3) characterizes block circulant matrices.

Lemma 1.

Let

C \in C^{n M \times n N}

; then, the following statements are equivalent:

1.: C is an $n \times n$ block circulant matrix with $M \times N$ blocks.
2.: There exist $A_{1}, \dots, A_{n} \in C^{M \times N}$ such that

$C = (V_{n} \otimes I_{M}) diag (A_{1}, \dots, A_{n}) {(V_{n} \otimes I_{N})}^{*},$

where $diag (A_{1}, \dots, A_{n}) = {diag}_{1 \leq k \leq n} (A_{k}) : = {(δ_{j, k} A_{j})}_{j, k = 1}^{n}$ and $V_{n}$ is the $n \times n$ Fourier unitary matrix

${[V_{n}]}_{j, k} : = \frac{1}{\sqrt{n}} e^{- \frac{2 π (j - 1) (k - 1)}{n} i}, j, k \in {1, \dots, n} .$

2.4. Asymptotically Equivalent Sequences of Matrices

We now review the concept of asymptotically equivalent sequences of matrices introduced in [6] (Definition 2), which is an extension of the original concept given by Gray in [7] to sequences of non-square matrices.

Definition 3.

Consider two strictly increasing sequences of natural numbers

{d_{n}^{(1)}}

and

{d_{n}^{(2)}}

. Let

A_{n}

and

B_{n}

be

d_{n}^{(1)} \times d_{n}^{(2)}

matrices for all

n \in N

. We say that the sequences

{A_{n}}

and

{B_{n}}

are asymptotically equivalent, and write

{A_{n}} \sim {B_{n}}

, if

{∥ A_{n} ∥_{2}}

and

{∥ B_{n} ∥_{2}}

are bounded, and

lim_{n \to \infty} \frac{∥ A_{n} - B_{n} ∥_{F}}{\sqrt{n}} = 0,

where

{∥ \cdot ∥}_{2}

and

{∥ \cdot ∥}_{F}

are the spectral norm and the Frobenius norm, respectively (The definition and the main properties of these two matrix norms can be found, e.g., in [3] (Section 2.1)).

We finish this section by reviewing [6] (Lemma 4).

Lemma 2.

Let

F : R \to C^{M \times N}

be continuous and

2 π

-periodic. Then,

{T_{n} (F)} \sim {C_{n} (F)},

where

C_{n} (F)

is the

n \times n

block circulant matrix with

M \times N

blocks given by

C_{n} (F) : = (V_{n} \otimes I_{M}) diag (F (0), F (\frac{2 π}{n}), \dots, F (\frac{2 π (n - 1)}{n})) {(V_{n} \otimes I_{N})}^{*} \forall n \in N .

When

M = N = 1

and F is in the Wiener class (We recall that a function

F : R \to C^{M \times N}

is said to be in the Wiener class if it is continuous and

2 π

-periodic, and it satisfies

\sum_{k = - \infty}^{\infty} {∥ F_{k} ∥}_{F} < \infty

, see, e.g., [3] (Appendix B)),

{C_{n} (F)}

is the sequence of circulant matrices defined by Gray in [1] (Equation (4.32)), see [3] (Lemma 5.2).

3. Sequence of Block Circulant Matrices Considered

We begin this section by presenting a sequence of block circulant matrices

{{\hat{C}}_{n} (F)}

, where each block circulant matrix

{\hat{C}}_{n} (F)

only depends on the corresponding block Toeplitz matrix

T_{n} (F)

.

Definition 4.

Let

F : R \to C^{M \times N}

be continuous and

2 π

-periodic. For every

n \in N

, we define

{{\hat{C}}_{n} (F)}

as the

n \times n

block circulant matrix with

M \times N

blocks given by

{\hat{C}}_{n} (F) : = (V_{n} \otimes I_{M}) {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) {(V_{n} \otimes I_{N})}^{*} .

When

M = N = 1

and the matrices

T_{n} (F)

are real and symmetric,

{{\hat{C}}_{n} (F)}

is the sequence of circulant matrices defined by Pearl in [2].

The following result gives an expression for the blocks of the block circulant matrix

{\hat{C}}_{n} (F)

.

Lemma 3.

Consider

n \in N

and let

F : R \to C^{M \times N}

be continuous and

2 π

-periodic.

1.: If $j, k \in {1, \dots, n}$ , then

${[{\hat{C}}_{n} (F)]}_{j, k} = (1 - \frac{| j - k |}{n}) F_{j - k} + \frac{| j - k |}{n} F_{j - k - n sgn (j - k)},$

where $sgn$ is the sign function.
2.: If $T_{n} (F)$ is real, then ${\hat{C}}_{n} (F)$ is real.

Proof.

(1) Fix

j, k \in {1, \dots, n}

. For convenience, we denote

{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k})

by

D_{n} (F)

. We have

\begin{matrix} {[{\hat{C}}_{n} (F)]}_{j, k} & = & \sum_{h = 1}^{n} {[V_{n} \otimes I_{M}]}_{j, h} {[D_{n} (F) {(V_{n} \otimes I_{N})}^{*}]}_{h, k} = \sum_{h = 1}^{n} {[V_{n}]}_{j, h} I_{M} \sum_{l = 1}^{n} {[D_{n} (F)]}_{h, l} {[{(V_{n} \otimes I_{N})}^{*}]}_{l, k} \\ = & \sum_{h = 1}^{n} {[V_{n}]}_{j, h} {[D_{n} (F)]}_{h, h} {[V_{n}^{*} \otimes I_{N}^{*}]}_{h, k} = \sum_{h = 1}^{n} {[V_{n}]}_{j, h} {[D_{n} (F)]}_{h, h} {[V_{n}^{*}]}_{h, k} I_{N} \\ = & \sum_{h = 1}^{n} {[V_{n}]}_{j, h} \bar{{[V_{n}]}_{k, h}} {[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{h, h} \\ = & \sum_{h = 1}^{n} {[V_{n}]}_{j, h} \bar{{[V_{n}]}_{k, h}} \sum_{l = 1}^{n} {[{(V_{n} \otimes I_{M})}^{*}]}_{h, l} {[T_{n} (F) (V_{n} \otimes I_{N})]}_{l, h} \\ = & \sum_{h = 1}^{n} {[V_{n}]}_{j, h} \bar{{[V_{n}]}_{k, h}} \sum_{l = 1}^{n} {[V_{n}^{*}]}_{h, l} I_{M} \sum_{s = 1}^{n} {[T_{n} (F)]}_{l, s} {[V_{n} \otimes I_{N}]}_{s, h} \\ = & \sum_{h = 1}^{n} {[V_{n}]}_{j, h} \bar{{[V_{n}]}_{k, h}} \sum_{l = 1}^{n} \bar{{[V_{n}]}_{l, h}} \sum_{s = 1}^{n} F_{l - s} {[V_{n}]}_{s, h} I_{N} = \sum_{h, l, s = 1}^{n} {[V_{n}]}_{j, h} \bar{{[V_{n}]}_{k, h}} \bar{{[V_{n}]}_{l, h}} {[V_{n}]}_{s, h} F_{l - s} \\ = & \frac{1}{n^{2}} \sum_{h, l, s = 1}^{n} e^{- \frac{2 π (j - 1) (h - 1)}{n} i} e^{\frac{2 π (k - 1) (h - 1)}{n} i} e^{\frac{2 π (l - 1) (h - 1)}{n} i} e^{- \frac{2 π (s - 1) (h - 1)}{n} i} F_{l - s} \\ = & \frac{1}{n^{2}} \sum_{h, l, s = 1}^{n} e^{\frac{2 π (h - 1) (k - j + l - s)}{n} i} F_{l - s} = \frac{1}{n^{2}} \sum_{l, s = 1}^{n} \sum_{h = 1}^{n} {(e^{\frac{2 π (k - j + l - s)}{n} i})}^{h - 1} F_{l - s} \\ = & \frac{1}{n^{2}} (\sum_{\begin{matrix} l, s = 1 \\ k - j + l - s \in n Z \end{matrix}}^{n} n F_{l - s} + \sum_{\begin{matrix} l, s = 1 \\ k - j + l - s \notin n Z \end{matrix}}^{n} \frac{1 - {(e^{\frac{2 π (k - j + l - s)}{n} i})}^{n}}{1 - e^{\frac{2 π (k - j + l - s)}{n} i}} F_{l - s}) = \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ k - j + l - s \in n Z \end{matrix}}^{n} F_{l - s}, \end{matrix}

where

\bar{z}

denotes the conjugate of

z \in C

.

If

j = k

, then

{[{\hat{C}}_{n} (F)]}_{j, k} = \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ l - s = 0 \end{matrix}}^{n} F_{l - s} = \frac{1}{n} \sum_{l = 1}^{n} F_{0} = F_{0} .

If

j > k

, then

\begin{matrix} {[{\hat{C}}_{n} (F)]}_{j, k} & = & \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ k - j + l - s \in {- n, 0} \end{matrix}}^{n} F_{l - s} = \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ l - s = j - k \end{matrix}}^{n} F_{l - s} + \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ l - s = j - k - n \end{matrix}}^{n} F_{l - s} \\ = & \frac{1}{n} (n - | j - k |) F_{j - k} + \frac{1}{n} (n - | j - k - n |) F_{j - k - n} = (1 - \frac{| j - k |}{n}) F_{j - k} + \frac{j - k}{n} F_{j - k - n} . \end{matrix}

If

j < k

, then

\begin{matrix} {[{\hat{C}}_{n} (F)]}_{j, k} & = & \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ k - j + l - s \in {0, n} \end{matrix}}^{n} F_{l - s} = \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ l - s = j - k \end{matrix}}^{n} F_{l - s} + \frac{1}{n} \sum_{\begin{matrix} l, s = 1 \\ l - s = j - k + n \end{matrix}}^{n} F_{l - s} \\ = & \frac{1}{n} (n - | j - k |) F_{j - k} + \frac{1}{n} (n - | j - k + n |) F_{j - k + n} = (1 - \frac{| j - k |}{n}) F_{j - k} + \frac{k - j}{n} F_{j - k + n} . \end{matrix}

(2) It is a direct consequence of Assertion (1). ☐

When

M = N = 1

and the matrices

T_{n} (F)

are real and symmetric, from Lemma 3, we obtain [2] (Equation (7)).

We now show that the block Toeplitz matrix

T_{n} (F)

can be approximated by the block circulant matrix

{\hat{C}}_{n} (F)

for large n.

Lemma 4.

If

F : R \to C^{M \times N}

is continuous and

2 π

-periodic, then

1.: $∥ {\hat{C}}_{n} {(F) ∥}_{2} \leq {∥ T_{n} (F) ∥}_{2} \leq σ_{1} (F)$ for all $n \in N$ , where $σ_{1} (F) : = {sup}_{ω \in [0, 2 π]} σ_{1} (F (ω)) < \infty$ .
2.: ${T_{n} (F)} \sim {{\hat{C}}_{n} (F)}$ .

Proof.

(1) From [3] (Theorem 4.3)

∥ T_{n} {(F) ∥}_{2} \leq σ_{1} (F)

for all

n \in N

. Consequently, to prove Assertion (1), we only need to show that

∥ {\hat{C}}_{n} {(F) ∥}_{2} \leq {∥ T_{n} (F) ∥}_{2}

for all

n \in N

. Fix

n \in N

. As the spectral norm is unitarily invariant (see, e.g., [3] (Section 2.1)) and

V_{n} \otimes I_{M}

and

{(V_{n} \otimes I_{N})}^{*}

are unitary matrices, we have

\begin{matrix} ∥ {\hat{C}}_{n} {(F) ∥}_{2} & = {∥{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k})∥}_{2} \\ = \max_{1 \leq k \leq n} {{∥{[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}∥}_{2}} . \end{matrix}

Consider

k_{0} \in {1, \dots, n}

and

x_{0} \in C^{N \times 1}

satisfying

∥ {\hat{C}}_{n} {(F) ∥}_{2} = {∥{[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k_{0}, k_{0}}∥}_{2} = \frac{{∥{[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k_{0}, k_{0}} x_{0}∥}_{2}}{∥ x_{0} ∥_{2}} .

Let

{\hat{x}}_{0} \in C^{n N \times 1}

with

{[{\hat{x}}_{0}]}_{j, 1} = δ_{j, k_{0}} x_{0}

for all

j \in {1, \dots, n}

, then

\begin{matrix} 0 \leq ∥ {\hat{C}}_{n} {(F) ∥}_{2} & \leq & \frac{{∥{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N}) {\hat{x}}_{0}∥}_{2}}{{∥x_{0}∥}_{2}} \\ = & \frac{{∥{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N}) {\hat{x}}_{0}∥}_{2}}{{∥{\hat{x}}_{0}∥}_{2}} \leq {∥{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})∥}_{2} = {∥T_{n} (F)∥}_{2} . \end{matrix}

(2) From Lemma 1,

{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N})

is an

n \times n

block diagonal matrix with

M \times N

blocks:

{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N}) = {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) .

Therefore,

C_{n} (F) = (V_{n} \otimes I_{M}) {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) {(V_{n} \otimes I_{N})}^{*} .

Hence, since the Frobenius norm is unitarily invariant (see, e.g., [3] (Section 2.1)), one obtains

\begin{matrix} ∥ C_{n} (F) - {\hat{C}}_{n} {(F) ∥}_{F} \\ = ∥(V_{n} \otimes I_{M}) ({diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) \\ {- {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k})) {(V_{n} \otimes I_{N})}^{*}∥}_{F} \\ = {∥({diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} C_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) - {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}))∥}_{F} \\ = {∥{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} (C_{n} (F) - T_{n} (F)) (V_{n} \otimes I_{N})]}_{k, k})∥}_{F} \\ \leq {∥{(V_{n} \otimes I_{M})}^{*} (C_{n} (F) - T_{n} (F)) (V_{n} \otimes I_{N})∥}_{F} = {∥C_{n} (F) - T_{n} (F)∥}_{F} = {∥T_{n} (F) - C_{n} (F)∥}_{F} . \end{matrix}

Thus, from Lemma 2, we conclude that

0 \leq \frac{∥ T_{n} (F) - {\hat{C}}_{n} {(F) ∥}_{F}}{\sqrt{n}} \leq \frac{{∥T_{n} (F) - C_{n} (F)∥}_{F} + {∥ C_{n} (F) - {\hat{C}}_{n} (F) ∥}_{F}}{\sqrt{n}} \leq 2 \frac{{∥T_{n} (F) - C_{n} (F)∥}_{F}}{\sqrt{n}} \to 0 .

☐

We now show that

{\hat{C}}_{n} (F)

keeps several properties of

T_{n} (F)

for the case in which

M = N

.

Lemma 5.

Consider

n \in N

and let

F : R \to C^{N \times N}

be continuous and

2 π

-periodic.

1.: If $T_{n} (F)$ is Hermitian, then ${\hat{C}}_{n} (F)$ is Hermitian.
2.: If $T_{n} (F)$ is real and symmetric, then ${\hat{C}}_{n} (F)$ is real and symmetric.
3.: If $T_{n} (F)$ is positive semidefinite, then ${\hat{C}}_{n} (F)$ is positive semidefinite.
4.: If $T_{n} (F)$ is positive definite, then ${\hat{C}}_{n} (F)$ is positive definite.

Proof.

(1) As

T_{n} (F)

is Hermitian,

{(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N})

is Hermitian. Therefore,

{[{(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}

is Hermitian for all

k \in {1, \dots, n}

, and, consequently,

{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k})

is also Hermitian. Hence,

{\hat{C}}_{n} (F) = (V_{n} \otimes I_{N})

{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) {(V_{n} \otimes I_{N})}^{*}

is Hermitian.

(2) If

T_{n} (F)

is real and symmetric, then

T_{n} (F)

is Hermitian. Applying Lemma 3 and Assertion (1),

{\hat{C}}_{n} (F)

is real and Hermitian, and hence,

{\hat{C}}_{n} (F)

is real and symmetric.

(3) For every

k \in {1, \dots, n}

, let

u_{k} \in C^{n N \times N}

with

{[u_{k}]}_{j, 1} = δ_{j, k} I_{N}

for all

j \in {1, \dots, n}

. If

x \in C^{n N \times 1},

then

\begin{matrix} x^{*} {\hat{C}}_{n} (F) x & = & y^{*} {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N})]}_{k, k}) y \\ = & y^{*} {diag}_{1 \leq k \leq n} (u_{k}^{*} {(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N}) u_{k}) y \\ = & \sum_{k = 1}^{n} {[y]}_{k, 1}^{*} u_{k}^{*} {(V_{n} \otimes I_{N})}^{*} T_{n} (F) (V_{n} \otimes I_{N}) u_{k} {[y]}_{k, 1} = \sum_{k = 1}^{n} z_{k}^{*} T_{n} (F) z_{k}, \end{matrix}

(2)

where

y = {(V_{n} \otimes I_{N})}^{*} x

and

z_{k} = (V_{n} \otimes I_{N}) u_{k} {[y]}_{k, 1}

for all

k \in {1, \dots, n}

. Since

T_{n} (F)

is positive semidefinite,

z_{k}^{*} T_{n} (F) z_{k} \geq 0

for all

k \in {1, \dots, n}

, and, thus,

x^{*} {\hat{C}}_{n} (F) x \geq 0

.

(4) Consider

x \in C^{n N \times 1}

. Since

T_{n} (F)

is positive definite, it is positive semidefinite, and from Assertion (3), we have

x^{*} {\hat{C}}_{n} (F) x \geq 0

. Suppose that

x^{*} {\hat{C}}_{n} (F) x = 0

. As

T_{n} (F)

is positive definite, applying Assertion (2) yields

z_{k} = 0_{n N \times 1}

for all

k \in {1, \dots, n}

. Consequently,

u_{k} {[y]}_{k, 1} = 0_{n N \times 1}

for all

k \in {1, \dots, n}

, and, therefore,

{[y]}_{k, 1} = 0_{N \times 1}

for all

k \in {1, \dots, n}

. Hence,

y = 0_{n N \times 1}

, and thus,

x = 0_{n N \times 1}

. ☐

Finally, we show that Assertion (1) is also true if

T_{n} (F)

is replaced by

{\hat{C}}_{n} (F)

.

Lemma 6.

Let

F : R \to C^{N \times N}

be continuous and

2 π

-periodic. If

F (ω)

is Hermitian for all

ω \in R,

then

inf (F) \leq λ_{n N} ({\hat{C}}_{n} (F)) \leq λ_{1} ({\hat{C}}_{n} (F)) \leq sup (F) \forall n \in N .

Proof.

Applying Lemma 5

{\hat{C}}_{n} (F)

is Hermitian for all

n \in N

. Fix

n \in N

and

x \in C^{n N \times 1}

. From Assertion (2), we have

x^{*} {\hat{C}}_{n} (F) x = \sum_{k = 1}^{n} z_{k}^{*} T_{n} (F) z_{k} .

Suppose that

x

is an eigenvector of

{\hat{C}}_{n} (F)

with

{∥ x ∥}_{2} = 1

. Consequently,

λ_{j} ({\hat{C}}_{n} (F)) = λ_{j} ({\hat{C}}_{n} (F)) x^{*} x = x^{*} λ_{j} ({\hat{C}}_{n} (F)) x = x^{*} {\hat{C}}_{n} (F) x = \sum_{k = 1}^{n} z_{k}^{*} T_{n} (F) z_{k}

for some

j \in {1, \dots, n N}

. Let

F (ω) = U (ω) diag (λ_{1} (F (ω)), \dots, λ_{N} (F (ω))) {(U (ω))}^{- 1}

be a unitary diagonalization (i.e., an eigenvalue decomposition where the eigenvector matrix

U (ω)

is unitary) of

F (ω)

for all

ω \in [0, 2 π]

. Then,

\begin{matrix} λ_{j} ({\hat{C}}_{n} (F)) \\ = \sum_{k = 1}^{n} {[z_{k}^{*} T_{n} (F) z_{k}]}_{1, 1} = \sum_{k = 1}^{n} \sum_{h = 1}^{n} {[z_{k}^{*}]}_{1, h} {[T_{n} (F) z_{k}]}_{h, 1} = \sum_{k = 1}^{n} \sum_{h = 1}^{n} {[z_{k}^{*}]}_{1, h} \sum_{l = 1}^{n} {[T_{n} (F)]}_{h, l} {[z_{k}]}_{l, 1} \\ = \sum_{k = 1}^{n} \sum_{h = 1}^{n} {[z_{k}]}_{h, 1}^{*} \sum_{l = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} e^{- (h - l) ω i} F (ω) d ω {[z_{k}]}_{l, 1} \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{h = 1}^{n} {[z_{k}]}_{h, 1}^{*} \sum_{l = 1}^{n} e^{(- h + l) ω i} F (ω) {[z_{k}]}_{l, 1} d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} {(\sum_{h = 1}^{n} e^{h ω i} {[z_{k}]}_{h, 1})}^{*} F (ω) (\sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}) d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} {({(U (ω))}^{*} \sum_{h = 1}^{n} e^{h ω i} {[z_{k}]}_{h, 1})}^{*} diag (λ_{1} (F (ω)), \dots, λ_{N} (F (ω))) ({(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}) d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{s = 1}^{N} {[{({(U (ω))}^{*} \sum_{h = 1}^{n} e^{h ω i} {[z_{k}]}_{h, 1})}^{*}]}_{1, s} λ_{s} (F (ω)) {[{(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}]}_{s, 1} d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{s = 1}^{N} λ_{s} (F (ω)) {|{[{(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}]}_{s, 1}|}^{2} d ω . \end{matrix}

Therefore,

\begin{matrix} inf (F) \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{s = 1}^{N} {|{[{(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}]}_{s, 1}|}^{2} d ω \leq λ_{j} ({\hat{C}}_{n} (F)) \\ \leq sup (F) \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{s = 1}^{N} {|{[{(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}]}_{s, 1}|}^{2} d ω . \end{matrix}

Since

\frac{1}{2 π} \int_{0}^{2 π} e^{m ω i} d ω = \{\begin{matrix} 1 & if m = 0, \\ 0 & if m \in Z \ {0}, \end{matrix}

we obtain

\begin{matrix} \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{s = 1}^{N} {|{[{(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1}]}_{s, 1}|}^{2} d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} {({(U (ω))}^{*} \sum_{h = 1}^{n} e^{h ω i} {[z_{k}]}_{h, 1})}^{*} {(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1} d ω \\ = \sum_{k = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} \sum_{h = 1}^{n} e^{- h ω i} {[z_{k}]}_{h, 1}^{*} U (ω) {(U (ω))}^{*} \sum_{l = 1}^{n} e^{l ω i} {[z_{k}]}_{l, 1} d ω \\ = \sum_{k = 1}^{n} \sum_{h = 1}^{n} {[z_{k}^{*}]}_{1, h} \sum_{l = 1}^{n} \frac{1}{2 π} \int_{0}^{2 π} e^{(- h + l) ω i} d ω {[z_{k}]}_{l, 1} = \sum_{k = 1}^{n} \sum_{h = 1}^{n} {[z_{k}^{*}]}_{1, h} {[z_{k}]}_{h, 1} = \sum_{k = 1}^{n} z_{k}^{*} z_{k} \\ = \sum_{k = 1}^{n} {[y]}_{k, 1}^{*} u_{k}^{*} {(V_{n} \otimes I_{N})}^{*} (V_{n} \otimes I_{N}) u_{k} {[y]}_{k, 1} = \sum_{k = 1}^{n} {[y^{*}]}_{1, k} u_{k}^{*} u_{k} {[y]}_{k, 1} = \sum_{k = 1}^{n} {[y^{*}]}_{1, k} {[y]}_{k, 1} \\ = y^{*} y = x^{*} (V_{n} \otimes I_{N}) {(V_{n} \otimes I_{N})}^{*} x = x^{*} x = 1, \end{matrix}

which completes the proof. ☐

4. Coding WSS Vector Processes by Using the Sequence of Block Circulant Matrices Considered

Consider that

{x_{n} : n \in N}

is a real zero-mean Gaussian N-dimensional vector process. From [8], we know that the RDF of the real zero-mean Gaussian vector

x_{n : 1}

is given by

R_{n} (D) = \frac{1}{n N} \sum_{k = 1}^{n N} max \{0, \frac{1}{2} ln \frac{λ_{k} (E (x_{n : 1} x_{n : 1}^{⊤}))}{θ_{n}}\}, n \in N,

where

θ_{n}

is a real number satisfying

D = \frac{1}{n N} \sum_{k = 1}^{n N} min \{θ_{n}, λ_{k} (E (x_{n : 1} x_{n : 1}^{⊤}))\} .

We recall that

R_{n} (D)

represents the lowest possible required rate (measured in nats) for jointly encoding (compressing) n symbols from N sources with mean square error (MSE) distortion D.

We assume that

{x_{n} : n \in N}

is WSS with continuous PSD X and

inf (X) > 0

. Fix

D \in (0, inf (X)]

, and from Assertion (1), we obtain that

θ_{n} = D

, and, consequently,

\begin{matrix} R_{n} (D) & = \frac{1}{n N} \sum_{k = 1}^{n N} max \{0, \frac{1}{2} ln \frac{λ_{k} (E (x_{n : 1} x_{n : 1}^{⊤}))}{D}\} \\ = \frac{1}{n N} \sum_{k = 1}^{n N} \frac{1}{2} ln \frac{λ_{k} (E (x_{n : 1} x_{n : 1}^{⊤}))}{D} = \frac{1}{2 n N} \sum_{k = 1}^{n N} ln \frac{λ_{k} (T_{n} (X))}{D} \forall n \in N . \end{matrix}

For every

n \in N

, let us compute the discrete Fourier transform (DFT) of the N sources as

y_{n : 1} = (V_{n}^{*} \otimes I_{N}) x_{n : 1} = {(V_{n} \otimes I_{N})}^{*} x_{n : 1} .

(3)

The correlation matrix of

y_{n : 1}

is given by

\begin{matrix} E (y_{n : 1} y_{n : 1}^{*}) & = E ({(V_{n} \otimes I_{N})}^{*} x_{n : 1} x_{n : 1}^{*} (V_{n} \otimes I_{N})) = {(V_{n} \otimes I_{N})}^{*} E (x_{n : 1} x_{n : 1}^{*}) (V_{n} \otimes I_{N}) \\ = {(V_{n} \otimes I_{N})}^{*} E (x_{n : 1} x_{n : 1}^{⊤}) (V_{n} \otimes I_{N}) = {(V_{n} \otimes I_{N})}^{*} T_{n} (X) (V_{n} \otimes I_{N}) . \end{matrix}

It is clear from Assertion (3) that the lowest possible required rate for encoding

y_{n : 1}

with MSE distortion D is the same as for

x_{n : 1}

, i.e.,

R_{n} (D)

.

Let us compress the Gaussian vector

y_{n : 1}

as if the samples

y_{k_{1}}

and

y_{k_{2}}

were uncorrelated for all

k_{1}, k_{2} \in {1, \dots, n}

with

k_{1} \neq k_{2}

, or, equivalently, as if its correlation matrix were

{diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{N})}^{*} T_{n} (X) (V_{n} \otimes I_{N})]}_{k, k})

instead of

{(V_{n} \otimes I_{N})}^{*} T_{n} (X) (V_{n} \otimes I_{N})

. In this case, from Lemma 6, we obtain that the corresponding rate is given by

{\hat{R}}_{n} (D) = \frac{1}{2 n N} \sum_{k = 1}^{n N} ln \frac{λ_{k} ({\hat{C}}_{n} (X))}{D} \forall n \in N .

We now study the rate loss

{\hat{R}}_{n} (D) - R_{n} (D)

. We have

\begin{matrix} 0 & \leq {\hat{R}}_{n} (D) - R_{n} (D) = \frac{1}{2 n N} \sum_{k = 1}^{n N} ln \frac{λ_{k} ({\hat{C}}_{n} (X))}{λ_{k} (T_{n} (X))} \\ = \frac{1}{2 n N} ln \prod_{k = 1}^{n N} \frac{λ_{k} ({\hat{C}}_{n} (X))}{λ_{k} (T_{n} (X))} = \frac{1}{2 n N} ln \frac{det ({\hat{C}}_{n} (X))}{det (T_{n} (X))} = \frac{1}{2 n N} ln det ({\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1}) \\ = \frac{1}{2 n N} ln \prod_{k = 1}^{n N} λ_{k} ({\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1}) \leq \frac{1}{2 n N} ln ({(\frac{1}{n N} \sum_{k = 1}^{n N} λ_{k} ({\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1}))}^{n N}) \\ = \frac{1}{2} ln (\frac{1}{n N} \sum_{k = 1}^{n N} λ_{k} ({\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1})) = \frac{1}{2} ln (\frac{1}{n N} tr ({\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1})) \\ \leq \frac{1}{2} ln (\frac{\sqrt{n N}}{n N} {∥ {\hat{C}}_{n} (X) {(T_{n} (X))}^{- 1} ∥}_{F}) = \frac{1}{2} ln (\frac{1}{\sqrt{n N}} {∥ ({\hat{C}}_{n} (X) - T_{n} (X)) {(T_{n} (X))}^{- 1} + I_{n N} ∥}_{F}) \\ \leq \frac{1}{2} ln (\frac{1}{\sqrt{n N}} (\sqrt{n N} + {∥ ({\hat{C}}_{n} (X) - T_{n} (X)) {(T_{n} (X))}^{- 1} ∥}_{F})) \\ \leq \frac{1}{2} ln (\frac{1}{\sqrt{n N}} (\sqrt{n N} + ∥ {\hat{C}}_{n} (X) - T_{n} {(X) ∥}_{F} {∥ {(T_{n} (X))}^{- 1} ∥}_{2})) \\ = \frac{1}{2} ln (1 + \frac{∥ {\hat{C}}_{n} (X) - T_{n} {(X) ∥}_{F}}{\sqrt{n N}} {∥ {(T_{n} (X))}^{- 1} ∥}_{2}) . \end{matrix}

Applying Assertion (1) yields

0 \leq {\hat{R}}_{n} (D) - R_{n} (D) \leq \frac{1}{2} ln (1 + \frac{1}{\sqrt{N} inf (X)} \frac{∥ {\hat{C}}_{n} (X) - T_{n} {(X) ∥}_{F}}{\sqrt{n}}),

and hence, from Lemma 4, we conclude that

lim_{n \to \infty} ({\hat{R}}_{n} (D) - R_{n} (D)) = 0 .

(4)

The obtained result (4) on the RDF of WSS vector processes shows that there is no rate loss for large enough n if we consider the correlation matrix of

y_{n : 1}

as if it were a block diagonal matrix. Obviously, encoding all the samples

y_{1}, \dots, y_{n}

separately involves a notably lower computational complexity than encoding them jointly. The complexity of coding a WSS vector process in this way is

O (n log n)

if the fast Fourier transform (FFT) algorithm is used.

5. Filtering WSS Vector Processes by Using the Sequence of Block Circulant Matrices Considered

Consider a zero-mean WSS M-dimensional vector process

{x_{n} : n \in N}

with continuous power spectral density (PSD) X. Let

{y_{n} : n \in N}

be a zero-mean WSS N-dimensional vector process with continuous PSD Y. Assume that those two processes are jointly WSS with continuous joint PSD Z.

For every

n \in N

, if

{\hat{x}}_{n : 1}

is an estimation of

x_{n : 1}

from

y_{n : 1}

of the form

{\hat{x}}_{n : 1} = W y_{n : 1}

(5)

with

W \in C^{n M \times n N}

, the MSE per sample is given by

\begin{matrix} \frac{1}{n} MSE (W) & : = & \frac{1}{n} E (∥ x_{n : 1} - {\hat{x}}_{n : 1} ∥_{2}^{2}) = \frac{1}{n} E ({(x_{n : 1} - {\hat{x}}_{n : 1})}^{*} (x_{n : 1} - {\hat{x}}_{n : 1})) \\ = & \frac{1}{n} tr (E ((x_{n : 1} - {\hat{x}}_{n : 1}) {(x_{n : 1} - {\hat{x}}_{n : 1})}^{*})) \\ = & \frac{1}{n} tr (E (x_{n : 1} x_{n : 1}^{*}) - E (x_{n : 1} y_{n : 1}^{*}) W^{*} - W E (y_{n : 1} x_{n : 1}^{*}) + W E (y_{n : 1} y_{n : 1}^{*}) W^{*}) \\ = & \frac{1}{n} tr (T_{n} (X) - T_{n} (Z) W^{*} - W {(T_{n} (Z))}^{*} + W T_{n} (Y) W^{*}) . \end{matrix}

The minimum MSE (MMSE) is given by

MMSE = MSE (W_{0})

, where

W_{0} = E (x_{n : 1} y_{n : 1}^{*}) {(E (y_{n : 1} y_{n : 1}^{*}))}^{- 1} = T_{n} (Z) {(T_{n} (Y))}^{- 1}

whenever

det (T_{n} (Y)) \neq 0

(or, equivalently, whenever

T_{n} (Y)

is positive definite). The filter

W_{0}

is known as the Wiener filter.

Consider the following filter:

\begin{matrix} W_{C} & : = {\hat{C}}_{n} (Z) {({\hat{C}}_{n} (Y))}^{- 1} \\ = (V_{n} \otimes I_{M}) {diag}_{1 \leq k \leq n} ({[{(V_{n} \otimes I_{M})}^{*} T_{n} (Z) (V_{n} \otimes I_{N})]}_{k, k} {[{(V_{n} \otimes I_{N})}^{*} T_{n} (Y) (V_{n} \otimes I_{N})]}_{k, k}^{- 1}) {(V_{n} \otimes I_{N})}^{*} . \end{matrix}

The filter

W_{C}

is well defined because, from Lemma 5,

{\hat{C}}_{n} (Y)

is positive definite, and, therefore, it is invertible.

We now study the effect on the MSE when the optimal filter

W_{0}

is substituted by

W_{C}

. We have

\begin{matrix} 0 & \leq \frac{MSE (W_{C})}{n} - \frac{MMSE}{n} = |\frac{MSE (W_{C})}{n} - \frac{MMSE}{n}| \\ = |\frac{1}{n} tr (T_{n} (X) - T_{n} (Z) W_{C}^{*} - W_{C} {(T_{n} (Z))}^{*} + W_{C} (T_{n} (Y)) W_{C}^{*}) - \frac{1}{n} tr (T_{n} (X) - T_{n} (Z) {(T_{n} (Y))}^{- 1} {(T_{n} (Z))}^{*})| \\ = |\frac{1}{n} tr (- T_{n} (Z) W_{C}^{*} - W_{C} {(T_{n} (Z))}^{*} + W_{C} (T_{n} (Y)) W_{C}^{*} + T_{n} (Z) {(T_{n} (Y))}^{- 1} {(T_{n} (Z))}^{*})| \\ = \frac{1}{n} | tr ((T_{n} (Z) {(T_{n} (Y))}^{- 1} - W_{C}) ({(T_{n} (Z))}^{*} - T_{n} (Y) W_{C}^{*})) | \\ \leq \frac{\sqrt{n M}}{n} {∥ (T_{n} (Z) {(T_{n} (Y))}^{- 1} - W_{C}) ({(T_{n} (Z))}^{*} - T_{n} (Y) W_{C}^{*}) ∥}_{F} \\ \leq \sqrt{\frac{M}{n}} ∥ T_{n} (Z) {(T_{n} (Y))}^{- 1} - W_{C} ∥_{F} {∥ {(T_{n} (Z))}^{*} - T_{n} (Y) W_{C}^{*} ∥}_{2} \\ \leq \sqrt{\frac{M}{n}} ∥ T_{n} (Z) {(T_{n} (Y))}^{- 1} - {\hat{C}}_{n} (Z) {({\hat{C}}_{n} (Y))}^{- 1} ∥_{F} {∥ {(T_{n} (Z))}^{*} - T_{n} (Y) {({\hat{C}}_{n} (Y))}^{- 1} {({\hat{C}}_{n} (Z))}^{*} ∥}_{2} . \end{matrix}

Since

\begin{matrix} ∥ T_{n} (Z) {(T_{n} (Y))}^{- 1} - {\hat{C}}_{n} (Z) {({\hat{C}}_{n} (Y))}^{- 1} ∥_{F} \\ = ∥ T_{n} (Z) {(T_{n} (Y))}^{- 1} - {\hat{C}}_{n} (Z) {(T_{n} (Y))}^{- 1} + {\hat{C}}_{n} (Z) {(T_{n} (Y))}^{- 1} - {\hat{C}}_{n} (Z) {({\hat{C}}_{n} (Y))}^{- 1} ∥_{F} \\ \leq ∥ (T_{n} (Z) - {\hat{C}}_{n} (Z)) {(T_{n} (Y))}^{- 1} ∥_{F} + {∥ {\hat{C}}_{n} (Z) ({(T_{n} (Y))}^{- 1} - {({\hat{C}}_{n} (Y))}^{- 1}) ∥}_{F} \\ \leq ∥ T_{n} (Z) - {\hat{C}}_{n} {(Z) ∥}_{F} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} {∥ {(T_{n} (Y))}^{- 1} - {({\hat{C}}_{n} (Y))}^{- 1} ∥}_{F} \\ \leq ∥ T_{n} (Z) - {\hat{C}}_{n} {(Z) ∥}_{F} {∥ {(T_{n} (Y))}^{- 1} ∥}_{2} \\ + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} {∥ {({\hat{C}}_{n} (Y))}^{- 1} {\hat{C}}_{n} (Y) {(T_{n} (Y))}^{- 1} - {({\hat{C}}_{n} (Y))}^{- 1} T_{n} (Y) {(T_{n} (Y))}^{- 1} ∥}_{F} \\ \leq ∥ T_{n} (Z) - {\hat{C}}_{n} {(Z) ∥}_{F} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} {∥ {({\hat{C}}_{n} (Y))}^{- 1} ({\hat{C}}_{n} (Y) - T_{n} (Y)) {(T_{n} (Y))}^{- 1} ∥}_{F} \\ \leq ∥ T_{n} (Z) - {\hat{C}}_{n} {(Z) ∥}_{F} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} ∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2} {∥ ({\hat{C}}_{n} (Y) - T_{n} (Y)) {(T_{n} (Y))}^{- 1} ∥}_{F} \\ \leq ∥ T_{n} (Z) - {\hat{C}}_{n} {(Z) ∥}_{F} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} ∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2} ∥ {\hat{C}}_{n} (Y) - T_{n} {(Y) ∥}_{F} {∥ {(T_{n} (Y))}^{- 1} ∥}_{2} \end{matrix}

and

\begin{matrix} ∥ {(T_{n} (Z))}^{*} - T_{n} (Y) {({\hat{C}}_{n} (Y))}^{- 1} {({\hat{C}}_{n} (Z))}^{*} ∥_{2} & \leq & ∥ {(T_{n} (Z))}^{*} ∥_{2} + {∥ T_{n} (Y) {({\hat{C}}_{n} (Y))}^{- 1} {({\hat{C}}_{n} (Z))}^{*} ∥}_{2} \\ \leq & ∥ T_{n} {(Z) ∥}_{2} + ∥ T_{n} (Y) {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2} {∥ {({\hat{C}}_{n} (Z))}^{*} ∥}_{2} \\ \leq & ∥ T_{n} {(Z) ∥}_{2} + ∥ T_{n} {(Y) ∥}_{2} ∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2} {∥ {\hat{C}}_{n} (Z) ∥}_{2}, \end{matrix}

we obtain

\begin{matrix} 0 & \leq & \frac{MSE (W_{C})}{n} - \frac{MMSE}{n} \\ \leq & \sqrt{M} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} (∥ T_{n} {(Z) ∥}_{2} + ∥ T_{n} {(Y) ∥}_{2} ∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2} ∥ {\hat{C}}_{n} (Z) ∥_{2}) \\ (\frac{∥ {\hat{C}}_{n} (Z) - T_{n} {(Z) ∥}_{F}}{\sqrt{n}} + ∥ {\hat{C}}_{n} {(Z) ∥}_{2} {∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥}_{2} \frac{∥ {\hat{C}}_{n} (Y) - T_{n} {(Y) ∥}_{F}}{\sqrt{n}}) . \end{matrix}

Consequently, applying Assertion (1) and Lemma 4 yields

\begin{matrix} 0 \leq \frac{MSE (W_{C})}{n} - \frac{MMSE}{n} & \leq & \sqrt{M} ∥ {(T_{n} (Y))}^{- 1} ∥_{2} σ_{1} (Z) (1 + sup (Y) ∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥_{2}) \\ (\frac{∥ {\hat{C}}_{n} (Z) - T_{n} {(Z) ∥}_{F}}{\sqrt{n}} + σ_{1} (Z) {∥ {({\hat{C}}_{n} (Y))}^{- 1} ∥}_{2} \frac{∥ {\hat{C}}_{n} (Y) - T_{n} {(Y) ∥}_{F}}{\sqrt{n}}) . \end{matrix}

If we assume that

inf (Y) > 0

, from Assertion (1), we obtain that

T_{n} (Y)

is positive definite for all

n \in N

. Moreover, applying Assertion (1) and Lemma 6 yields

0 \leq \frac{MSE (W_{C})}{n} - \frac{MMSE}{n} \leq \frac{\sqrt{M} σ_{1} (Z)}{inf (Y)} (1 + \frac{sup (Y)}{inf (Y)}) (\frac{∥ {\hat{C}}_{n} (Z) - T_{n} {(Z) ∥}_{F}}{\sqrt{n}} + \frac{σ_{1} (Z)}{inf (Y)} \frac{∥ {\hat{C}}_{n} (Y) - T_{n} {(Y) ∥}_{F}}{\sqrt{n}}),

and, therefore, from Lemma 4, we conclude that

lim_{n \to \infty} (\frac{MSE (W_{C})}{n} - \frac{MMSE}{n}) = 0 .

(6)

The obtained result (6) shows that there is no difference in the MSE for large enough n if we substitute the optimal filter

W_{0}

by

W_{C}

. Obviously, the computational complexity of the operation (5) is notably reduced when applying this substitution and the FFT algorithm is used. Specifically, the complexity is reduced from

O (n^{2})

to

O (n log n)

.

6. Conclusions

In this paper, we present a sequence of block circulant matrices and we apply it to reduce the complexity of coding and filtering WSS vector processes. Specifically, in both applications, the complexity is reduced from

O (n^{2})

to

O (n log n)

, which is the complexity of performing an FFT.

Acknowledgments

This work was supported in part by the Spanish Ministry of Economy and Competitiveness through the projects RACHEL (TEC2013-47141-C4-2-R) and CARMEN (TEC2016-75067-C4-3-R).

Author Contributions

Authors are listed in order of their degree of involvement in the work, with the most active contributors listed first. All authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gray, R.M. Toeplitz and circulant matrices: A review. Found. Trends Commun. Inf. Theory 2006, 2, 155–239. [Google Scholar] [CrossRef]
Pearl, J. On coding and filtering stationary signals by discrete Fourier transforms. IEEE Trans. Inf. Theory 1973, 19, 229–232. [Google Scholar] [CrossRef]
Gutiérrez-Gutiérrez, J.; Crespo, P.M. Block Toeplitz matrices: Asymptotic results and applications. Found. Trends Commun. Inf. Theory 2011, 8, 179–257. [Google Scholar] [CrossRef]
Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and Hermitian block Toeplitz matrices with continuous symbols: Applications to MIMO systems. IEEE Trans. Inf. Theory 2008, 54, 5671–5680. [Google Scholar] [CrossRef]
Bhatia, R. Matrix Analysis; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and multivariate ARMA processes. IEEE Trans. Inf. Theory 2011, 57, 5444–5454. [Google Scholar] [CrossRef]
Gray, R.M. On the asymptotic eigenvalue distribution of Toeplitz matrices. IEEE Trans. Inf. Theory 1972, 18, 725–730. [Google Scholar] [CrossRef]
Kolmogorov, A.N. On the Shannon theory of information transmission in the case of continuous signals. IRE Trans. Inf. Theory 1956, 2, 102–108. [Google Scholar] [CrossRef]

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

On the Complexity Reduction of Coding WSS Vector Processes by Using a Sequence of Block Circulant Matrices

Abstract

1. Introduction

2. Preliminaries

2.1. Notation

2.2. Block Toeplitz Matrices

2.3. Block Circulant Matrices

2.4. Asymptotically Equivalent Sequences of Matrices

3. Sequence of Block Circulant Matrices Considered

4. Coding WSS Vector Processes by Using the Sequence of Block Circulant Matrices Considered

5. Filtering WSS Vector Processes by Using the Sequence of Block Circulant Matrices Considered

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics