Abstract
In this paper, we present a low-complexity coding strategy to encode (compress) finite-length data blocks of Gaussian vector sources. We show that for large enough data blocks of a Gaussian asymptotically wide sense stationary (AWSS) vector source, the rate of the coding strategy tends to the lowest possible rate. Besides being a low-complexity strategy, it does not require knowledge of the correlation matrix of such data blocks. We also show that this coding strategy is appropriate for encoding the most relevant Gaussian vector sources, namely, wide sense stationary (WSS), moving average (MA), autoregressive (AR), and ARMA vector sources.
1. Introduction
The rate distortion function (RDF) of a source provides the minimum rate at which data can be encoded in order to be able to recover them with a mean squared error (MSE) per dimension not larger than a given distortion.
In this paper, we present a low-complexity coding strategy to encode (compress) finite-length data blocks of Gaussian N-dimensional vector sources. Moreover, we show that for large enough data blocks of a Gaussian asymptotically wide sense stationary (AWSS) vector source, the rate of our coding strategy tends to the RDF of the source. The definition of an AWSS vector process can be found in ([1] (Definition 7.1)). This definition was first introduced for the scalar case N = 1 (see ([2] (Section 6)) or [3]), and it is based on Gray's concept of asymptotically equivalent sequences of matrices [4].
A low-complexity coding strategy can be found in [5] for finite-length data blocks of Gaussian wide sense stationary (WSS) sources and in [6] for finite-length data blocks of Gaussian AWSS autoregressive (AR) sources. Both of those strategies deal with scalar processes. The low-complexity coding strategy presented in this paper generalizes the aforementioned strategies to Gaussian AWSS vector sources.
Our coding strategy is based on the block discrete Fourier transform (DFT), and therefore, it turns out to be a low-complexity coding strategy when the fast Fourier transform (FFT) algorithm is used. Specifically, the computational complexity of our coding strategy is $O(n \log n)$, where n is the length of the data blocks. Besides being a low-complexity strategy, it does not require knowledge of the correlation matrix of such data blocks.
We show that this coding strategy is appropriate to encode the most relevant Gaussian vector sources, namely, WSS, moving average (MA), autoregressive (AR), and ARMA vector sources. Our coding strategy is thus also appropriate for encoding Gaussian vector sources found in the literature, such as the corrupted WSS vector sources considered in [7,8] for the quadratic Gaussian CEO problem.
The paper is organized as follows. In Section 2, we obtain several new mathematical results on the block DFT, and we present an upper bound for the RDF of a complex Gaussian vector. In Section 3, using the results given in Section 2, we present a new coding strategy based on the block DFT to encode finite-length data blocks of Gaussian vector sources. In Section 4, we show that for large enough data blocks of a Gaussian AWSS vector source, the rate of our coding strategy tends to the RDF of the source. In Section 5, we show that our coding strategy is appropriate to encode WSS, MA, AR, and ARMA vector sources. In Section 6, conclusions and numerical examples are presented.
2. Preliminaries
2.1. Notation
In this paper, $\mathbb{N}$, $\mathbb{Z}$, $\mathbb{R}$, and $\mathbb{C}$ are the set of positive integers, the set of integers, the set of real numbers, and the set of complex numbers, respectively. The symbol $\top$ denotes transpose and the symbol $*$ denotes conjugate transpose. $\|\cdot\|_2$ and $\|\cdot\|_F$ are the spectral and the Frobenius norm, respectively. $\lceil x \rceil$ denotes the smallest integer higher than or equal to $x$. $E$ stands for expectation, $\otimes$ is the Kronecker product, and $\lambda_1(A) \geq \ldots \geq \lambda_n(A)$ denote the eigenvalues of an $n \times n$ Hermitian matrix $A$ arranged in decreasing order. $\mathbb{R}^n$ is the set of real n-dimensional (column) vectors, $\mathbb{C}^{m \times n}$ denotes the set of $m \times n$ complex matrices, $0_{m \times n}$ is the $m \times n$ zero matrix, $I_n$ denotes the $n \times n$ identity matrix, and $V_n$ is the $n \times n$ Fourier unitary matrix, i.e.,
$$\left[V_n\right]_{j,k} = \frac{1}{\sqrt{n}}\, e^{-\frac{2\pi (j-1)(k-1) i}{n}}, \qquad j,k \in \{1,\ldots,n\},$$
where $i$ is the imaginary unit.
If $A_k \in \mathbb{C}^{N \times N}$ for all $k \in \{1,\ldots,n\}$, then $\operatorname{diag}\left(A_1,\ldots,A_n\right)$ denotes the block diagonal matrix with $(j,k)$ block given by $\delta_{j,k} A_j$, where $\delta_{j,k}$ is the Kronecker delta.
$\Re(z)$ and $\Im(z)$ denote the real part and the imaginary part of a complex number $z$, respectively. If $A \in \mathbb{C}^{m \times n}$, then $\Re(A)$ and $\Im(A)$ are the real $m \times n$ matrices given by $[\Re(A)]_{j,k} = \Re\left([A]_{j,k}\right)$ and $[\Im(A)]_{j,k} = \Im\left([A]_{j,k}\right)$, respectively.
If $z \in \mathbb{C}^N$, then $\widehat{z}$ denotes the real $2N$-dimensional vector given by
$$\widehat{z} = \begin{pmatrix} \Re(z) \\ \Im(z) \end{pmatrix}.$$
If $x_k \in \mathbb{C}^N$ for all $k \in \{1,\ldots,n\}$, then $x^{1:n}$ is the $nN$-dimensional vector given by
$$x^{1:n} = \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix}.$$
Finally, if $x_k$ is a (complex) random N-dimensional vector for all $k \in \mathbb{N}$, then $\{x_k\}$ denotes the corresponding (complex) random N-dimensional vector process.
2.2. New Mathematical Results on the Block DFT
We first give a simple result on the block DFT of real vectors.
Lemma 1.
Let $n \in \mathbb{N}$. Consider $x_k \in \mathbb{C}^N$ for all $k \in \{1,\ldots,n\}$. Suppose that $\widehat{x}^{1:n}$ is the block DFT of $x^{1:n}$, i.e.,
$$\widehat{x}^{1:n} = \left(V_n \otimes I_N\right)^* x^{1:n}. \qquad (1)$$
Then the two following assertions are equivalent:
- 1.
- $x^{1:n} \in \mathbb{R}^{nN}$.
- 2.
- $\widehat{x}_1 \in \mathbb{R}^N$ and $\widehat{x}_{n-k+2} = \overline{\widehat{x}_k}$ for all $k \in \{2,\ldots,n\}$.
Proof.
See Appendix A. □
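In essence, Lemma 1 is the block version of the familiar conjugate-symmetry property of the DFT of real sequences. The following numerical check (a sketch of ours in Python/NumPy, not code from the paper; it assumes the block DFT convention $\widehat{x}^{1:n} = (V_n \otimes I_N)^* x^{1:n}$ of Equation (1)) illustrates the lemma.

```python
import numpy as np

n, N = 6, 2                      # number of blocks and block dimension
rng = np.random.default_rng(0)
x = rng.standard_normal(n * N)   # a real vector made of n blocks of size N

# Fourier unitary matrix V_n and the block DFT matrix (V_n ⊗ I_N)^*
j, k = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
V = np.exp(-2j * np.pi * j * k / n) / np.sqrt(n)
x_hat = (np.kron(V, np.eye(N)).conj().T @ x).reshape(n, N)

# Conjugate symmetry of the blocks: the first block is real (and so is
# the middle block for even n), while the rest pair up as conjugates.
assert np.allclose(x_hat[0].imag, 0)
for m in range(1, n):
    assert np.allclose(x_hat[m], x_hat[(n - m) % n].conj())
```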
We now give three new mathematical results on the block DFT of random vectors that are used in Section 3.
Theorem 1.
Consider $n \in \mathbb{N}$. Let $x_k$ be a random N-dimensional vector for all $k \in \{1,\ldots,n\}$. Suppose that $\widehat{x}^{1:n}$ is given by Equation (1). If , then
and
Proof.
See Appendix B. □
Theorem 2.
Let $x^{1:n}$ and $\widehat{x}^{1:n}$ be as in Theorem 1. Suppose that $x^{1:n}$ is real. If , then
Proof.
See Appendix C. □
Lemma 2.
Let $x^{1:n}$ and $\widehat{x}^{1:n}$ be as in Theorem 1. If , then
- 1.
- .
- 2.
- .
- 3.
- .
Proof.
See Appendix D. □
2.3. Upper Bound for the RDF of a Complex Gaussian Vector
In [9], Kolmogorov gave a formula for the RDF of a real zero-mean Gaussian N-dimensional vector $x$ with positive definite correlation matrix $E\left[xx^\top\right]$, namely,
$$R_x(D) = \frac{1}{N}\sum_{k=1}^{N}\max\left\{0,\frac{1}{2}\ln\frac{\lambda_k\left(E\left[xx^\top\right]\right)}{\theta}\right\},$$
where $\theta$ is a real number satisfying
$$\frac{1}{N}\sum_{k=1}^{N}\min\left\{\theta,\lambda_k\left(E\left[xx^\top\right]\right)\right\} = D.$$
An optimal coding strategy to achieve $R_x(D)$ is to encode the components of $U^\top x$ separately, where $U$ is a real orthogonal eigenvector matrix of $E\left[xx^\top\right]$ (see ([6] (Corollary 1))). Observe that in order to obtain $U$, we need to know the correlation matrix $E\left[xx^\top\right]$. This coding strategy also requires an optimal coding method for real Gaussian random variables. Moreover, if $D \leq \lambda_N\left(E\left[xx^\top\right]\right)$, then $\theta = D$, and we obtain
$$R_x(D) = \frac{1}{2N}\ln\frac{\det\left(E\left[xx^\top\right]\right)}{D^N}.$$
We recall that $R_x(D)$ can be thought of as the minimum rate (measured in nats) at which $x$ can be encoded (compressed) in order to be able to recover it with an MSE per dimension not larger than D, that is,
$$\frac{1}{N}E\left[\left\|x-\widetilde{x}\right\|_2^2\right] \leq D,$$
where $\widetilde{x}$ denotes the estimation of $x$.
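Since the water level $\theta$ in Kolmogorov's formula is monotone in the distortion it produces, it can be found numerically by bisection. The following sketch (our own illustration; `gaussian_rdf` is a hypothetical helper name, not notation from the paper) computes $R_x(D)$ this way and checks it against the closed form that holds when $D$ does not exceed the smallest eigenvalue.

```python
import numpy as np

def gaussian_rdf(eigvals, D, tol=1e-12):
    """Kolmogorov's reverse water-filling RDF (in nats per dimension)
    for a zero-mean Gaussian vector whose correlation matrix has
    eigenvalues `eigvals`, at MSE-per-dimension distortion D."""
    lam = np.asarray(eigvals, dtype=float)
    lo, hi = 0.0, lam.max()
    while hi - lo > tol:                     # bisect on the water level θ
        theta = (lo + hi) / 2
        if np.minimum(theta, lam).mean() < D:
            lo = theta
        else:
            hi = theta
    theta = (lo + hi) / 2
    return np.maximum(0.0, 0.5 * np.log(lam / theta)).mean()

# If D is below the smallest eigenvalue, θ = D and the RDF reduces to
# (1/(2N)) * ln(det(E[xx^T]) / D^N).
lam = np.array([4.0, 2.0, 1.0])
D = 0.5
closed_form = 0.5 * np.mean(np.log(lam / D))
print(gaussian_rdf(lam, D), closed_form)     # the two values agree
```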
The following result gives an upper bound for the RDF of a complex zero-mean Gaussian N-dimensional vector (i.e., a real zero-mean Gaussian 2N-dimensional vector).
Lemma 3.
Consider $D \in (0,\infty)$. Let z be a complex zero-mean Gaussian N-dimensional vector. If $E\left[zz^*\right]$ is a positive definite matrix, then
Proof.
We divide the proof into three steps:
Step 1: We prove that is a positive definite matrix. We have
and
Consider , and suppose that . We only need to show that . As is a positive definite matrix and
we obtain , or equivalently .
Step 2: We show that . We have , where and . Applying ([10] (Corollary 1)), we obtain
3. Low-Complexity Coding Strategy for Gaussian Vector Sources
In this section (see Theorem 3), we present our coding strategy for Gaussian vector sources. To encode a finite-length data block $x^{1:n}$ of a Gaussian N-dimensional vector source $\{x_k\}$, we compute the block DFT $\widehat{x}^{1:n}$ of $x^{1:n}$ (Equation (1)), and we encode the resulting vectors $\widehat{x}_k$ separately (see Figure 1).
Figure 1.
Proposed coding strategy for Gaussian vector sources. In this figure, each encoder (decoder) block denotes the optimal encoder (decoder) for the corresponding Gaussian N-dimensional vector $\widehat{x}_k$.
Theorem 3 also provides an upper bound on the rate of our strategy. This upper bound is used in Section 4 to prove that our coding strategy is asymptotically optimal whenever the Gaussian vector source is AWSS.
In Theorem 3 denotes the matrix , where .
Theorem 3.
Consider $n \in \mathbb{N}$ and $D \in (0,\infty)$. Let $x_k$ be a random N-dimensional vector for all $k \in \{1,\ldots,n\}$. Suppose that $x^{1:n}$ is a real zero-mean Gaussian vector with a positive definite correlation matrix $E\left[x^{1:n}\left(x^{1:n}\right)^\top\right]$. Let $\widehat{x}^{1:n}$ be the random vector given by Equation (1). If , then
where
Moreover,
Proof.
We divide the proof into three steps:
Step 1: We show that . From Lemma 1, for all , and with . We encode separately (i.e., if n is even, we encode separately, and if n is odd, we encode separately) with
and
Let with
where for all , and for all . Applying Lemma 1 yields . As is unitary and is unitarily invariant, we have
Consequently,
Step 2: We prove that . From Equations (3) and (5), we obtain
and applying Theorem 2 and Equation (5) yields
From Lemma 3, we have
As
we obtain
Step 3: We show Equation (8).
As is a positive definite matrix (or equivalently, is Hermitian and for all ), is Hermitian. Hence, is Hermitian for all , and therefore, is also Hermitian. Consequently, is Hermitian, and applying Equations (3) and (11), we have that is a positive definite matrix.
Let be an eigenvalue decomposition (EVD) of , where U is unitary. Thus, and .
Since is Hermitian and is a positive definite matrix, is also a positive definite matrix.
In Equation (12), is written in terms of . can be written in terms of and by using Lemma 2 and Equations (9) and (10).
As our coding strategy requires the computation of the block DFT, its computational complexity is $O(n \log n)$ whenever the FFT algorithm is used. We recall that the computational complexity of the optimal coding strategy for $x^{1:n}$ is $O(n^2)$, since it requires the computation of $U^\top x^{1:n}$, where $U$ is a real orthogonal eigenvector matrix of $E\left[x^{1:n}\left(x^{1:n}\right)^\top\right]$. Observe that such an eigenvector matrix also needs to be computed, which further increases the complexity. Hence, the main advantage of our coding strategy is that it notably reduces the computational complexity of coding $x^{1:n}$. Moreover, our coding strategy does not require the knowledge of $E\left[x^{1:n}\left(x^{1:n}\right)^\top\right]$. It only requires the knowledge of the correlation matrices of the vectors $\widehat{x}_k$ that are encoded.
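To make the pipeline of Figure 1 concrete, here is a short sketch (ours; the function name is hypothetical, and the sign convention is the one assumed in Equation (1) above). It computes the block DFT with the FFT, which is the only $O(n \log n)$ step our strategy adds before the per-block encoders.

```python
import numpy as np

def block_dft(x, N):
    """Block DFT of a data block x (length n*N) via the FFT: applying
    (V_n ⊗ I_N)^* amounts to N independent length-n (inverse) FFTs,
    one per vector component, costing O(N n log n)."""
    n = x.size // N
    blocks = x.reshape(n, N)
    # Unitary conjugated DFT along the block index (axis 0).
    return np.fft.ifft(blocks, axis=0) * np.sqrt(n)

rng = np.random.default_rng(1)
n, N = 8, 2
x = rng.standard_normal(n * N)
x_hat = block_dft(x, N)

# Each frequency block x_hat[k] would now be fed to its own optimal
# encoder (the encoder blocks of Figure 1); no eigendecomposition of
# the full nN x nN correlation matrix is needed.
print(x_hat.shape)   # (8, 2): n blocks of dimension N
```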
It should be mentioned that Equation (7) provides two upper bounds for the RDF of finite-length data blocks of a real zero-mean Gaussian N-dimensional vector source. The greater upper bound in Equation (7) was given in [11] for the case in which the random vector source is WSS, and therefore, the correlation matrix of the Gaussian vector is a block Toeplitz matrix. That upper bound was first presented by Pearl in [12] for the case in which the source is WSS and N = 1. However, neither [11] nor [12] proposes a coding strategy attaining those bounds.
4. Optimality of the Proposed Coding Strategy for Gaussian AWSS Vector Sources
In this section (see Theorem 4), we show that our coding strategy is asymptotically optimal, i.e., we show that for large enough data blocks of a Gaussian AWSS vector source , the rate of our coding strategy, presented in Section 3, tends to the RDF of the source.
We begin by introducing some notation. If $X$ is a continuous and $2\pi$-periodic $N \times N$ matrix-valued function of a real variable, we denote by $T_n(X)$ the $nN \times nN$ block Toeplitz matrix with blocks given by
$$\left[T_n(X)\right]_{j,k} = X_{j-k}, \qquad j,k \in \{1,\ldots,n\},$$
where $\{X_k\}_{k \in \mathbb{Z}}$ is the sequence of Fourier coefficients of X:
$$X_k = \frac{1}{2\pi}\int_0^{2\pi} X(\omega)\, e^{-k\omega i}\, d\omega, \qquad k \in \mathbb{Z}.$$
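As an illustration of this definition, the sketch below (ours; the symbol $X$ is a toy example, not one from the paper) assembles $T_n(X)$ from numerically computed Fourier coefficients.

```python
import numpy as np

def fourier_coefficient(X, k, grid=4096):
    """k-th Fourier coefficient X_k = (1/2π) ∫ X(ω) e^{-ikω} dω,
    approximated by a Riemann sum on a uniform grid (exact for
    trigonometric polynomials of low degree)."""
    w = 2 * np.pi * np.arange(grid) / grid
    vals = np.stack([X(wi) * np.exp(-1j * k * wi) for wi in w])
    return vals.mean(axis=0)

def block_toeplitz(X, n, N):
    """nN x nN block Toeplitz matrix T_n(X) with (j,l) block X_{j-l}."""
    T = np.zeros((n * N, n * N), dtype=complex)
    coeffs = {k: fourier_coefficient(X, k) for k in range(-(n - 1), n)}
    for j in range(n):
        for l in range(n):
            T[j*N:(j+1)*N, l*N:(l+1)*N] = coeffs[j - l]
    return T

# A toy 2x2 symbol: X(ω) = I_2 + cos(ω) A with A symmetric.
A = np.array([[0.0, 0.5], [0.5, 0.0]])
X = lambda w: np.eye(2) + np.cos(w) * A
T3 = block_toeplitz(X, n=3, N=2)
print(np.round(T3.real, 3))   # X_0 = I, X_{±1} = A/2, X_k = 0 otherwise
```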
If $A_n$ and $B_n$ are $nN \times nN$ matrices for all $n \in \mathbb{N}$, we write $\{A_n\} \sim \{B_n\}$ when the sequences $\{A_n\}$ and $\{B_n\}$ are asymptotically equivalent (see ([13] (p. 5673))), that is, $\left\{\|A_n\|_2\right\}$ and $\left\{\|B_n\|_2\right\}$ are bounded and
$$\lim_{n\to\infty}\frac{\left\|A_n - B_n\right\|_F}{\sqrt{n}} = 0.$$
The original definition of asymptotically equivalent sequences of matrices was given by Gray (see ([2] (Section 2.3)) or [4]) for N = 1.
We now review the definition of the AWSS vector process given in ([1] (Definition 7.1)). This definition was first introduced for the scalar case N = 1 (see ([2] (Section 6)) or [3]).
Definition 1.
Let $X$ be an $N \times N$ matrix-valued function of a real variable, and suppose that it is continuous and $2\pi$-periodic. A random N-dimensional vector process $\{x_k\}$ is said to be AWSS with asymptotic power spectral density (APSD) X if it has constant mean (i.e., $E[x_k] = E[x_1]$ for all $k \in \mathbb{N}$) and $\left\{E\left[x^{1:n}\left(x^{1:n}\right)^*\right]\right\} \sim \left\{T_n(X)\right\}$.
We recall that the RDF of $\{x_k\}$ is defined as $\lim_{n\to\infty} R_{x^{1:n}}(D)$.
Theorem 4.
Let $\{x_k\}$ be a real zero-mean Gaussian AWSS N-dimensional vector process with APSD X. Suppose that . If , then
Proof.
We divide the proof into two steps:
To finish the proof, we only need to show that
Let $C_n(X)$ be the block circulant matrix with $N \times N$ blocks defined in ([13] (p. 5674)), i.e.,
$$C_n(X) = \left(V_n \otimes I_N\right)\, \operatorname{diag}\left(X(0), X\left(\tfrac{2\pi}{n}\right), \ldots, X\left(\tfrac{2\pi(n-1)}{n}\right)\right)\left(V_n \otimes I_N\right)^*.$$
Observe that
Consequently, as the Frobenius norm is unitarily invariant, we have
Therefore,
Observe that the integral formula in Equation (13) provides the value of the RDF of the Gaussian AWSS vector source whenever . An integral formula of such an RDF for any can be found in ([15] (Theorem 1)). It should be mentioned that ([15] (Theorem 1)) generalized the integral formulas previously given in the literature for the RDF of certain Gaussian AWSS sources, namely, WSS scalar sources [9], AR AWSS scalar sources [16], and AR AWSS vector sources of finite order [17].
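The proof of Theorem 4 rests on the asymptotic equivalence of $\{T_n(X)\}$ and $\{C_n(X)\}$. The following self-contained check (our sketch, for a toy symbol whose Fourier coefficients vanish beyond lag 1, using the circulant convention written above) shows the normalized Frobenius distance decaying as $n$ grows.

```python
import numpy as np

N = 2
A = np.array([[0.0, 0.5], [0.5, 0.0]])
X = lambda w: np.eye(N) + np.cos(w) * A   # Fourier coeffs: X_0 = I, X_{±1} = A/2

def T(n):
    """Block Toeplitz T_n(X) for this symbol: I on the main block
    diagonal and A/2 on the two adjacent block diagonals."""
    off = np.eye(n, k=1) + np.eye(n, k=-1)
    return np.eye(n * N) + np.kron(off, A / 2)

def C(n):
    """Block circulant (V_n ⊗ I_N) diag(X(2πm/n)) (V_n ⊗ I_N)^*."""
    j, k = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    V = np.exp(-2j * np.pi * j * k / n) / np.sqrt(n)
    Vb = np.kron(V, np.eye(N))
    D = np.zeros((n * N, n * N), dtype=complex)
    for m in range(n):
        D[m*N:(m+1)*N, m*N:(m+1)*N] = X(2 * np.pi * m / n)
    return Vb @ D @ Vb.conj().T

# Asymptotic equivalence: ||T_n - C_n||_F / sqrt(n) → 0 as n → ∞
# (T_n and C_n only differ in two constant corner blocks here).
for n in (8, 32, 128):
    print(n, np.linalg.norm(T(n) - C(n), "fro") / np.sqrt(n))
```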
5. Relevant AWSS Vector Sources
WSS, MA, AR, and ARMA vector processes are frequently used to model multivariate time series (see, e.g., [18]) that arise in any domain that involves temporal measurements. In this section, we show that our coding strategy is appropriate to encode the aforementioned vector sources whenever they are Gaussian and AWSS.
It should be mentioned that Gaussian AWSS MA vector (VMA) processes, Gaussian AWSS AR vector (VAR) processes, and Gaussian AWSS ARMA vector (VARMA) processes are frequently called Gaussian stationary VMA processes, Gaussian stationary VAR processes, and Gaussian stationary VARMA processes, respectively (see, e.g., [18]). However, they are asymptotically stationary but not stationary, because their corresponding correlation matrices are not block Toeplitz.
5.1. WSS Vector Sources
In this subsection (see Theorem 5), we give conditions under which our coding strategy is asymptotically optimal for WSS vector sources.
We first recall the well-known concept of WSS vector process.
Definition 2.
Let $X$ be an $N \times N$ matrix-valued function of a real variable, and suppose that it is continuous and $2\pi$-periodic. A random N-dimensional vector process $\{x_k\}$ is said to be WSS (or weakly stationary) with PSD X if it has constant mean and $E\left[x^{1:n}\left(x^{1:n}\right)^*\right] = T_n(X)$ for all $n \in \mathbb{N}$.
Theorem 5.
Let $\{x_k\}$ be a real zero-mean Gaussian WSS N-dimensional vector process with PSD X. Suppose that (or equivalently, for all ). If , then
Proof.
Applying ([1] (Lemma 3.3)) and ([1] (Theorem 4.3)) yields . Theorem 5 now follows from ([14] (Proposition 3)) and Theorem 4. □
Theorem 5 was presented in [5] for the case N = 1 (i.e., for scalar WSS sources rather than for vector WSS sources).
5.2. VMA Sources
In this subsection (see Theorem 6), we give conditions under which our coding strategy is asymptotically optimal for VMA sources.
We start by reviewing the concept of VMA process.
Definition 3.
A real zero-mean random N-dimensional vector process $\{x_k\}$ is said to be MA if
$$x_k = w_k + \sum_{j=1}^{k-1} F_j w_{k-j}, \qquad k \in \mathbb{N},$$
where $F_j$, $j \in \mathbb{N}$, are real $N \times N$ matrices, $\{w_k\}$ is a real zero-mean random N-dimensional vector process, and $E\left[w_{k_1} w_{k_2}^\top\right] = \delta_{k_1,k_2}\Lambda$ for all $k_1,k_2 \in \mathbb{N}$ with Λ being a real positive definite matrix. If there exists $q \in \mathbb{N}$ such that $F_j = 0_{N \times N}$ for all $j > q$, then $\{x_k\}$ is called a VMA(q) process.
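As a concrete illustration of this definition, the following simulation sketch (ours; the coefficient matrix $F_1$ and the covariance Λ are arbitrary choices) generates a VMA(1) sample path and compares its empirical lag-0 and lag-1 covariances with the theoretical values $\Lambda + F_1 \Lambda F_1^\top$ and $F_1\Lambda$.

```python
import numpy as np

rng = np.random.default_rng(2)
N, T = 2, 200_000
F1 = np.array([[0.4, 0.1], [-0.2, 0.3]])     # hypothetical MA coefficient
Lam = np.array([[1.0, 0.3], [0.3, 0.5]])     # innovation covariance Λ

w = rng.multivariate_normal(np.zeros(N), Lam, size=T + 1)
x = w[1:] + w[:-1] @ F1.T                    # x_k = w_k + F1 w_{k-1}

# Empirical lag-0 and lag-1 covariances vs. their theoretical values
# (the differences vanish as T grows).
C0_emp = x.T @ x / T
C1_emp = x[1:].T @ x[:-1] / (T - 1)          # estimates E[x_{k+1} x_k^T]
print(np.round(C0_emp - (Lam + F1 @ Lam @ F1.T), 2))
print(np.round(C1_emp - F1 @ Lam, 2))
```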
Theorem 6.
Let $\{x_k\}$ be as in Definition 3. Assume that $\{F_j\}_{j \in \mathbb{Z}}$, with $F_0 = I_N$ and $F_j = 0_{N \times N}$ for all $j < 0$, is the sequence of Fourier coefficients of a function $F$, which is continuous and $2\pi$-periodic. Suppose that $\{T_n(F)\}$ is stable (that is, $\left\{\left\|\left(T_n(F)\right)^{-1}\right\|_2\right\}$ is bounded). If $\{x_k\}$ is Gaussian and , then
Moreover, for all .
Proof.
We divide the proof into three steps:
Step 1: We show that for all . From ([15] (Equation (A3))) we have that . Consequently,
Step 2: We prove the first equality in Equation (17). Applying ([15] (Theorem 2)), we obtain that is AWSS. From Theorem 4, we only need to show that . We have
5.3. VAR AWSS Sources
In this subsection (see Theorem 7), we give conditions under which our coding strategy is asymptotically optimal for VAR sources.
We first recall the concept of VAR process.
Definition 4.
A real zero-mean random N-dimensional vector process $\{x_k\}$ is said to be AR if
$$x_k = w_k - \sum_{j=1}^{k-1} A_j x_{k-j}, \qquad k \in \mathbb{N},$$
where $A_j$, $j \in \mathbb{N}$, are real $N \times N$ matrices, $\{w_k\}$ is a real zero-mean random N-dimensional vector process, and $E\left[w_{k_1} w_{k_2}^\top\right] = \delta_{k_1,k_2}\Lambda$ for all $k_1,k_2 \in \mathbb{N}$ with Λ being a real positive definite matrix. If there exists $p \in \mathbb{N}$ such that $A_j = 0_{N \times N}$ for all $j > p$, then $\{x_k\}$ is called a VAR(p) process.
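For a finite-order VAR process, the classical stability criterion requires all eigenvalues of the companion matrix of the recursion to lie inside the unit circle; this serves as a practical screen for the stability assumption appearing in Theorem 7 below. A sketch (ours, with hypothetical coefficients, using the sign convention of Definition 4):

```python
import numpy as np

def var_companion(A_list):
    """Companion matrix of x_k = w_k - A_1 x_{k-1} - ... - A_p x_{k-p}.
    The VAR(p) process is stable iff all eigenvalues of this matrix
    lie strictly inside the unit circle."""
    p = len(A_list)
    N = A_list[0].shape[0]
    top = np.hstack([-A for A in A_list])
    if p == 1:
        return top
    bottom = np.hstack([np.eye(N * (p - 1)), np.zeros((N * (p - 1), N))])
    return np.vstack([top, bottom])

A1 = np.array([[0.5, 0.1], [0.0, 0.4]])
A2 = np.array([[0.1, 0.0], [0.2, 0.1]])
rho = np.abs(np.linalg.eigvals(var_companion([A1, A2]))).max()
print(rho < 1)   # True: this VAR(2) example is stable
```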
Theorem 7.
Let $\{x_k\}$ be as in Definition 4. Assume that $\{A_j\}_{j \in \mathbb{Z}}$, with $A_0 = I_N$ and $A_j = 0_{N \times N}$ for all $j < 0$, is the sequence of Fourier coefficients of a function $A$, which is continuous and $2\pi$-periodic. Suppose that $\{T_n(A)\}$ is stable and for all . If $\{x_k\}$ is Gaussian and , then
Moreover, for all .
Proof.
We divide the proof into three steps:
Step 2: We prove the first equality in Equation (18). Applying ([15] (Theorem 3)), we obtain that is AWSS. From Theorem 4, we only need to show that . Applying ([1] (Theorem 4.3)) yields
Step 3: We show that for all . This can be directly obtained from Equation (12). □
Theorem 7 was presented in [6] for the case N = 1 (i.e., for scalar AR sources rather than for VAR sources).
5.4. VARMA AWSS Sources
In this subsection (see Theorem 8), we give conditions under which our coding strategy is asymptotically optimal for VARMA sources.
We start by reviewing the concept of VARMA process.
Definition 5.
A real zero-mean random N-dimensional vector process $\{x_k\}$ is said to be ARMA if
$$x_k = w_k - \sum_{j=1}^{k-1} A_j x_{k-j} + \sum_{j=1}^{k-1} F_j w_{k-j}, \qquad k \in \mathbb{N},$$
where $A_j$ and $F_j$, $j \in \mathbb{N}$, are real $N \times N$ matrices, $\{w_k\}$ is a real zero-mean random N-dimensional vector process, and $E\left[w_{k_1} w_{k_2}^\top\right] = \delta_{k_1,k_2}\Lambda$ for all $k_1,k_2 \in \mathbb{N}$ with Λ being a real positive definite matrix. If there exists $(p,q) \in \mathbb{N}^2$ such that $A_j = 0_{N \times N}$ for all $j > p$ and $F_j = 0_{N \times N}$ for all $j > q$, then $\{x_k\}$ is called a VARMA(p,q) process (or a VARMA process of (finite) order (p,q)).
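Putting the two previous sketches together, a VARMA(1,1) sample path can be generated directly from the recursion in Definition 5 (again our illustration, with arbitrary stable coefficients):

```python
import numpy as np

rng = np.random.default_rng(3)
N, T = 2, 1000
A1 = np.array([[0.5, 0.1], [0.0, 0.4]])    # AR coefficient (stable)
F1 = np.array([[0.4, 0.1], [-0.2, 0.3]])   # MA coefficient
Lam = np.array([[1.0, 0.3], [0.3, 0.5]])   # innovation covariance Λ

w = rng.multivariate_normal(np.zeros(N), Lam, size=T)
x = np.zeros((T, N))
x[0] = w[0]                                # x_1 = w_1
for k in range(1, T):
    # x_k = w_k - A1 x_{k-1} + F1 w_{k-1}  (Definition 5 with p = q = 1)
    x[k] = w[k] - A1 @ x[k - 1] + F1 @ w[k - 1]
```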
Theorem 8.
Let $\{x_k\}$ be as in Definition 5. Assume that $\{A_j\}_{j \in \mathbb{Z}}$, with $A_0 = I_N$ and $A_j = 0_{N \times N}$ for all $j < 0$, is the sequence of Fourier coefficients of a function $A$ which is continuous and $2\pi$-periodic. Suppose that $\{F_j\}_{j \in \mathbb{Z}}$, with $F_0 = I_N$ and $F_j = 0_{N \times N}$ for all $j < 0$, is the sequence of Fourier coefficients of a function $F$ which is continuous and $2\pi$-periodic. Assume that $\{T_n(A)\}$ and $\{T_n(F)\}$ are stable, and for all . If $\{x_k\}$ is Gaussian and , then
Moreover, for all .
Proof.
We divide the proof into three steps:
Step 1: We show that for all . From ([15] (Appendix D)) and ([1] (Lemma 4.2)), we have that . Consequently,
Step 2: We prove the first equality in Equation (19). Applying ([15] (Theorem 3)), we obtain that is AWSS. From Theorem 4, we only need to show that . Applying ([1] (Theorem 4.3)) yields
Step 3: We show that for all . This can be directly obtained from Equation (12). □
6. Numerical Examples
We first consider four AWSS vector processes: the zero-mean WSS vector process in ([20] (Section 4)), the VMA(1) process in ([18] (Example 2.1)), the VAR(1) process in ([18] (Example 2.3)), and the VARMA(1,1) process in ([18] (Example 3.2)). In ([20] (Section 4)), and the Fourier coefficients of its PSD X are
and with . In ([18] (Example 2.1)), , is given by
for all , and
In ([18] (Example 2.3)), , for all , and and are given by Equations (20) and (21), respectively. In ([18] (Example 3.2)), ,
for all , and for all .
Figure 2, Figure 3, Figure 4 and Figure 5 show the RDF of the source and the rate of our coding strategy for the four vector processes considered, assuming that they are Gaussian. The figures bear out that the rate of our coding strategy tends to the RDF of the source (see also the numerical sketch after Figure 5).
Figure 2.
Considered rates for the wide sense stationary (WSS) vector process in ([20] (Section 4)).
Figure 3.
Considered rates for the VMA(1) process in ([18] (Example 2.1)).
Figure 4.
Considered rates for the VAR(1) process in ([18] (Example 2.3)).
Figure 5.
Considered rates for the VARMA(1,1) process in ([18] (Example 3.2)).
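To complement the figures, here is a small self-contained experiment (ours; the WSS symbol is a toy choice, not one of the processes of [18,20], and the separate-encoding rate is computed Pearl-style from the diagonal blocks of the DFT-domain correlation matrix as a stand-in for the rate of our strategy). It exhibits the convergence of the separate-encoding rate to $R_{x^{1:n}}(D)$ as $n$ grows.

```python
import numpy as np

def rdf(eigvals, D, tol=1e-12):
    """Reverse water-filling RDF (nats/dimension) from eigenvalues."""
    lam = np.asarray(eigvals, dtype=float)
    lo, hi = 0.0, lam.max()
    while hi - lo > tol:
        theta = (lo + hi) / 2
        lo, hi = (theta, hi) if np.minimum(theta, lam).mean() < D else (lo, theta)
    return np.maximum(0.0, 0.5 * np.log(lam / ((lo + hi) / 2))).mean()

N, D = 2, 0.05
A = np.array([[0.0, 0.5], [0.5, 0.0]])

def corr(n):   # block Toeplitz correlation T_n(X) for X(ω) = I + cos(ω) A
    return np.eye(n * N) + np.kron(np.eye(n, k=1) + np.eye(n, k=-1), A / 2)

for n in (4, 16, 64):
    K = corr(n)
    j, k = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    Vb = np.kron(np.exp(-2j * np.pi * j * k / n) / np.sqrt(n), np.eye(N))
    Kf = Vb.conj().T @ K @ Vb      # correlation in the block DFT domain
    # Keep only the diagonal blocks: this is what separate encoding "sees".
    lam_sep = np.concatenate([np.linalg.eigvalsh(Kf[m*N:(m+1)*N, m*N:(m+1)*N])
                              for m in range(n)])
    print(n, rdf(np.linalg.eigvalsh(K), D), rdf(lam_sep, D))
```

As expected, the second rate upper-bounds the first for each n, and the gap between them shrinks as n grows.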
We finish with a numerical example to explore how our method performs in the presence of a perturbation. Specifically, we consider a perturbed version of the WSS vector process in ([20] (Section 4)) (Figure 6). The correlation matrices of the perturbed process are
Figure 6.
Considered rates for the perturbed WSS vector process with D = 0.001.
7. Conclusions
The computational complexity of coding finite-length data blocks of Gaussian N-dimensional vector sources can be reduced by using the low-complexity coding strategy presented here instead of the optimal coding strategy. Specifically, the computational complexity is reduced from $O(n^2)$ to $O(n \log n)$, where n is the length of the data blocks. Moreover, our coding strategy is asymptotically optimal (i.e., its rate tends to the RDF of the source) whenever the Gaussian vector source is AWSS and the considered data blocks are large enough. Besides being a low-complexity strategy, it does not require knowledge of the correlation matrix of such data blocks. Furthermore, our coding strategy is appropriate for encoding the most relevant Gaussian vector sources, namely, WSS, MA, AR, and ARMA vector sources.
Author Contributions
Authors are listed in order of their degree of involvement in the work, with the most active contributors listed first. J.G.-G. conceived the research question. All authors proved the main results and wrote the paper. All authors have read and approved the final manuscript.
Funding
This work was supported in part by the Spanish Ministry of Economy and Competitiveness through the CARMEN project (TEC2016-75067-C4-3-R).
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proof of Lemma 1
Proof.
(1) ⇒(2) We have
for all and .
(2)⇒(1) Since is a unitary matrix and
for all and , we conclude that
for all . □
Appendix B. Proof of Theorem 1
Proof.
Fix . Let and be an eigenvalue decomposition (EVD) of and , respectively. We can assume that the eigenvector matrices U and W are unitary. We have
and consequently,
for all . Therefore, since
Appendix C. Proof of Theorem 2
Proof.
Fix . Since
we obtain
where with . Fix , and consider a real eigenvector corresponding to with . Let be an EVD of , where U is real and orthogonal. Then
where with for all and . Consequently,
Therefore, to finish the proof we only need to show that . Applying ([5] (Equations (14) and (15))) yields
□
References
- Gutiérrez-Gutiérrez, J.; Crespo, P.M. Block Toeplitz matrices: Asymptotic results and applications. Found. Trends Commun. Inf. Theory 2011, 8, 179–257.
- Gray, R.M. Toeplitz and circulant matrices: A review. Found. Trends Commun. Inf. Theory 2006, 2, 155–239.
- Ephraim, Y.; Lev-Ari, H.; Gray, R.M. Asymptotic minimum discrimination information measure for asymptotically weakly stationary processes. IEEE Trans. Inf. Theory 1988, 34, 1033–1040.
- Gray, R.M. On the asymptotic eigenvalue distribution of Toeplitz matrices. IEEE Trans. Inf. Theory 1972, IT-18, 725–730.
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Insausti, X. Upper bounds for the rate distortion function of finite-length data blocks of Gaussian WSS sources. Entropy 2017, 19, 554.
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Villar-Rosety, F.M.; Insausti, X. Rate-distortion function upper bounds for Gaussian vectors and their applications in coding AR sources. Entropy 2018, 20, 399.
- Viswanathan, H.; Berger, T. The quadratic Gaussian CEO problem. IEEE Trans. Inf. Theory 1997, 43, 1549–1559.
- Torezzan, C.; Panek, L.; Firer, M. A low complexity coding and decoding strategy for the quadratic Gaussian CEO problem. J. Frankl. Inst. 2016, 353, 643–656.
- Kolmogorov, A.N. On the Shannon theory of information transmission in the case of continuous signals. IRE Trans. Inf. Theory 1956, IT-2, 102–108.
- Neeser, F.D.; Massey, J.L. Proper complex random processes with applications to information theory. IEEE Trans. Inf. Theory 1993, 39, 1293–1302.
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Insausti, X.; Hogstad, B.O. On the complexity reduction of coding WSS vector processes by using a sequence of block circulant matrices. Entropy 2017, 19, 95.
- Pearl, J. On coding and filtering stationary signals by discrete Fourier transforms. IEEE Trans. Inf. Theory 1973, 19, 229–232.
- Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and Hermitian block Toeplitz matrices with continuous symbols: Applications to MIMO systems. IEEE Trans. Inf. Theory 2008, 54, 5671–5680.
- Gutiérrez-Gutiérrez, J. A modified version of the Pisarenko method to estimate the power spectral density of any asymptotically wide sense stationary vector process. Appl. Math. Comput. 2019, 362, 124526.
- Gutiérrez-Gutiérrez, J.; Zárraga-Rodríguez, M.; Crespo, P.M.; Insausti, X. Rate distortion function of Gaussian asymptotically WSS vector processes. Entropy 2018, 20, 719.
- Gray, R.M. Information rates of autoregressive processes. IEEE Trans. Inf. Theory 1970, IT-16, 412–421.
- Toms, W.; Berger, T. Information rates of stochastically driven dynamic systems. IEEE Trans. Inf. Theory 1971, 17, 113–114.
- Reinsel, G.C. Elements of Multivariate Time Series Analysis; Springer: Berlin/Heidelberg, Germany, 1993.
- Gutiérrez-Gutiérrez, J.; Crespo, P.M. Asymptotically equivalent sequences of matrices and multivariate ARMA processes. IEEE Trans. Inf. Theory 2011, 57, 5444–5454.
- Gutiérrez-Gutiérrez, J.; Iglesias, I.; Podhorski, A. Geometric MMSE for one-sided and two-sided vector linear predictors: From the finite-length case to the infinite-length case. Signal Process. 2011, 91, 2237–2245.