Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition

Rusu, Alexandru-George; Ciochină, Silviu; Paleologu, Constantin; Benesty, Jacob

doi:10.3390/electronics11030409

Open AccessArticle

Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition

¹

Department of Telecommunications, University Politehnica of Bucharest, 061071 Bucharest, Romania

²

Department of Research and Development, Rohde & Schwarz Topex, 020335 Bucharest, Romania

³

INRS-EMT, University of Quebec, Montreal, QC H5A 1K6, Canada

^*

Author to whom correspondence should be addressed.

Electronics 2022, 11(3), 409; https://doi.org/10.3390/electronics11030409

Submission received: 10 December 2021 / Revised: 13 January 2022 / Accepted: 27 January 2022 / Published: 29 January 2022

(This article belongs to the Special Issue Efficient Algorithms and Architectures for DSP Applications)

Download

Browse Figures

Versions Notes

Abstract

:

The multilinear system framework allows for the exploitation of the system identification problem from different perspectives in the context of various applications, such as nonlinear acoustic echo cancellation, multi-party audio conferencing, and video conferencing, in which the system could be modeled through parallel or cascaded filters. In this paper, we introduce different memoryless and memory structures that are described from a bilinear perspective. Following the memory structures, we develop the multilinear recursive least-squares algorithm by considering the Kronecker product decomposition concept. We have performed a set of simulations in the context of echo cancellation, aiming both long length impulse responses and the reverberation effect.

Keywords:

recursive least-squares (RLS) algorithm; adaptive filters; Kronecker product decomposition; system identification; echo cancellation

1. Introduction

In the field of system identification, many applications involve adaptive filtering algorithms [1,2]. One of them is the echo cancellation problem, which has raised many challenges over the years [3,4]. Based on the input-output relation, a dynamic system should be determined (i.e., the echo path), considering various parameters and external factors that must be estimated. These dynamic systems are modeled linearly through an adaptive filter with a finite-impulse-response (FIR) structure [5,6]. The main performance bottlenecks, in terms of computational complexity, tracking, and convergence rate, arise when the length of the impulse response reaches hundreds/thousands of coefficients. The literature presents many approaches to improve the overall performance, also taking into account the fact that the echo paths are sparse in nature [7,8,9,10,11,12,13]. Recently, in our previous work [14], we introduced a new approach of splitting a long length impulse response into several impulse responses of shorter lengths, aiming to reduce the computational complexity by maintaining the overall performance. Another challenge arises when the echo path produces multiple reflections, and this effect is called reverberation. From a mathematical point of view, this effect could be described (to some extent) by using the Kronecker product decomposition of the impulse response [15,16].

In this paper, we extend our study on cascaded adaptive filters, aiming to reduce the computational complexity considering both long length impulse responses and the reverberation effect. Our approach is based on multilinear structures and the Kronecker product decomposition. The main goal is to outline the features of this development and its potential.

The rest of the paper is organized as follows. Section 2 presents the background for different bilinear structures without memory, while Section 3 introduces bilinear structures with memory. In Section 4, the new development is combined with the recursive least-squares (RLS) algorithm, thus resulting in a practical solution based on adaptive filtering. We perform an experimental study in Section 5 and conclude the paper in Section 6.

2. Bilinear Structures without Memory

In order to introduce the bilinear structures with memory and the development based on the Kronecker product decomposition, let us start by presenting the bilinear structure without memory [14,17,18], defined as

\begin{matrix} y (n) & = h_{1}^{T} X (n) h_{2} = \sum_{l_{1} = 1}^{L_{1}} \sum_{l_{2} = 1}^{L_{2}} x_{l_{1} l_{2}} (n) h_{1, l_{1}} h_{2, l_{2}}, \end{matrix}

(1)

where

X (n)

is the multiple input data matrix of size

L_{1} \times L_{2}

, with

\begin{matrix} X (n) = [\begin{matrix} x_{1} (n) & \dots & x_{l_{2}} (n) & \dots & x_{L_{2}} (n) \end{matrix}], \end{matrix}

(2)

and

\begin{matrix} x_{l_{2}} (n) = {[\begin{matrix} x_{l_{2}, 1} (n) & \dots & x_{l_{2}, l_{1}} (n) & \dots & x_{l_{2}, L_{1}} (n) \end{matrix}]}^{T}, l_{2} = 1, 2, \dots, L_{2} \end{matrix}

(3)

is an input signal vector containing the

L_{1}

most recent data at the discrete-time index n, while the superscript ^T is the transpose operator. The two impulse responses

h_{1}

and

h_{2}

have

L_{1}

and

L_{2}

coefficients, respectively. In other words, the input-output equation in (1) describes a system with

L_{1} L_{2}

inputs and a single output. In order to facilitate the graphical representation, let us rewrite (1) as

\begin{matrix} y (n) & = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} \sum_{l_{2} = 1}^{L_{2}} h_{2, l_{2}} x_{l_{1} l_{2}} (n) = \sum_{l_{2} = 1}^{L_{2}} h_{2, l_{2}} \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} x_{l_{1} l_{2}} (n) \\ = \sum_{l_{2} = 1}^{L_{2}} h_{2, l_{2}} s_{1, l_{2}} (n), \end{matrix}

(4)

where

\begin{matrix} s_{1, l_{2}} (n) = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} x_{l_{1} l_{2}} (n) \end{matrix}

(5)

is the output of a memoryless weighted adder with

L_{1}

inputs at the discrete-time index n. We can transpose (5) in a graphical representation as shown in Figure 1.

Based on (4) and Figure 1, we can introduce the graphical respresentation of the multiple-input single-output (MISO) system as shown in Figure 2. Overall, this structure consists of two levels of combiners.

3. Bilinear Structures with Memory

By introducing a delay line, the

s_{1, l_{2}} (n)

structure described by

L_{1}

inputs and a single output can be transformed in a single-input single-output (SISO) structure. Therefore, the following input signal vector results

\begin{matrix} x_{l_{2}} (n) = {[\begin{matrix} x_{l_{2}} (n) & \dots & x_{l_{2}} (n - l_{1} + 1) & \dots & x_{l_{2}} (n - L_{1} + 1) \end{matrix}]}^{T}, l_{2} = 1, 2, \dots, L_{2} . \end{matrix}

(6)

Thus, the input data matrix has the following structure:

X (n) = [\begin{matrix} x_{1} (n) & \dots & x_{l_{2}} (n) & \dots & x_{L_{2}} (n) \\ x_{1} (n - 1) & \dots & x_{l_{2}} (n - 1) & \dots & x_{L_{2}} (n - 1) \\ ⋮ & ⋱ & ⋮ & ⋱ & ⋮ \\ x_{1} (n - L_{1} + 1) & \dots & x_{l_{2}} (n - L_{1} + 1) & \dots & x_{L_{2}} (n - L_{1} + 1) \end{matrix}] .

Hence,

\begin{matrix} s_{1, l_{2}} (n) = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} x_{l_{2}} (n - l_{1} + 1) \end{matrix}

(7)

is a structure associated to a transversal filter, with the weighted function

h_{1}

, having as input the vector

x_{l_{2}} (n)

. The graph representation of the new

s_{1, l_{2}} (n)

structure is shown in Figure 3. Also, Figure 4 outlines the two combiner level structures based on the transversal filters.

In terms of the z-transform, (7) is defined as

\begin{matrix} S_{1, l_{2}} (z) & = \sum_{n = - \infty}^{\infty} s_{1, l_{2}} (n) z^{- n} = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} \sum_{n = - \infty}^{\infty} x_{l_{2}} (n - l_{1} + 1) z^{- n} \\ = X_{l_{2}} (z) \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} z^{- l_{1} + 1} \\ = X_{l_{2}} (z) H_{1} (z), \end{matrix}

(8)

where

\begin{matrix} H_{1} (z) = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} z^{- l_{1} + 1} . \end{matrix}

(9)

A more efficient form in terms of correlation between the columns of the input matrix

X (n)

can be obtained if we consider successive data related to the columns, and is defined as

X (n) = [\begin{matrix} x_{1} (n) & \dots & x_{1} (n - (l_{2} - 1) L_{1}) & \dots & x_{1} (n - (L_{2} - 1) L_{1}) \\ x_{1} (n - 1) & \dots & x_{1} (n - (l_{2} - 1) L_{1} - 1) & \dots & x_{1} (n - (L_{2} - 1) L_{1} - 1) \\ ⋮ & ⋱ & ⋮ & ⋱ & ⋮ \\ x_{1} (n - L_{1} + 1) & \dots & x_{1} (n - (l_{2} - 1) L_{1} - L_{1} + 1) & \dots & x_{1} (n - (L_{2} - 1) L_{1} - L_{1} + 1) \end{matrix}] .

The input signal vector becomes a sequence of

L_{1} L_{2}

successive data applied to a FIR filter of length

L_{1} L_{2}

, so that

\begin{matrix} x (n) = vec [X (n)], \end{matrix}

(10)

where

vec (\cdot)

denotes the vectorization operation and

\begin{matrix} s_{1, l_{2}} (n) = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} x_{1} (n - l_{1} L_{2}), \end{matrix}

(11)

with

x_{1} (n) = x_{1} (n - L_{1}), \dots, x_{l_{2}} (n) = x_{1} (n - l_{1} L_{2}), \dots, x_{L_{2}} (n) = x_{1} (n - L_{1} L_{2})

. In terms of the z-transform, we can write (11) as

\begin{matrix} S_{1, l_{2}} (z) & = \sum_{l_{1} = 1}^{L_{1}} X_{1} (z) z^{- l_{1} L_{2}} h_{1, l_{1}} \\ = X_{1} (z) \sum_{l_{1} = 1}^{L_{1}} z^{- l_{1} L_{2}} h_{1, l_{1}} = X_{1} (z) H_{1} (z^{L_{2}}) . \end{matrix}

(12)

Forwards, the output of the global system in the z-transform domain is

\begin{matrix} Y (z) = S_{1, l_{2}} (z) H_{2} (z) = X_{1} (z) H_{1} (z^{L_{2}}) H_{2} (z) = X_{1} (z) H (z), \end{matrix}

(13)

where

\begin{matrix} H (z) = H_{1} (z^{L_{2}}) H_{2} (z) . \end{matrix}

(14)

Equation (12) describes a SISO structure of two cascaded filters as shown in Figure 5. The first filter

H_{1} (z^{L_{2}})

is obtained through interpolation with zeroes by

L_{2}

factor of the

H_{1} (z)

function and its length is

L_{2} (L_{1} - 1) + 1

. Indeed, it has only

L_{1}

non-zero coefficients, from the total of

L_{2} (L_{1} - 1) + 1

, (hence a certain degree of sparsity), according to its impulse response

\begin{matrix} h_{1}^{^{'}} (n) = \{\begin{matrix} h_{1} (\frac{n}{L_{1}}), & for n \mod L_{1} = 0, n = 1, 2, \dots, L_{2} (L_{1} - 1) + 1 \\ 0, & otherwise \end{matrix}, \end{matrix}

(15)

where

h_{1}^{^{'}} (n)

is the impulse response of

H_{1} (z^{L_{2}})

and

\mod

denotes the modulo operation. The second filter,

H_{2} (z)

, is of length

L_{2}

. Afterwards, the total length of the

H (z)

filter is

L_{2} (L_{1} - 1) + 1 + L_{2} - 1 = L_{1} L_{2}

.

Based on this configuration, let us consider the two vectors:

h_{1} = [\begin{matrix} h_{1, 1} \\ h_{1, 2} \\ ⋮ \\ h_{1, l_{1}} \\ ⋮ \\ h_{1, L_{1}} \end{matrix}], h_{2} = [\begin{matrix} h_{2, 1} \\ h_{2, 2} \\ ⋮ \\ h_{2, l_{2}} \\ ⋮ \\ h_{2, L_{2}} \end{matrix}],

and the Kronecker product:

h = h_{1} \otimes h_{2} = [\begin{matrix} h_{1, 1} h_{2} \\ h_{1, 2} h_{2} \\ ⋮ \\ h_{1, l_{1}} h_{2} \\ ⋮ \\ h_{1, L_{1}} h_{2} \end{matrix}] .

Having as coefficients the elements of this vector, the polynomial form is developed as

\begin{matrix} H (z) = \sum_{r = 0}^{L_{1} L_{2} - 1} h_{r} z^{- r} \end{matrix}

(16)

and can be factored in the form described in (14). While the position of an element for the

l_{1}, l_{2}

indexes is

r = (l_{1} - 1) L_{2} + l_{2} - 1

, we have

\begin{matrix} H_{1} (z^{L_{2}}) H_{2} (z) & = \sum_{l_{1} = 1}^{L_{1}} h_{1, l_{1}} z^{- (l_{1} - 1) L_{2}} \sum_{l_{2} = 1}^{L_{2}} h_{2, l_{2}} z^{- (l_{2} - 1)} \\ = \sum_{l_{1} = 1}^{L_{1}} \sum_{l_{2} = 1}^{L_{2}} h_{1, l_{1}} h_{2, l_{2}} z^{- (l_{1} - 1) L_{2} - (l_{2} - 1)} \\ = \sum_{r = 0}^{L_{1} L_{2} - 1} h_{r} z^{- r} . \end{matrix}

(17)

4. Cascaded Multilinear RLS Algorithm Using Kronecker Product Decomposition

Based on the development from Section 3, we introduce the set of equations for the RLS algorithm in a multilinear manner, following the Kronecker product decomposition [14,19]. Our approach is determined considering the system identification framework. In this context, the output of the MISO system is

\begin{matrix} y (n) = X (n) \times_{1} h_{1}^{T} \times_{2} h_{2}^{T} \times_{i} \dots \times_{N} h_{N}^{T}, \end{matrix}

(18)

where N denotes the multilinear degree and

\times_{i}

represents the multiplication operation by the dimension

i = 1, 2, \dots, N

. The input data are described in a N degree tensorial form as

{[X (n)]}_{l_{1} l_{2} \dots l_{N}}

with the real-values

x_{l_{1} l_{2} \dots l_{N}}, l_{i} = 1, 2, \dots, L_{i}, i = 1, 2, \dots, N

. The vector

h_{i}

of length

L_{i}

, stores the impulse response for the i cascaded filter,

i = 1, 2, \dots, N

. Based on the

h_{i}

(

i = 1, 2, \dots, N

) impulse responses of the MISO system, the rank-1 tensor of dimension

L_{1} \times L_{2} \times \dots \times L_{N}

is

\begin{matrix} H = h_{1} \circ h_{2} \circ \dots \circ h_{N}, \end{matrix}

(19)

where ∘ denotes the outer product. Usually, in the context of system identification, the desired signal results from the output signal corrupted by an additive noise,

w (n)

, which in our development is a zero-mean Gaussian signal, so that

\begin{matrix} d (n) = y (n) + w (n) . \end{matrix}

(20)

Consequently, the output signal described by (18) results in

\begin{matrix} y (n) = {vec}^{T} [X (n)] vec (H), \end{matrix}

(21)

where

vec [X (n)] = [\begin{matrix} vec [X_{: : \dots : L_{1}} (n)] \\ vec [X_{: : \dots : L_{2}} (n)] \\ ⋮ \\ vec [X_{: : \dots : L_{N}} (n)] \end{matrix}] ≜ \tilde{x} (n)

and

vec (H) = [\begin{matrix} vec (H_{: : \dots : L_{1}}) \\ vec (H_{: : \dots : L_{2}}) \\ ⋮ \\ vec (H_{: : \dots : L_{N}}) \end{matrix}] = h_{N} \otimes \dots \otimes h_{2} \otimes h_{1} ≜ h,

with

X_{: : \dots : l_{i}} (n) \in R^{L_{1} \times L_{2} \times \dots \times L_{N - 1}}

and

H_{: : \dots : l_{i}} (n) \in R^{L_{1} \times L_{2} \times \dots \times L_{N - 1}}

representing the frontal slices of

X (n)

and

H (n)

, respectively. The two new vectors

\tilde{x} (n)

and

h

consist of

L_{1} L_{2} \dots L_{N}

elements. Also, the output of the system can be rewritten as

\begin{matrix} y (n) = {\tilde{x}}^{T} (n) h . \end{matrix}

(22)

Then, the a priori error signal is computed as

\begin{matrix} e (n) = d (n) - \hat{y} (n), \end{matrix}

(23)

where

\hat{y} (n)

represents an estimate of the output signal. Following the least-squares (LS) error criterion, we can introduce the cost functions:

\begin{matrix} J [{\hat{h}}_{1} (n)] & = \sum_{k = 1}^{n} λ_{1}^{n - k} {[d (n) - {\hat{h}}_{1}^{T} (n) {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n)]}^{2}, \\ J [{\hat{h}}_{2} (n)] & = \sum_{k = 1}^{n} λ_{2}^{n - k} {[d (n) - {\hat{h}}_{2}^{T} (n) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n)]}^{2}, \\ ⋮ \\ J [{\hat{h}}_{N} (n)] & = \sum_{k = 1}^{n} λ_{N}^{n - k} {[d (n) - {\hat{h}}_{N}^{T} (n) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n)]}^{2}, \end{matrix}

(24)

where

0 < λ_{i} \leq 1

,

i = 1, 2, \dots, N

, represent the forgetting factors and

\begin{matrix} {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = {[{\hat{h}}_{N} (n - 1) \otimes \dots \otimes {\hat{h}}_{2} (n - 1) \otimes I_{L_{1}}]}^{T} \tilde{x} (n), \\ {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = {[{\hat{h}}_{N} (n - 1) \otimes \dots \otimes I_{L_{2}} \otimes {\hat{h}}_{1} (n - 1)]}^{T} \tilde{x} (n), \\ ⋮ \\ {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) & = {[I_{L_{N}} \otimes \dots \otimes {\hat{h}}_{2} (n - 1) \otimes {\hat{h}}_{1} (n - 1)]}^{T} \tilde{x} (n), \end{matrix}

(25)

with

I_{L_{i}}

denoting the identity matrix of size

L_{i} \times L_{i}

,

i = 1, 2, \dots, N

. Following the minimization of the cost functions

J [{\hat{h}}_{1} (n)]

,

J [{\hat{h}}_{2} (n)]

,…,

J [{\hat{h}}_{N} (n)]

, the update equations of the RLS algorithm in the multilinear approach result:

\begin{matrix} {\hat{h}}_{1} (n) & = {\hat{h}}_{1} (n - 1) + s_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) e_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n), \\ {\hat{h}}_{2} (n) & = {\hat{h}}_{2} (n - 1) + s_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) e_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n), \\ ⋮ \\ {\hat{h}}_{N} (n) & = {\hat{h}}_{N} (n - 1) + s_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) e_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n), \end{matrix}

(26)

where the a priori errors are defined as

\begin{matrix} e_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = d (n) - {\hat{h}}_{1}^{T} (n - 1) {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n), \\ e_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = d (n) - {\hat{h}}_{2}^{T} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n), \\ ⋮ \\ e_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) & = d (n) - {\hat{h}}_{N}^{T} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n), \end{matrix}

(27)

with

\begin{matrix} s_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = P_{{\hat{h}}_{1}} (n - 1) {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) {[λ_{1} + {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}}^{T} (n) P_{{\hat{h}}_{1}} (n - 1) {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n)]}^{- 1}, \\ s_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) & = P_{{\hat{h}}_{2}} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) {[λ_{2} + {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}}^{T} (n) P_{{\hat{h}}_{2}} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n)]}^{- 1}, \\ ⋮ \\ s_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) & = P_{{\hat{h}}_{N}} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) \\ \times {[λ_{N} + {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}}^{T} (n) P_{{\hat{h}}_{N}} (n - 1) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n)]}^{- 1} \end{matrix}

(28)

and

\begin{matrix} P_{{\hat{h}}_{1}} (n) & = λ_{1}^{- 1} P_{{\hat{h}}_{1}} (n - 1) - λ_{1}^{- 1} s_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) {\tilde{x}}_{{\hat{h}}_{2} {\hat{h}}_{3} \dots {\hat{h}}_{N}}^{T} (n) P_{{\hat{h}}_{1}} (n - 1), \\ P_{{\hat{h}}_{2}} (n) & = λ_{2}^{- 1} P_{{\hat{h}}_{2}} (n - 1) - λ_{2}^{- 1} s_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}} (n) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{3} \dots {\hat{h}}_{N}}^{T} (n) P_{{\hat{h}}_{2}} (n - 1), \\ ⋮ \\ P_{{\hat{h}}_{N}} (n) & = λ_{N}^{- 1} P_{{\hat{h}}_{N}} (n - 1) - λ_{N}^{- 1} s_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}} (n) {\tilde{x}}_{{\hat{h}}_{1} {\hat{h}}_{2} \dots {\hat{h}}_{N - 1}}^{T} (n) P_{{\hat{h}}_{N}} (n - 1) . \end{matrix}

(29)

In fact, the equations from (26) represent a multilinear optimization strategy, where

N - 1

impulse responses are considered fixed during the optimization of the remaining one [20]. In other words, in each of the cost functions from Equation (24), for the optimization of

{\hat{h}}_{i} (n)

, we consider that the other

{\hat{h}}_{j} (n)

, with

i \neq j

, are fixed. The initialization of the RLS-based algorithms is influenced by the initialization of the matrix

P (n)

, which represents a recursive estimate of the inverse of the covariance matrix of the input signal [21]. In fact, this is the initialization factor that controls the initial convergence of the algorithm. Usually, this initialization is

P (0) = δ^{- 1} I_{L}

, where

δ

is the so-called regularization parameter and

I_{L}

is the identity matrix of size

L \times L

. This regularization parameter depends on the length of the filter and the power of the input signal. In the case of the RLS-CKD algorithm, the matrices from (29) should be initialized in a similar manner. However, even if the initialization of the conventional RLS and RLS-CKD algorithms could be different from this point of view, the regularization parameters do not bias the overall performance, since their influence (for n large enough) is negligible due to the forgetting factors (i.e.,

λ

for the conventional RLS and

λ_{i}

for the RLS-CKD algorithm), which are positive constants smaller than 1. Finally, the cascaded multilinear RLS algorithm based on the Kronecker product decomposition (RLS-CKD) is defined by Equations (26)–(29). While the classical RLS algorithm involves matrices of size

L \times L

, the RLS-CKD algorithm solves the system identification problem by splitting the long length impulse response in shorter length impulse responses, so that it implies matrices of sizes

L_{i} \times L_{i}

,

i = 1, 2, \dots, N

, where

L = L_{1} L_{2} \dots L_{N}

. The classical RLS algorithm involves a computational complexity of

O (L^{2})

. In the case of the RLS-CKD algorithm, the computation complexity results as a sum of

O (L_{i}^{2})

. Following the presented approach, the computational complexity of the RLS-CKD is reduced to

O (L_{1}^{2}) + O (L_{2}^{2}) + \dots + O (L_{N}^{2}) + O (N L)

, with

N ≪ L

. The extra

O (N L)

computational amount is due to the Kronecker product operations. At this point, we can observe a drastic reduction in computational complexity for the RLS-CKD algorithm as compared to that of the classical RLS algorithm, especially for impulse responses of long length (as in echo cancellation).

5. Simulation Results

In order to simulate the RLS-CKD algorithm, we have chosen two different multilinear degrees,

N = 2

(bilinear) and

N = 3

(trilinear), considering the echo cancellation framework. As input signals, we have used white Gaussian noise (i.e., a random process with standard normal distribution, zero mean, and unit variance), an AR(1) process produced by filtering a white Gaussian noise through a first-order system

1 / (1 - 0.9 z^{- 1})

, and a speech sequence, at a sample rate of 8 kHz. For the purpose of these simulations, we have considered that the output of the target system (i.e., the echo signal) is corrupted by white Gaussian noise [i.e.,

w (n)

], considering an echo-to-noise ratio (ENR) of 20 dB when the input signal is a white Gaussian noise or an AR(1) process, and 30 dB when the input signal is a speech sequence. In order to measure the performance, we have used the normalized misalignment in dB, defined as

\begin{matrix} NM [h, \hat{h} (n)] = 20 \log_{10} [\frac{| | h - \hat{h} (n) {| |}_{2}}{{| | h | |}_{2}}], \end{matrix}

(30)

where

| | \cdot {| |}_{2}

denotes the Euclidean norm and

\begin{matrix} \hat{h} (n) = {\hat{h}}_{N} (n) \otimes \dots \otimes {\hat{h}}_{2} (n) \otimes {\hat{h}}_{1} (n) . \end{matrix}

(31)

As initialization we have used

{\hat{h}}_{1} (0) = {[1 0_{L_{1} - 1}^{T}]}^{T}

(i.e., the first coefficient is equal to one, which is followed by

L_{1} - 1

zeros), while the other impulse responses

{\hat{h}}_{j} (0)

, with

j = 2, 3, \dots, N

are initialized as

\begin{matrix} {\hat{h}}_{j} (0) = \frac{1}{L_{j}} 1_{L_{j}}, \end{matrix}

(32)

where

1_{L_{j}}

denotes a column vector with all its

L_{j}

elements equal to one. The conventional all-zeros initialization specific to most of the adaptive filtering algorithms cannot be used in the case of tensor-based algorithms, due to connection between the individual filters, as shown in Equation (25). In this case, the initialization

{\hat{h}}_{i} (0) = 0_{L_{i}}

(

i = 1, 2, \dots, N

) would stall the algorithm.

For the first set of simulations that implies the bilinear approach, we have considered the impulse responses depicted in Figure 6. In the first plot, Figure 6a, the first impulse response

h_{1}

from the G168 Recommendation [22] is represented (i.e., a 64 coefficients cluster). Next, Figure 6b depicts the second impulse response

h_{2}

, evaluated as

h_{2 l_{2}} = {0.5}^{l 2 - 1}

, with

l_{2} = 1, 2, \dots, L_{2}

, where

L_{2} = 8

. The third impulse response is the target that must be determined and is obtained as the Kronecker product between the first two impulse responses, i.e.,

h = h_{2} \otimes h_{1}

. This impulse response is similar to the echo produced by an acoustic environment characterized by a reverberation effect and its length is

L = L_{1} L_{2} = 512

coefficients. Here, we consider the case of a linearly separable system, which is the benchmark of our approach, and show how it can be efficiently exploited in the framework of system identification problems. The impulse response from Figure 6c could correspond to a channel with echoes. This repetitive (but not periodic) structure could also result if a certain impulse response is followed by its reflections, e.g., as in wireless transmissions. The method allows temporal localization and magnitude estimation of the reflections, considering a temporal grid, without any restrictions of periodicity. However, the tensor-based adaptive algorithms can efficiently model the separable part of the system. The forgetting factor used for the RLS algorithm is computed as

λ = 1 - 1 / (K L)

, with

K = 10

in the bilinear context and

K = 1

in the trilinear context, while for the RLS-CKD algorithm is computed as

λ_{i} = 1 - 1 / (M K L_{i})

,

i = 1, 2

, with

K = 10

and

M = 1, 3, 5

.

In the first simulation represented in Figure 7, we analyze the performance of the RLS-CKD algorithm with that of the classical RLS algorithm. The echo path changes after 4 s of simulation by changing the impulse response

h_{2}

with a random impulse response of the same length, with samples between 0 and 0.5. In the first part of the plot, we can remark that the RLS-CKD algorithm achieves a convergence rate similar to that of the classical RLS algorithm. Regarding the tracking capability, when the echo path changes, the RLS-CKD algorithm outperforms the RLS algorithm. The RLS-CKD achieves a normalized misalignment of −30 dB in less than 200 ms. We can remark that the constant value M only affects the normalized misalignment level when the echo path changes.

Next, in Figure 8, we analyze the behavior of the RLS-CKD algorithm in a scenario where the input signal is an AR(1) process. The echo path changes in the same manner as in the previous scenario. In this case, the RLS-CKD algorithm achieves an even lower normalized misalignment of almost 10 dB (e.g., when

M = 5

) compared to the RLS algorithm. When the echo path changes, the values of M do not impact the RLC-CKD algorithm too much. This time, the RLS-CKD achieves a normalized misalignment of −40 dB in less than 500 ms, while the RLS algorithm requires at least 3 s to achieve a comparable level.

We conclude the first set of simulations with the scenario depicted in Figure 9, where the input signal is a speech sequence and the echo path changes in the middle of the simulation in the same manner.

As we can notice in Figure 9, the steady-state misalignment of the conventional RLS algorithm (the blue curve) is similar to the misalignment of the RLS-CKD algorithm using

M = 1

, while the initial convergence rate and tracking of the proposed algorithm are much better. A larger value of M influences only the initial convergence rate of the RLS-CKD algorithm, but keeps the same fast tracking reaction. On the other hand, the steady-state misalignment of the RLS-CKD is improving for a larger value of M (i.e., for larger values of the forgetting factors

λ_{i}

, closer to 1).

Furthermore, we continue the simulations with the trilinear approach, based on the impulse responses from Figure 10. In this case, we have considered an even longer echo path of thousands of coefficients. The echo path of the system that must be identified is obtained as

h = h_{3} \otimes h_{2} \otimes h_{1}

, of size

L = L_{1} L_{2} L_{3} = 2048

, with

h_{1}

(

L_{1} = 64

) from Figure 6a,

h_{2}

(

L_{2} = 8

) from Figure 10a, and

h_{3}

(

L_{3} = 4

) from Figure 10b. The second impulse response (i.e.,

h_{2}

) is randomly generated, with samples between 0 and 0.5, while the third impulse response (i.e.,

h_{3}

) is obtained as

h_{3 l_{3}} = {0.5}^{l 3 - 1}

, with

l_{3} = 1, 2, \dots, L_{3}

, where

L_{3} = 4

.

In Figure 11, the first simulation in the trilinear scenario is represented. The input signal is a white Gaussian noise and the echo path changes by generating

h_{3}

as a random impulse response after 4 s, so this impacts the whole system. It is worth noting that the RLS-CKD algorithm presents a slightly faster converge rate compared to that of the RLS algorithm and a lower normalized misalignment for

M = 5

of at least 10 dB. In terms of tracking, the RLS-CKD algorithm succeeds in re-estimating the new echo path and we can see that the smaller the forgetting factor is (i.e.,

M = 1

), the faster the tracking.

In the scenario represented in Figure 12, the input signal is an AR(1) process and the echo path changes by regenerating

h_{3}

after 4 s. The RLS-CKD algorithm outperforms the classical RLS algorithm in terms of convergence rate, normalized misalignment, and tracking capability, with a much lower computational complexity. Finally, in Figure 13, we conclude the set of simulations with a scenario where the input signal is a speech sequence. Again, the echo path changes by regenerating

h_{3}

after 6 s of simulation. While the classical RLS algorithm requires more than 3 s to achieve a reasonable normalized misalignment level, the RLS-CKD algorithm succeeds at estimating the target system, presenting a good tracking capability even when the echo path changes. However, for a faster convergence rate, the RLS-CKD algorithm requires a much lower forgetting factor (e.g.,

M = 1

). Also, the steady-state misalignment of the conventional RLS algorithm (after the change of the system) is similar to the misalignment of the RLS-CKD algorithm using

M = 1

.

6. Conclusions

In this paper, we have introduced various memoryless and memory structures described by a bilinear input-output relation. Based on this approach, we have obtained a SISO system from a MISO system, which is a cascade of shorter length filters. We then developed the multilinear RLS algorithm considering the Kronecker product decomposition and outlining the reduction in terms of computational complexity. Finally, we have presented a set of simulations as a comparison between the newly developed RLS-CKD algorithm and the classical RLS algorithm. Simulations proved that the RLS-CKD algorithm outperforms the classical RLS algorithm in terms of convergence rate, normalized misalignment, and tracking capability. We can conclude that the RLS-CKD algorithm is a good candidate for real-time applications, which implies long length impulse responses and systems characterized by reverberation.

Author Contributions

Conceptualization, A.-G.R.; Formal analysis, S.C.; Software, C.P.; Methodology, J.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by a grant of the Romanian Ministry of Education and Research, CNCS-UEFISCDI, project number: PN-III-P1-1.1-TE-2019-0420, within PNCDI III.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

Haykin, S. Adaptive Filter Theory, 4th ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
Benesty, J.; Huang, Y. (Eds.) Adaptive Signal Processing–Applications to Real-World Problems; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
Gay, S.L.; Benesty, J. (Eds.) Acoustic Signal Processing for Telecommunication; Kluwer Academic Publisher: Boston, MA, USA, 2000. [Google Scholar]
Benesty, J.; Gaensler, T.; Morgan, D.R.; Sondhi, M.M.; Gay, S.L. Advances in Network and Acoustic Echo Cancellation; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
Tsoulos, I.G.; Stavrou, V.; Mastorakis, N.E.; Tsalikakis, D. GenConstraint: A programming tool for constraint optimization problems. SoftwareX 2019, 10. [Google Scholar] [CrossRef]
Stavrou, V.N.; Tsoulos, I.G.; Mastorakis, N.E. Transformations for FIR and IIR Filters’ Design. Symmetry 2021, 13, 533. [Google Scholar] [CrossRef]
Duttweiler, D.L. Proportionate normalized least-mean-squares adaptation in echo cancelers. IEEE Trans. Speech Audio Process. 2000, 8, 508–518. [Google Scholar] [CrossRef] [Green Version]
Benesty, J.; Gay, S.L. An improved PNLMS algorithm. In Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 13–17 May 2002; pp. II-1881–II-1884. [Google Scholar]
Deng, H.; Doroslovački, M. Proportionate adaptive algorithms for network echo cancellation. IEEE Trans. Signal Process. 2006, 54, 1794–1803. [Google Scholar] [CrossRef]
Loganathan, P.; Khong, A.W.; Naylor, P. A class of sparseness-controlled algorithms for echo cancellation. IEEE Trans. Audio Speech Lang. Process. 2009, 17, 1591–1601. [Google Scholar] [CrossRef]
Paleologu, C.; Benesty, J.; Ciochină, S. Sparse Adaptive Filters for Echo Cancellation; Morgan & Claypool Publishers: San Rafael, CA, USA, 2010. [Google Scholar]
Yang, Z.; Zheng, Y.R.; Grant, S.L. Proportionate affine projection sign algorithms for network echo cancellation. IEEE Trans. Audio Speech Lang. Process. 2011, 19, 2273–2284. [Google Scholar] [CrossRef]
Liu, J.; Grant, S.L. Proportionate adaptive filtering for block-sparse system identification. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 24, 623–630. [Google Scholar] [CrossRef] [Green Version]
Rusu, A.-G.; Ciochină, S. Cascaded adaptive filters in a bilinear approach for system identification. In Proceedings of the 2020 International Symposium on Electronics and Telecommunications (ISETC), Timisoara, Romania, 5–6 November 2020; pp. 1–4. [Google Scholar]
Loan, C.F.V. The ubiquitous Kronecker product. J. Comput. Appl. Math. 2000, 123, 85–100. [Google Scholar] [CrossRef] [Green Version]
Benesty, J.; Cohen, I.; Chen, J. Array Processing–Kronecker Product Beamforming; Springer: Cham, Switzerland, 2019. [Google Scholar]
Benesty, J.; Paleologu, C.; Ciochină, S. On the identification of bilinear forms with the Wiener filter. IEEE Signal Process. Lett. 2017, 24, 653–657. [Google Scholar] [CrossRef]
Paleologu, C.; Benesty, J.; Ciochină, S. Adaptive filtering for the identification of bilinear forms. Digital Signal Process. 2018, 75, 153–167. [Google Scholar] [CrossRef]
Dogariu, L.-M.; Stanciu, C.L.; Elisei-Iliescu, C.; Paleologu, C.; Benesty, J.; Ciochină, S. Tensor-based adaptive filtering algorithms. Symmetry 2021, 13, 481. [Google Scholar] [CrossRef]
Bertsekas, D.P. Nonlinear Programming, 2nd ed.; Athena Scientific: Belmont, MA, USA, 1999. [Google Scholar]
Benesty, J.; Paleologu, C.; Ciochină, S. Regularization of the RLS algorithm. IEICE Trans. Fundam. 2011, E94-A, 1628–1629. [Google Scholar] [CrossRef]
Digital Network Echo Cancellers; ITU-T Recommendation G.168; ITU: Geneva, Switzerland, 2002.

Figure 1. (a) The structure of

s_{1, l_{2}} (n)

[see (5)] and (b) the symbolic representation of the

s_{1, l_{2}} (n)

memoryless weighted adder.

Figure 1. (a) The structure of

s_{1, l_{2}} (n)

[see (5)] and (b) the symbolic representation of the

s_{1, l_{2}} (n)

memoryless weighted adder.

Figure 2. The two combiner levels structure (i.e., a MISO system).

Figure 3. (a) The structure of

s_{1, l_{2}} (n)

with delay line and (b) its symbolic representation.

Figure 3. (a) The structure of

s_{1, l_{2}} (n)

with delay line and (b) its symbolic representation.

Figure 4. The two combiner levels structure based on transversal filters.

Figure 5. SISO system in cascaded configuration.

Figure 6. Impulse responses for the bilinear setup: (a)

h_{1}

, first impulse from the G168 Recommendation [22]; (b)

h_{2}

, exponential generated impulse response; and (c) impulse response of the target system,

h = h_{2} \otimes h_{1}

.

Figure 6. Impulse responses for the bilinear setup: (a)

h_{1}

, first impulse from the G168 Recommendation [22]; (b)

h_{2}

, exponential generated impulse response; and (c) impulse response of the target system,

h = h_{2} \otimes h_{1}

.

Figure 7. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is white Gaussian noise and the impulse response changes after 4 s of simulation.

Figure 7. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is white Gaussian noise and the impulse response changes after 4 s of simulation.

Figure 8. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is an AR(1) process and the impulse response changes after 4 s of simulation.

Figure 8. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is an AR(1) process and the impulse response changes after 4 s of simulation.

Figure 9. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is a speech sequence and the impulse response changes in the middle of the simulation.

Figure 9. Normalized misalignment of the classical RLS (

L = 512

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

) algorithms. The input signal is a speech sequence and the impulse response changes in the middle of the simulation.

Figure 10. Impulse responses for the trilinear setup: (a)

h_{2}

, random generated impulse response; (b)

h_{3}

, exponential generated impulse response; and (c) Impulse response of the target system,

h = h_{3} \otimes h_{2} \otimes h_{1}

.

Figure 10. Impulse responses for the trilinear setup: (a)

h_{2}

, random generated impulse response; (b)

h_{3}

, exponential generated impulse response; and (c) Impulse response of the target system,

h = h_{3} \otimes h_{2} \otimes h_{1}

.

Figure 11. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is white Gaussian noise and the impulse response changes after 4 s.

Figure 11. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is white Gaussian noise and the impulse response changes after 4 s.

Figure 12. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is an AR(1) process and the impulse response changes after 4 s.

Figure 12. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is an AR(1) process and the impulse response changes after 4 s.

Figure 13. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is a speech sequence and the impulse response changes after 6 s.

Figure 13. Normalized misalignment of the classical RLS (

L = 2048

) and RLS-CKD (

L_{1} = 64

,

L_{2} = 8

,

L_{3} = 4

) algorithms. The input signal is a speech sequence and the impulse response changes after 6 s.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rusu, A.-G.; Ciochină, S.; Paleologu, C.; Benesty, J. Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition. Electronics 2022, 11, 409. https://doi.org/10.3390/electronics11030409

AMA Style

Rusu A-G, Ciochină S, Paleologu C, Benesty J. Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition. Electronics. 2022; 11(3):409. https://doi.org/10.3390/electronics11030409

Chicago/Turabian Style

Rusu, Alexandru-George, Silviu Ciochină, Constantin Paleologu, and Jacob Benesty. 2022. "Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition" Electronics 11, no. 3: 409. https://doi.org/10.3390/electronics11030409

APA Style

Rusu, A.-G., Ciochină, S., Paleologu, C., & Benesty, J. (2022). Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition. Electronics, 11(3), 409. https://doi.org/10.3390/electronics11030409

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cascaded RLS Adaptive Filters Based on a Kronecker Product Decomposition

Abstract

1. Introduction

2. Bilinear Structures without Memory

3. Bilinear Structures with Memory

4. Cascaded Multilinear RLS Algorithm Using Kronecker Product Decomposition

5. Simulation Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI