Identification of Linear and Bilinear Systems: A Unified Study

Benesty, Jacob; Paleologu, Constantin; Dogariu, Laura-Maria; Ciochină, Silviu

doi:10.3390/electronics10151790

Open AccessReview

Identification of Linear and Bilinear Systems: A Unified Study

¹

INRS-EMT, University of Quebec, Montreal, QC H5A 1K6, Canada

²

Department of Telecommunications, University Politehnica of Bucharest, 1-3, Iuliu Maniu Blvd., 061071 Bucharest, Romania

^*

Author to whom correspondence should be addressed.

Electronics 2021, 10(15), 1790; https://doi.org/10.3390/electronics10151790

Submission received: 4 June 2021 / Revised: 13 July 2021 / Accepted: 21 July 2021 / Published: 26 July 2021

(This article belongs to the Special Issue Efficient Algorithms and Architectures for DSP Applications)

Download

Browse Figures

Versions Notes

Abstract

:

System identification problems are always challenging to address in applications that involve long impulse responses, especially in the framework of multichannel systems. In this context, the main goal of this review paper is to promote some recent developments that exploit decomposition-based approaches to multiple-input/single-output (MISO) system identification problems, which can be efficiently solved as combinations of low-dimension solutions. The basic idea is to reformulate such a high-dimension problem in the framework of bilinear forms, and to then take advantage of the Kronecker product decomposition and low-rank approximation of the spatiotemporal impulse response of the system. The validity of this approach is addressed in terms of the celebrated Wiener filter, by developing an iterative version with improved performance features (related to the accuracy and robustness of the solution). Simulation results support the main theoretical findings and indicate the appealing performance of these developments.

Keywords:

system identification; linear system; bilinear system; best approximation; singular value decomposition; optimal filtering; Wiener filter; multichannel acoustic echo cancellation

1. Introduction

Solving a system identification problem represents a key step in many important real-world applications [1,2]. In general, such a problem can be formulated in terms of estimating or modeling the parameters of an unknown system when a set of data is available, which is usually related to the input and output of the system. Depending on the specific particularities of the problem or application, we can deal with different types of systems, according to their numbers of inputs and outputs. The simplest formulation is the well-known single-input/single-output (SISO) system. Furthermore, in some applications we can deal with more elaborated structures, such as multiple-input/single-output (MISO) and multiple-input/multiple-output (MIMO) systems.

The linearity is an important feature of a system, which can significantly simplify the overall identification problem. Even if many real-world systems face nonlinear behaviors, it is always desirable to address or reformulate the framework such that it has a linear approach to some extent. In this context, a useful topic is related to bilinear forms, which have been addressed in the literature in different ways and contexts [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]; most often they are related to approximating nonlinear systems.

In general, a bilinear model can approximate a large class of nonlinear systems via a finite sum of the Volterra series expansion between the inputs and outputs of the system. Therefore, in this context, bilinear systems behave similarly (to some extent) to linear models. This could further simplify the analysis, as outlined before. Due to this simplicity, bilinear systems have been involved in a wide range of applications, such as digital filter synthesis [7], prediction problems [8], channel equalization [9], echo cancellation [10], chaotic communications [16], neural networks [20], and active noise control [21]. Nevertheless, in all these frameworks, the bilinear term is defined with respect to the data, i.e., in terms of an input-output relation.

In this study, we focus on a different approach by defining the bilinear term with respect to the impulse responses of a spatiotemporal model, in the context of MISO systems. Several similar frameworks can be found in the literature, in the context of particular applications, such as channel equalization [13], nonlinear acoustic echo cancellation [15], and target detection [18]. However, most of these works were not associated with or analyzed in conjunction with bilinear forms. Usually, they were referred to as joint adaptation processes or cascaded systems, which are similar to the Hammerstein model [23].

More recently, an iterative Wiener filter for such bilinear forms was developed in the framework of a MISO system identification problem [24]. As compared to the conventional Wiener filter, the iterative version can obtain good accuracy even when a only small amount of information is available for the estimation of the statistics. Following the Wiener benchmark, another category of solutions relies on adaptive filtering [25,26]. Several adaptive filters tailored for the identification of bilinear forms have also been developed, following the main categories of algorithms. For example, the least-mean-square (LMS) and normalized LMS (NLMS) versions can be found in [27,28]. In addition, several recursive least-squares (RLS) algorithms for bilinear forms were developed in [29]. Moreover, a Kalman filter tailored for the identification of bilinear forms was proposed in [30].

In the previously mentioned approaches, the spatiotemporal impulse response of the MISO system is considered perfectly separable, and its components are combined using the Kronecker product. The identification of such linearly separable systems can be efficiently exploited in the frameworks of different applications, such as source separation [31,32], array beamforming [33,34], and object recognition [35,36]. In these contexts, the basic solution relies on the decomposition and modeling techniques of rank-1 tensors [37,38,39,40,41,42]. Nevertheless, it is highly useful to exploit the decomposition-based approach for the identification of more general forms of impulse responses.

Several recent works have followed this idea by exploiting the nearest Kronecker product decomposition and low-rank approximations [43,44,45,46,47]. In this context, the basic concept is to reformulate a high-dimension system identification problem as a combination of low-dimension solutions, thereby gaining in terms of both performance and complexity. Due to its features, this approach can be used in different practical applications—e.g., [48,49,50,51,52,53,54,55], among which we can mention acoustic feedback cancellation, adaptive beamforming, speech dereverberation, multichannel linear prediction, and nonlinear system identification.

A unified study on the efficient identification of linear and bilinear systems exploiting the decomposition-based approach is provided in this review paper. First, in Section 2, we present different system models, in the context of linear and bilinear forms. Then, in Section 3, we show how these models are related, thereby outlining the equivalence among the systems. The ideas behind the decomposition-based approach together with the optimal low-rank approximation are presented in Section 4. Since the Wiener filter represents a benchmark tool for the system identification problems, we illustrate its behavior in Section 5, wherein we also introduce an iterative version with improved performance features. Simulation results are provided in Section 6, in order to support the main theoretical findings. Finally, several conclusions and perspectives of this study are outlined in Section 7.

2. Different Input Output Linear/Bilinear System Models

In this study, we assume that the input and output, and the noise signals, take real values and have zero means. The most popular input output system is the so-called SISO system given by

\begin{matrix} d (k) & = h_{t}^{T} x (k) + w (k) \\ = y (k) + w (k), \end{matrix}

(1)

where

d (k)

denotes the desired (or reference) signal at discrete-time index k,

h_{t}

is the system’s temporal impulse response of length L, and the superscript

^{T}

denotes the transpose operator. The vector

\begin{matrix} x (k) & = {[\begin{matrix} x (k) & x (k - 1) & \dots & x (k - L + 1) \end{matrix}]}^{T} \end{matrix}

(2)

contains the most recent L samples of the input signal,

x (k)

;

w (k)

is the additive noise; and

y (k) = h_{t}^{T} x (k)

is the linear form in

h_{t}

. A typical assumption that can be made is that

x (k)

and

w (k)

are uncorrelated (or even independent, which is not really required if we only handle second-order statistics). We refer to (1) as the linear SISO (LSISO) system. Its general block diagram is provided in Figure 1.

Without loss of generality, let us assume that

L = L_{1} L_{2}

, with

L_{1} \geq L_{2}

. A shorter version of the input signal vector,

x (k)

, may be written as

\begin{matrix} x^{'} (k) & = {[\begin{matrix} x (k) & x (k - 1) & \dots & x (k - L_{1} + 1) \end{matrix}]}^{T} . \end{matrix}

(3)

As a result, we can express (2) as

\begin{matrix} x (k) & = {[\begin{matrix} x^{' T} (k) & x^{' T} (k - L_{1}) & \dots & x^{' T} [k - (L_{2} - 1) L_{1}] \end{matrix}]}^{T}, \end{matrix}

(4)

from which we deduce the matrix of size

L_{1} \times L_{2}

:

\begin{matrix} X (k) & = [\begin{matrix} x^{'} (k) & x^{'} (k - L_{1}) & \dots & x^{'} [k - (L_{2} - 1) L_{1}] \end{matrix}] . \end{matrix}

(5)

In other terms, we have

\begin{matrix} x (k) & = vec [X (k)], \end{matrix}

(6)

where

vec [\cdot]

denotes vectorization, i.e., the operation of converting a matrix into a vector. It may also be convenient to use the inverse of the vectorization operator [40], i.e.,

X (k) = ivec [x (k)]

, which is equivalent to (6). Therefore, the most straightforward bilinear system that follows from the previous development results as

\begin{matrix} d (k) & = h_{t 1}^{T} X (k) h_{t 2} + w (k) \\ = y (k) + w (k), \end{matrix}

(7)

where

h_{t 1}

and

h_{t 2}

are the first and second temporal impulse responses, of lengths

L_{1}

and

L_{2}

, respectively; and

y (k) = h_{t 1}^{T} X (k) h_{t 2}

is now bilinear in

h_{t 1}

and

h_{t 2}

. We call (7) the bilinear SISO (BSISO1) system. The equivalency between the LSISO and BSISO1 systems is explained and detailed in Section 3, together with the connections among different models that are discussed in the current section.

An obvious generalization of (7) is

\begin{matrix} d (k) & = \sum_{l = 1}^{L_{2}} h_{t 1, l}^{T} X (k) h_{t 2, l} + w (k), \end{matrix}

(8)

where

h_{t 1, l}, l = 1, 2, \dots, L_{2}

and

h_{t 2, l}, l = 1, 2, \dots, L_{2}

are the first and second sets of the system temporal impulse responses of lengths

L_{1}

and

L_{2}

, respectively. We refer to (8) as the BSISO2 system. Expression (8) can be rewritten as

\begin{matrix} d (k) & = h_{t 1}^{T} X (k) h_{t 2} + w (k) \\ = y (k) + w (k), \end{matrix}

(9)

where

\begin{matrix} h_{t 1} & = {[\begin{matrix} h_{t 1, 1}^{T} & h_{t 1, 2}^{T} & \dots & h_{t 1, L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(10)

\begin{matrix} h_{t 2} & = {[\begin{matrix} h_{t 2, 1}^{T} & h_{t 2, 2}^{T} & \dots & h_{t 2, L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(11)

and

\begin{matrix} X (k) & = bdiag [X (k), X (k), \dots, X (k)] \end{matrix}

(12)

is a block-diagonal matrix with

L_{2}

diagonal blocks. We can see that

y (k) = h_{t 1}^{T} X (k) h_{t 2}

is bilinear in

h_{t 1}

and

h_{t 2}

.

An important extension of the LSISO system in (1) is the so-called linear MISO (LMISO) system:

\begin{matrix} d (k) & = \sum_{m = 1}^{M} h_{m}^{T} x_{m} (k) + w (k), \end{matrix}

(13)

where M denotes the number of system inputs (or channels),

h_{m}, m = 1, 2, \dots, M

are the M channel impulse responses of length L, and the vector

\begin{matrix} x_{m} (k) & = {[\begin{matrix} x_{m} (k) & x_{m} (k - 1) & \dots & x_{m} (k - L + 1) \end{matrix}]}^{T} \end{matrix}

(14)

contains the most recent L samples of the mth (

m = 1, 2, \dots, M

) input signal,

x_{m} (k)

. The general block diagram of the LMISO system is provided in Figure 2. Equation (13) can be rewritten as

\begin{matrix} d (k) & = {\bar{h}}^{T} \bar{x} (k) + w (k) \\ = y (k) + w (k), \end{matrix}

(15)

where

\begin{matrix} \bar{h} & = {[\begin{matrix} h_{1}^{T} & h_{2}^{T} & \dots & h_{M}^{T} \end{matrix}]}^{T}, \end{matrix}

(16)

\begin{matrix} \bar{x} (k) & = {[\begin{matrix} x_{1}^{T} (k) & x_{2}^{T} (k) & \dots & x_{M}^{T} (k) \end{matrix}]}^{T} . \end{matrix}

(17)

Clearly,

y (k) = {\bar{h}}^{T} \bar{x} (k)

is linear in

\bar{h}

. Of course, the particular case of

M = 1

corresponds to the LSISO system.

As in the single-channel case, let

L = L_{1} L_{2}

but with

M L_{1} \geq L_{2}

. We can decompose

x_{m} (k)

, similarly to (4), as

\begin{matrix} x_{m} (k) & = {[\begin{matrix} x_{m}^{' T} (k) & x_{m}^{' T} (k - L_{1}) & \dots & x_{m}^{' T} [k - (L_{2} - 1) L_{1}] \end{matrix}]}^{T}, \end{matrix}

(18)

where

\begin{matrix} x_{m}^{'} (k) & = {[\begin{matrix} x_{m} (k) & x_{m} (k - 1) & \dots & x_{m} (k - L_{1} + 1) \end{matrix}]}^{T} . \end{matrix}

(19)

Then, we concatenate the M input signals as

\begin{matrix} \underset{̲}{x} (k) & = {[\begin{matrix} {\underset{̲}{x}}^{' T} (k) & {\underset{̲}{x}}^{' T} (k - L_{1}) & \dots & {\underset{̲}{x}}^{' T} [k - (L_{2} - 1) L_{1}] \end{matrix}]}^{T}, \end{matrix}

(20)

where

\begin{matrix} {\underset{̲}{x}}^{'} (k) & = {[\begin{matrix} x_{1}^{' T} (k) & x_{2}^{' T} (k) & \dots & x_{M}^{' T} (k) \end{matrix}]}^{T} \end{matrix}

(21)

is a vector of length

M L_{1}

. Consequently, the LMISO system in (13) or (15) can be expressed in an equivalent manner as

\begin{matrix} d (k) & = {\underset{̲}{h}}^{T} \underset{̲}{x} (k) + w (k), \end{matrix}

(22)

where

\underset{̲}{h}

(of length

M L

) represents the spatiotemporal impulse response of the system, with the same coefficients as

\bar{h}

, resulting through simple permutations, according to the inputs.

The first bilinear MISO (BMISO1) system can be derived from the LSISO system in (1), according to (15) [24]:

\begin{matrix} d (k) & = h_{t}^{T} \bar{X} (k) h_{s} + w (k) \\ = y (k) + w (k), \end{matrix}

(23)

where

h_{t}

(of length L) represents the temporal impulse response of the system,

\bar{X} (k) = ivec [\bar{x} (k)]

,

h_{s}

(of length M) represents the spatial impulse response of the system, and

y (k) = h_{t}^{T} \bar{X} (k) h_{s}

is the bilinear form in

h_{t}

and

h_{s}

. For

M = 1

, (23) is equivalent to the LSISO system in (1); this also means that the bilinear structure is lost in the single-channel particular case.

Now, from (20), we can build the matrix of size

M L_{1} \times L_{2}

:

\begin{matrix} \underset{̲}{X} (k) & = ivec [\underset{̲}{x} (k)] . \end{matrix}

(24)

Then, our second bilinear MISO (BMISO2) system is derived according to (22). We get

\begin{matrix} d (k) & = h_{st 1}^{T} \underset{̲}{X} (k) h_{t 2} + w (k) \\ = y (k) + w (k), \end{matrix}

(25)

where

h_{st 1}

(of length

M L_{1}

) is the spatiotemporal impulse response of the system,

h_{t 2}

(of length

L_{2}

) is the system temporal impulse response, and

y (k) = h_{st 1}^{T} \underset{̲}{X} (k) h_{t 2}

is the bilinear form in

h_{st 1}

and

h_{t 2}

. For

M = 1

, we obtain exactly the BSISO1 system in (7).

Our third and last bilinear MISO (BMISO3) system is just an obvious generalization of (25), i.e.,

\begin{matrix} d (k) & = \sum_{l = 1}^{L_{2}} h_{st 1, l}^{T} \underset{̲}{X} (k) h_{t 2, l} + w (k), \end{matrix}

(26)

where

h_{st 1, l}, l = 1, 2, \dots, L_{2}

(of length

M L_{1}

) is the set of spatiotemporal impulse responses of the system, and

h_{t 2, l}, l = 1, 2, \dots, L_{2}

(of length

L_{2}

) is the set of temporal impulse responses of the system. For

M = 1

, we get the BSISO2 in (8). Relation (26) can be rewritten as

\begin{matrix} d (k) & = h_{st 1}^{T} \underset{̲}{X} (k) h_{t 2} + w (k) \\ = y (k) + w (k), \end{matrix}

(27)

where

\begin{matrix} h_{st 1} & = {[\begin{matrix} h_{st 1, 1}^{T} & h_{st 1, 2}^{T} & \dots & h_{st 1, L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(28)

\begin{matrix} h_{t 2} & = {[\begin{matrix} h_{t 2, 1}^{T} & h_{t 2, 2}^{T} & \dots & h_{t 2, L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(29)

and

\begin{matrix} \underset{̲}{X} (k) & = bdiag [\underset{̲}{X} (k), \underset{̲}{X} (k), \dots, \underset{̲}{X} (k)] \end{matrix}

(30)

is a block-diagonal matrix with

L_{2}

diagonal blocks, while

y (k) = h_{st 1}^{T} \underset{̲}{X} (k) h_{t 2}

is bilinear in

h_{st 1}

and

h_{t 2}

.

3. Equivalence among Systems

In this section, we show how the different linear and bilinear systems are related. Let us start with the BSISO1 system in (7). Its bilinear term can be rewritten as

\begin{matrix} y (k) & = h_{t 1}^{T} X (k) h_{t 2} \\ = tr [{(h_{t 1} h_{t 2}^{T})}^{T} X (k)] \\ = {vec}^{T} (h_{t 1} h_{t 2}^{T}) vec [X (k)] \\ = {(h_{t 2} \otimes h_{t 1})}^{T} x (k), \end{matrix}

(31)

where

tr [\cdot]

denotes the trace of a square matrix and ⊗ is the Kronecker product [56]. With (31) in mind, comparing the BSISO1 system with the LSISO system in (1), we can clearly observe that the two systems are identical if

h_{t} = h_{t 2} \otimes h_{t 1}

. Therefore, in general, we can say that BSISO1 is a particular case of LSISO. In other words, BSISO1 is also an LSISO with some structure of its temporal impulse response.

Now, let us focus on the BSISO2 system in (8). Another way to express its bilinear term is

\begin{matrix} y (k) & = \sum_{l = 1}^{L_{2}} h_{t 1, l}^{T} X (k) h_{t 2, l} \\ = \sum_{l = 1}^{L_{2}} {vec}^{T} (h_{t 1, l} h_{t 2, l}^{T}) vec [X (k)] \\ = {vec}^{T} (\sum_{l = 1}^{L_{2}} h_{t 1, l} h_{t 2, l}^{T}) x (k) \\ = {vec}^{T} (H_{t}) x (k), \end{matrix}

(32)

where

\begin{matrix} H_{t} & = \sum_{l = 1}^{L_{2}} h_{t 1, l} h_{t 2, l}^{T} \end{matrix}

(33)

is a matrix of size

L_{1} \times L_{2}

of rank equal to

L_{2}

in general. At the same time, the temporal impulse response of the LSISO system can be decomposed as

\begin{matrix} h_{t} & = {[\begin{matrix} h_{t, 1}^{T} & h_{t, 2}^{T} & \dots & h_{t, L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(34)

where

h_{t, l}, l = 1, 2, \dots, L_{2}

are impulse responses of length

L_{1}

each. Next, we can rewrite the linear term of the LSISO system as

\begin{matrix} y (k) & = h_{t}^{T} x (k) \\ = {vec}^{T} (H_{t}) x (k), \end{matrix}

(35)

where

H_{t} = ivec (h_{t})

. It can be easily seen by comparing (32) and (35) that the LSISO and BSISO2 systems are equivalent.

In the same way, we can write the bilinear term of the BMISO1 system in (23) as

\begin{matrix} y (k) & = h_{t}^{T} \bar{X} (k) h_{s} \\ = {(h_{s} \otimes h_{t})}^{T} \bar{x} (k) . \end{matrix}

(36)

Then, by comparing the previous expression with the bilinear form of the LMISO system in (15), we can see that the two are the same if

\bar{h} = h_{s} \otimes h_{t}

. In general, BMISO1 is a particular case of LMISO.

The bilinear term of the second bilinear MISO system, i.e., BMISO2 in (25), can also be expressed as

\begin{matrix} y (k) & = h_{st 1}^{T} \underset{̲}{X} (k) h_{t 2} \\ = {(h_{t 2} \otimes h_{st 1})}^{T} \underset{̲}{x} (k) . \end{matrix}

(37)

Again, we can conclude that the BMISO2 system is a particular case of the LMISO system in (22), where

\underset{̲}{h} = h_{t 2} \otimes h_{st 1}

.

Finally, the bilinear form of the BMISO3 system in (26) may be written as

\begin{matrix} y (k) & = \sum_{l = 1}^{L_{2}} h_{st 1, l}^{T} \underset{̲}{X} (k) h_{t 2, l} \\ = {vec}^{T} (H_{st}) \underset{̲}{x} (k), \end{matrix}

(38)

where

\begin{matrix} H_{st} & = \sum_{l = 1}^{L_{2}} h_{st 1, l} h_{t 2, l}^{T} \end{matrix}

(39)

is a matrix of size

M L_{1} \times L_{2}

of rank equal to

L_{2}

in general. At the same time, the spatiotemporal impulse response of the LMISO system in (22) can be decomposed as

\begin{matrix} \underset{̲}{h} & = {[\begin{matrix} {\underset{̲}{h}}_{1}^{T} & {\underset{̲}{h}}_{2}^{T} & \dots & {\underset{̲}{h}}_{L_{2}}^{T} \end{matrix}]}^{T}, \end{matrix}

(40)

where

{\underset{̲}{h}}_{l}, l = 1, 2, \dots, L_{2}

are impulse responses of length

M L_{1}

each. Next, we can rewrite the linear term of the LMISO system as

\begin{matrix} y (k) & = {\underset{̲}{h}}^{T} \underset{̲}{x} (k) \\ = {vec}^{T} (H_{st}) \underset{̲}{x} (k), \end{matrix}

(41)

where

H_{st} = ivec (\underset{̲}{h})

. It can be easily seen by comparing (38) and (41) that the LMISO and BMISO3 systems are equivalent.

4. Best Approximation

The main objective in this study is to identify the LMISO system in (13) (or, equivalently, in (15) or (22)). The LSISO system is just a particular case and has been studied before. We can achieve this goal based on what is already known about bilinear forms and how they are best approximated.

Let

a

be a real-valued vector of length L. The 2-norm or Euclidean norm of this vector is defined as

\begin{matrix} {∥a∥}_{2} & = \sqrt{a^{T} a} . \end{matrix}

(42)

Let

A

be a real-valued rectangular matrix of size

L \times C

. The Frobenius norm and the 2-norm of this matrix are, respectively,

\begin{matrix} {∥A∥}_{F} & = \sqrt{tr (A^{T} A)} \end{matrix}

(43)

and

\begin{matrix} {∥A∥}_{2} & = max_{{∥x∥}_{2} = 1} {∥A x∥}_{2} . \end{matrix}

(44)

Now, we can consider the impulse response of the BMISO3 system in (26); i.e., the matrix

H_{st} = \sum_{l = 1}^{L_{2}} h_{st 1, l} h_{t 2, l}^{T}

of size

M L_{1} \times L_{2}

with

M L_{1} \geq L_{2}

(see Equation (38)). As mentioned before, this system is equivalent to the LMISO system defined by relation (22). The matrix

H_{st}

can be factorized through the singular value decomposition (SVD):

\begin{matrix} H_{st} & = U Σ V^{T} \\ = \sum_{l = 1}^{L_{2}} σ_{l} u_{l} v_{l}^{T}, \end{matrix}

(45)

where

U

, of size

M L_{1} \times M L_{1}

, and

V

, of size

L_{2} \times L_{2}

, are orthogonal matrices and

Σ

is an

M L_{1} \times L_{2}

rectangular diagonal matrix having on the main diagonal nonnegative real numbers. The columns of

U

and

V

are known as the left-singular and right-singular vectors, respectively, of

H_{st}

, whereas the elements

σ_{l}, l = 1, 2, \dots, L_{2}

on the diagonal of

Σ

are called singular values of

H_{st}

with

σ_{1} \geq σ_{2} \geq \dots \geq σ_{L_{2}} \geq 0

.

Based on (39) and (45), we deduce that

\begin{matrix} h_{st 1, l} & = α (σ_{l}) u_{l}, \end{matrix}

(46)

\begin{matrix} h_{t 2, l} & = β (σ_{l}) v_{l}, \end{matrix}

(47)

with

l = 1, 2, \dots, L_{2}

, where

α (σ_{l}) β (σ_{l}) = σ_{l}

,

u_{l}, l = 1, 2, \dots, L_{2}

are the first

L_{2}

columns of

U

, and

v_{l}, l = 1, 2, \dots, L_{2}

are the columns of

V

. It may be easily checked that

{∥H_{st}∥}_{2} = σ_{1}

and

{∥H_{st}∥}_{F} = \sqrt{\sum_{l = 1}^{L_{2}} σ_{l}^{2}}

. In addition, since

\underset{̲}{h} = vec (H_{st})

(see (41)), the global impulse response can be decomposed as

\begin{matrix} \underset{̲}{h} & = \sum_{l = 1}^{L_{2}} σ_{l} (v_{l} \otimes u_{l}) \\ = \sum_{l = 1}^{L_{2}} h_{t 2, l} \otimes h_{st 1, l} . \end{matrix}

(48)

However, in practical scenarios, the matrix

H_{st}

is never really of full rank, because of the reflections and/or sparseness in the system [57,58,59,60]. Let

P ≪ L_{2}

and let us define the following matrix:

\begin{matrix} H_{st} (P) & = \sum_{p = 1}^{P} σ_{p} u_{p} v_{p}^{T} . \end{matrix}

(49)

Now, the objective is to verify whether

H_{st}

can be well approximated by

H_{st} (P)

. In the positive scenario, the LMISO system can be written as

\begin{matrix} d (k) & = {vec}^{T} (H_{st}) \underset{̲}{x} (k) + w (k) \\ = {vec}^{T} [H_{st} (P)] \underset{̲}{x} (k) + b (k) + w (k), \end{matrix}

(50)

where

\begin{matrix} b (k) & = {vec}^{T} (Υ) \underset{̲}{x} (k) \end{matrix}

(51)

denotes the correlated noise (considered negligible), with

Υ = \sum_{i = P + 1}^{L_{2}} σ_{i} u_{i} v_{i}^{T}

. Consequently, the goal becomes to identify the new matrix

H_{st} (P)

instead of

H_{st}

. This new idea may have a few advantages, as is explained in the following.

Next, we state a theorem given in [61,62], which helps to prove that

H_{st}

can be well approximated by

H_{st} (P)

. Let

rank (H_{st}) = R \leq L_{2}

and let

S

be the set of

M L_{1} \times L_{2}

matrices of rank equal to

P < R

. Then, the solution to the minimization problem

\begin{matrix} min_{H \in S} {∥H_{st} - H∥}_{2} or min_{H \in S} {∥H_{st} - H∥}_{F} \end{matrix}

(52)

is given by (49). Furthermore, we have

\begin{matrix} min_{H \in S} {∥H_{st} - H∥}_{2} = {∥H_{st} - H_{st} (P)∥}_{2} = σ_{P + 1} \end{matrix}

(53)

and

\begin{matrix} min_{H \in S} {∥H_{st} - H∥}_{F} = {∥H_{st} - H_{st} (P)∥}_{F} = \sqrt{\sum_{i = P + 1}^{L_{2}} σ_{i}^{2}} . \end{matrix}

(54)

Consequently, as long as the normalized misalignment,

\begin{matrix} M (P) & = \frac{{∥H_{st} - H_{st} (P)∥}_{F}}{{∥H_{st}∥}_{F}}, \end{matrix}

(55)

remains very small, it is sufficient in practice to estimate the impulse responses

h_{st 1, p}

and

h_{t 2, p}

for

p = 1, 2, \dots, P

.

In order to show the validity of this approach, let us consider two scenarios that will also be detailed in the simulations provided in Section 6. In the first scenario, we consider

M = 4

impulse responses from the G168 Recommendation [63], which are network echo paths of length

L = 500

, as depicted in Figure 3. In this case, the decomposition was performed using

L_{1} = 25

and

L_{2} = 20

. In the second scenario, we used two acoustic impulse responses (i.e.,

M = 2

), each one having

L = 1024

coefficients, as depicted in Figure 4. Here, we set

L_{1} = L_{2} = 32

for the decomposition. In both cases, we evaluate the normalized misalignment from (55) and the evolution of the singular values

σ_{l}

(

l = 1, 2, \dots, L_{2}

) of the matrix

H_{st}

. As we can see in Figure 5, the normalized misalignment decreased with the value of P. This was much more apparent in the first scenario (corresponding to Figure 3), where the rank of

H_{st}

resulted in

R = 5

, as shown in Figure 5a. Consequently, a good approximation was obtained for

P ≪ L_{2}

. In case of the acoustic impulse responses (i.e., the scenario from Figure 4), the resulting matrix

H_{st}

was closer to being full rank, so that a larger value of P was required to obtain a good approximation. Nevertheless, as we can notice in Figure 5b, a value of P significantly lower as compared to

L_{2}

led to reasonable attenuation of the misalignment (e.g., around

- 20

dB). This behavior is also supported in Figure 6, where we can notice the decreasing trend of the singular values (which are normalized to the maximum value).

5. Identification with the Wiener Filter

The identification of the LMISO system in (22) involves finding a real-valued filter,

\hat{\underset{̲}{h}}

, of length

M L

, which estimates the system

\underset{̲}{h}

. The error signal can be defined as

\begin{matrix} e (k) & = d (k) - \hat{y} (k), \end{matrix}

(56)

where

\hat{y} (k) = {\hat{\underset{̲}{h}}}^{T} \underset{̲}{x} (k)

. The optimization criterion used to find the optimal filter is the mean-squared error (MSE):

\begin{matrix} J (\hat{\underset{̲}{h}}) & = E [e^{2} (k)] \\ = σ_{d}^{2} - 2 {\hat{\underset{̲}{h}}}^{T} p + {\hat{\underset{̲}{h}}}^{T} R \hat{\underset{̲}{h}}, \end{matrix}

(57)

where

E [\cdot]

is the mathematical expectation,

p = E [\underset{̲}{x} (k) d (k)]

represents the cross-correlation vector between

\underset{̲}{x} (k)

and

d (k)

, and

R = E [\underset{̲}{x} (k) {\underset{̲}{x}}^{T} (k)]

denotes the covariance matrix of

\underset{̲}{x} (k)

. After minimizing

J (\hat{\underset{̲}{h}})

, the celebrated (multichannel) Wiener filter is obtained:

\begin{matrix} {\hat{\underset{̲}{h}}}_{W} & = R^{- 1} p . \end{matrix}

(58)

Since the covariance matrix in the expression above is of size

M L \times M L

, a large number of data samples (more than

M L

) is needed in order to obtain a reliable solution.

An alternative approach to identifying the LMISO system in (22) and estimating

\underset{̲}{h}

as in the conventional case is to identify the LMISO system in (50) and estimate

H_{st}

. In the rest of this paper, the subscripts

_{st}

and

_{t}

are dropped in order to simplify the notation, and in this way

H_{st} = \sum_{l = 1}^{L_{2}} h_{st 1, l} h_{t 2, l}^{T}

becomes

H = \sum_{l = 1}^{L_{2}} h_{1, l} h_{2, l}^{T}

.

Next, we assume that

rank (H) = P ≪ L_{2}

. Consequently,

\underset{̲}{h}

can be decomposed as

\begin{matrix} \underset{̲}{h} & = \sum_{p = 1}^{P} h_{2, p} \otimes h_{1, p}, \end{matrix}

(59)

where the impulse responses

h_{1, p}

and

h_{2, p}

have lengths

M L_{1}

and

L_{2}

, respectively. Therefore, the filter

\hat{\underset{̲}{h}}

may also be decomposed as

\begin{matrix} \hat{\underset{̲}{h}} & = \sum_{p = 1}^{P} {\hat{h}}_{2, p} \otimes {\hat{h}}_{1, p}, \end{matrix}

(60)

where the filters

{\hat{h}}_{1, p}

and

{\hat{h}}_{2, p}

have lengths

M L_{1}

and

L_{2}

, respectively. With the relations

\begin{matrix} {\hat{h}}_{2, p} \otimes {\hat{h}}_{1, p} & = ({\hat{h}}_{2, p} \times 1) \otimes (I_{M L_{1}} \times {\hat{h}}_{1, p}) \\ = ({\hat{h}}_{2, p} \otimes I_{M L_{1}}) {\hat{h}}_{1, p} \end{matrix}

(61)

and

\begin{matrix} {\hat{h}}_{2, p} \otimes {\hat{h}}_{1, p} & = (I_{L_{2}} \times {\hat{h}}_{2, p}) \otimes ({\hat{h}}_{1, p} \times 1) \\ = (I_{L_{2}} \otimes {\hat{h}}_{1, p}) {\hat{h}}_{2, p}, \end{matrix}

(62)

where

I_{M L_{1}}

and

I_{L_{2}}

are the identity matrices of sizes

M L_{1} \times M L_{1}

and

L_{2} \times L_{2}

, respectively, (60) may be rewritten as

\begin{matrix} \hat{\underset{̲}{h}} & = \sum_{p = 1}^{P} {\hat{H}}_{2, p} {\hat{h}}_{1, p} \end{matrix}

(63)

\begin{matrix} = \sum_{p = 1}^{P} {\hat{H}}_{1, p} {\hat{h}}_{2, p}, \end{matrix}

(64)

where

\begin{matrix} {\hat{H}}_{2, p} & = {\hat{h}}_{2, p} \otimes I_{M L_{1}}, \\ {\hat{H}}_{1, p} & = I_{L_{2}} \otimes {\hat{h}}_{1, p} \end{matrix}

are matrices of sizes

M L \times M L_{1}

and

M L \times L_{2}

, respectively. As a result, we may express the error signal defined in (56) in two distinct ways:

\begin{matrix} e (k) & = d (k) - \sum_{p = 1}^{P} {\hat{h}}_{1, p}^{T} {\hat{H}}_{2, p}^{T} \underset{̲}{x} (k) \\ = d (k) - \sum_{p = 1}^{P} {\hat{h}}_{1, p}^{T} x_{2, p} (k) \\ = d (k) - {\underset{̲}{\hat{h}}}_{1}^{T} {\underset{̲}{x}}_{2} (k) \end{matrix}

(65)

and

\begin{matrix} e (k) & = d (k) - \sum_{p = 1}^{P} {\hat{h}}_{2, p}^{T} {\hat{H}}_{1, p}^{T} \underset{̲}{x} (k) \\ = d (k) - \sum_{p = 1}^{P} {\hat{h}}_{2, p}^{T} x_{1, p} (k) \\ = d (k) - {\underset{̲}{\hat{h}}}_{2}^{T} {\underset{̲}{x}}_{1} (k), \end{matrix}

(66)

where

\begin{matrix} x_{2, p} (k) & = {\hat{H}}_{2, p}^{T} \underset{̲}{x} (k), \\ {\underset{̲}{\hat{h}}}_{1} & = {[\begin{matrix} {\hat{h}}_{1, 1}^{T} & {\hat{h}}_{1, 2}^{T} & \dots & {\hat{h}}_{1, P}^{T} \end{matrix}]}^{T}, \\ {\underset{̲}{x}}_{2} (k) & = {[\begin{matrix} x_{2, 1}^{T} (k) & x_{2, 2}^{T} (k) & \dots & x_{2, P}^{T} (k) \end{matrix}]}^{T}, \\ x_{1, p} (k) & = {\hat{H}}_{1, p}^{T} \underset{̲}{x} (k), \\ {\underset{̲}{\hat{h}}}_{2} & = {[\begin{matrix} {\hat{h}}_{2, 1}^{T} & {\hat{h}}_{2, 2}^{T} & \dots & {\hat{h}}_{2, P}^{T} \end{matrix}]}^{T}, \\ {\underset{̲}{x}}_{1} (k) & = {[\begin{matrix} x_{1, 1}^{T} (k) & x_{1, 2}^{T} (k) & \dots & x_{1, P}^{T} (k) \end{matrix}]}^{T} . \end{matrix}

Continuing with this formulation, we can write the MSE criterion as

\begin{matrix} J ({\underset{̲}{\hat{h}}}_{1}, {\underset{̲}{\hat{h}}}_{2}) & = σ_{d}^{2} - 2 {\underset{̲}{\hat{h}}}_{1}^{T} {\underset{̲}{p}}_{2} + {\underset{̲}{\hat{h}}}_{1}^{T} {\underset{̲}{R}}_{2} {\underset{̲}{\hat{h}}}_{1} \end{matrix}

(67)

\begin{matrix} = σ_{d}^{2} - 2 {\underset{̲}{\hat{h}}}_{2}^{T} {\underset{̲}{p}}_{1} + {\underset{̲}{\hat{h}}}_{2}^{T} {\underset{̲}{R}}_{1} {\underset{̲}{\hat{h}}}_{2}, \end{matrix}

(68)

where

\begin{matrix} {\underset{̲}{p}}_{2} & = {[\begin{matrix} p^{T} {\hat{H}}_{2, 1} & p^{T} {\hat{H}}_{2, 2} & \dots & p^{T} {\hat{H}}_{2, P} \end{matrix}]}^{T}, \\ {\underset{̲}{R}}_{2} & = [\begin{matrix} {\hat{H}}_{2, 1}^{T} R {\hat{H}}_{2, 1} & {\hat{H}}_{2, 1}^{T} R {\hat{H}}_{2, 2} & \dots & {\hat{H}}_{2, 1}^{T} R {\hat{H}}_{2, P} \\ {\hat{H}}_{2, 2}^{T} R {\hat{H}}_{2, 1} & {\hat{H}}_{2, 2}^{T} R {\hat{H}}_{2, 2} & \dots & {\hat{H}}_{2, 2}^{T} R {\hat{H}}_{2, P} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\hat{H}}_{2, P}^{T} R {\hat{H}}_{2, 1} & {\hat{H}}_{2, P}^{T} R {\hat{H}}_{2, 2} & \dots & {\hat{H}}_{2, P}^{T} R {\hat{H}}_{2, P} \end{matrix}], \\ {\underset{̲}{p}}_{1} & = {[\begin{matrix} p^{T} {\hat{H}}_{1, 1} & p^{T} {\hat{H}}_{1, 2} & \dots & p^{T} {\hat{H}}_{1, P} \end{matrix}]}^{T}, \\ {\underset{̲}{R}}_{1} & = [\begin{matrix} {\hat{H}}_{1, 1}^{T} R {\hat{H}}_{1, 1} & {\hat{H}}_{1, 1}^{T} R {\hat{H}}_{1, 2} & \dots & {\hat{H}}_{1, 1}^{T} R {\hat{H}}_{1, P} \\ {\hat{H}}_{1, 2}^{T} R {\hat{H}}_{1, 1} & {\hat{H}}_{1, 2}^{T} R {\hat{H}}_{1, 2} & \dots & {\hat{H}}_{1, 2}^{T} R {\hat{H}}_{1, P} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {\hat{H}}_{1, P}^{T} R {\hat{H}}_{1, 1} & {\hat{H}}_{1, P}^{T} R {\hat{H}}_{1, 2} & \dots & {\hat{H}}_{1, P}^{T} R {\hat{H}}_{1, P} \end{matrix}] . \end{matrix}

It can be noticed that the matrices

{\underset{̲}{R}}_{1}

and

{\underset{̲}{R}}_{2}

have sizes

P L_{2} \times P L_{2}

and

P M L_{1} \times P M L_{1}

, respectively, which can be much smaller than the size of

R

, which is

M L \times M L

. Additionally, at least

P M L_{1}

data samples are required for the estimation of the statistics in the MSE from (67) or (68), whereas in order to estimate the statistics in the conventional MSE from (57), we need at least

M L

data samples. When

{\underset{̲}{\hat{h}}}_{2}

is fixed, we can express (67) as

\begin{matrix} J_{{\underset{̲}{\hat{h}}}_{2}} ({\underset{̲}{\hat{h}}}_{1}) & = σ_{d}^{2} - 2 {\underset{̲}{\hat{h}}}_{1}^{T} {\underset{̲}{p}}_{2} + {\underset{̲}{\hat{h}}}_{1}^{T} {\underset{̲}{R}}_{2} {\underset{̲}{\hat{h}}}_{1} \end{matrix}

(69)

and when

{\underset{̲}{\hat{h}}}_{1}

is fixed, we can write (68) as

\begin{matrix} J_{{\underset{̲}{\hat{h}}}_{1}} ({\underset{̲}{\hat{h}}}_{2}) & = σ_{d}^{2} - 2 {\underset{̲}{\hat{h}}}_{2}^{T} {\underset{̲}{p}}_{1} + {\underset{̲}{\hat{h}}}_{2}^{T} {\underset{̲}{R}}_{1} {\underset{̲}{\hat{h}}}_{2} . \end{matrix}

(70)

This represents a bilinear optimization strategy [64].

In order to obtain the optimal filters, an iterative algorithm similar to the those presented in [24,43] can be derived. At iteration 0, we may take

\begin{matrix} {\hat{h}}_{2, p}^{(0)} & = {[\begin{matrix} ϵ & 0 & \dots & 0 \end{matrix}]}^{T}, p = 1, 2, \dots, P, \end{matrix}

where

0 < ϵ \leq 1

. Then, we may form

{\hat{H}}_{2, p}^{(0)} = {\hat{h}}_{2, p}^{(0)} \otimes I_{M L_{1}}

and

\begin{matrix} {\underset{̲}{p}}_{2}^{(0)} & = {[\begin{matrix} p^{T} {\hat{H}}_{2, 1}^{(0)} & p^{T} {\hat{H}}_{2, 2}^{(0)} & \dots & p^{T} {\hat{H}}_{2, P}^{(0)} \end{matrix}]}^{T}, \\ {\underset{̲}{R}}_{2}^{(0)} & = [\begin{matrix} {({\hat{H}}_{2, 1}^{(0)})}^{T} R {\hat{H}}_{2, 1}^{(0)} & {({\hat{H}}_{2, 1}^{(0)})}^{T} R {\hat{H}}_{2, 2}^{(0)} & \dots & {({\hat{H}}_{2, 1}^{(0)})}^{T} R {\hat{H}}_{2, P}^{(0)} \\ {({\hat{H}}_{2, 2}^{(0)})}^{T} R {\hat{H}}_{2, 1}^{(0)} & {({\hat{H}}_{2, 2}^{(0)})}^{T} R {\hat{H}}_{2, 2}^{(0)} & \dots & {({\hat{H}}_{2, 2}^{(0)})}^{T} R {\hat{H}}_{2, P}^{(0)} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {({\hat{H}}_{2, P}^{(0)})}^{T} R {\hat{H}}_{2, 1}^{(0)} & {({\hat{H}}_{2, P}^{(0)})}^{T} R {\hat{H}}_{2, 2}^{(0)} & \dots & {({\hat{H}}_{2, P}^{(0)})}^{T} R {\hat{H}}_{2, P}^{(0)} \end{matrix}] . \end{matrix}

By substituting the quantities above into the MSE from (69), we get at iteration 1:

\begin{matrix} J_{{\underset{̲}{\hat{h}}}_{2}} ({\underset{̲}{\hat{h}}}_{1}^{(1)}) & = σ_{d}^{2} - 2 {({\underset{̲}{\hat{h}}}_{1}^{(1)})}^{T} {\underset{̲}{p}}_{2}^{(0)} + {({\underset{̲}{\hat{h}}}_{1}^{(1)})}^{T} {\underset{̲}{R}}_{2}^{(0)} {\underset{̲}{\hat{h}}}_{1}^{(1)}, \end{matrix}

(71)

which can be minimized with respect to

{\underset{̲}{\hat{h}}}_{1}^{(1)}

, thereby yielding

\begin{matrix} {\underset{̲}{\hat{h}}}_{1}^{(1)} & = {({\underset{̲}{R}}_{2}^{(0)})}^{- 1} {\underset{̲}{p}}_{2}^{(0)} . \end{matrix}

(72)

Next, using

{\underset{̲}{\hat{h}}}_{1}^{(1)}

, we can construct

{\hat{H}}_{1, p}^{(1)} = I_{L_{2}} \otimes {\hat{h}}_{1, p}^{(1)}

and

\begin{matrix} {\underset{̲}{p}}_{1}^{(1)} & = {[\begin{matrix} p^{T} {\hat{H}}_{1, 1}^{(1)} & p^{T} {\hat{H}}_{1, 2}^{(1)} & \dots & p^{T} {\hat{H}}_{1, P}^{(1)} \end{matrix}]}^{T}, \\ {\underset{̲}{R}}_{1}^{(1)} & = [\begin{matrix} {({\hat{H}}_{1, 1}^{(1)})}^{T} R {\hat{H}}_{1, 1}^{(1)} & {({\hat{H}}_{1, 1}^{(1)})}^{T} R {\hat{H}}_{1, 2}^{(1)} & \dots & {({\hat{H}}_{1, 1}^{(1)})}^{T} R {\hat{H}}_{1, P}^{(1)} \\ {({\hat{H}}_{1, 2}^{(1)})}^{T} R {\hat{H}}_{1, 1}^{(1)} & {({\hat{H}}_{1, 2}^{(1)})}^{T} R {\hat{H}}_{1, 2}^{(1)} & \dots & {({\hat{H}}_{1, 2}^{(1)})}^{T} R {\hat{H}}_{1, P}^{(1)} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ {({\hat{H}}_{1, P}^{(1)})}^{T} R {\hat{H}}_{1, 1}^{(1)} & {({\hat{H}}_{1, P}^{(1)})}^{T} R {\hat{H}}_{1, 2}^{(1)} & \dots & {({\hat{H}}_{1, P}^{(1)})}^{T} R {\hat{H}}_{1, P}^{(1)} \end{matrix}] . \end{matrix}

Consequently, the MSE from (70) is

\begin{matrix} J_{{\underset{̲}{\hat{h}}}_{1}} ({\underset{̲}{\hat{h}}}_{2}^{(1)}) & = σ_{d}^{2} - 2 {({\underset{̲}{\hat{h}}}_{2}^{(1)})}^{T} {\underset{̲}{p}}_{1}^{(1)} + {({\underset{̲}{\hat{h}}}_{2}^{(1)})}^{T} {\underset{̲}{R}}_{1}^{(1)} {\underset{̲}{\hat{h}}}_{2}^{(1)} . \end{matrix}

(73)

The minimization of the previous expression with respect to

{\underset{̲}{\hat{h}}}_{2}^{(1)}

gives

\begin{matrix} {\underset{̲}{\hat{h}}}_{2}^{(1)} & = {({\underset{̲}{R}}_{1}^{(1)})}^{- 1} {\underset{̲}{p}}_{1}^{(1)} . \end{matrix}

(74)

By iterating further, we obtain at iteration n

\begin{matrix} {\underset{̲}{\hat{h}}}_{1}^{(n)} & = {({\underset{̲}{R}}_{2}^{(n - 1)})}^{- 1} {\underset{̲}{p}}_{2}^{(n - 1)}, \end{matrix}

(75)

\begin{matrix} {\underset{̲}{\hat{h}}}_{2}^{(n)} & = {({\underset{̲}{R}}_{1}^{(n)})}^{- 1} {\underset{̲}{p}}_{1}^{(n)}, \end{matrix}

(76)

where

{\underset{̲}{R}}_{2}^{(n - 1)}

,

{\underset{̲}{p}}_{2}^{(n - 1)}

,

{\underset{̲}{R}}_{1}^{(n)}

, and

{\underset{̲}{p}}_{1}^{(n)}

are constructed similarly to

{\underset{̲}{R}}_{2}^{(0)}

,

{\underset{̲}{p}}_{2}^{(0)}

,

{\underset{̲}{R}}_{1}^{(1)}

, and

{\underset{̲}{p}}_{1}^{(1)}

, respectively. In the end, the Wiener filter at iteration n results as

\begin{matrix} {\hat{\underset{̲}{h}}}_{W}^{(n)} & = \sum_{p = 1}^{P} {\hat{h}}_{2, p}^{(n)} \otimes {\hat{h}}_{1, p}^{(n)} . \end{matrix}

(77)

The multichannel iterative Wiener filter is summarized in Table 1. For

M = 1

(i.e., single-channel case), the problem is reduced to a regular SISO scenario, and the algorithm becomes equivalent to the version developed in [43]. Additionally, if the system is perfectly separable/decomposable, we can obtain the optimal solution for

P = 1

. In this case, the iterative Wiener filter for bilinear forms (proposed in [24]) is obtained.

6. Simulation Results

In this section, we evaluate the performance of the conventional and iterative Wiener filters in two different scenarios. The first one is dedicated to the case of independent input signals,

x_{m} (k), m = 1, 2, \dots, M

. The second scenario is more challenging, since it considers the case when the input signals are coming from the same source and they are linearly related. In both cases, the performance measure used to evaluate the overall behavior is the normalized misalignment (in dB), which is related to the spatiotemporal impulse response of the system,

\underset{̲}{h}

. In this framework, the solution provided by the conventional Wiener filter is given in (58), so that the performance measure is evaluated as

\begin{matrix} M (\underset{̲}{h}, {\hat{\underset{̲}{h}}}_{W}) & = 20 \log_{10} \frac{{∥\underset{̲}{h} - {\hat{\underset{̲}{h}}}_{W}∥}_{2}}{{∥\underset{̲}{h}∥}_{2}} . \end{matrix}

(78)

Similarly, for the iterative Wiener filter from (77), the performance measure results in

\begin{matrix} M (\underset{̲}{h}, {\hat{\underset{̲}{h}}}_{W}^{(n)}) & = 20 \log_{10} \frac{{∥\underset{̲}{h} - {\hat{\underset{̲}{h}}}_{W}^{(n)}∥}_{2}}{{∥\underset{̲}{h}∥}_{2}} . \end{matrix}

(79)

Both the conventional and iterative Wiener filters rely on the estimation of the statistics, i.e., the covariance matrix

R

and the cross-correlation vector

p

. Considering that N data samples are available, these estimates result in

\begin{matrix} \hat{R} & = \frac{1}{N} \sum_{k = 1}^{N} \underset{̲}{x} (k) {\underset{̲}{x}}^{T} (k), \end{matrix}

(80)

\begin{matrix} \hat{p} & = \frac{1}{N} \sum_{k = 1}^{N} \underset{̲}{x} (k) d (k) . \end{matrix}

(81)

Clearly, the value of N influences the quality of these estimates. Nevertheless, in practice, only a small amount of data could be available, which makes the identification process more challenging. In this case, the advantages of the iterative Wiener filter (which operates with smaller data structures) become more apparent, as will be supported in the following analysis.

The additive noise may also affect the accuracy of the Wiener solution. In relation to (13) or (22), the signal-to-noise ratio (SNR) can be defined as

\begin{matrix} SNR & = \frac{σ_{y}^{2}}{σ_{w}^{2}}, \end{matrix}

(82)

where

σ_{y}^{2}

and

σ_{w}^{2}

are the variances of the output signal and noise, respectively. In practice, the Wiener solution is satisfactory with reasonable levels of the SNR, but it is not with small values of the SNR. In our experiments, different values of the SNR were used, in order to illustrate this behavior.

All the experiments were performed using MATLAB R2018b on an Asus GL552VX device (Windows 10 OS), having an Intel Core i7-6700HQ CPU @ 2.60 GHz, with four cores, eight logical processors, and 16 GB of RAM. In the first set of experiments, we considered the case of M independent input signals, which were generated as AR(1) processes. These were obtained by filtering white Gaussian noise through an AR(1) model with a pole at 0.9. Of course, different other inputs can be used instead of the AR(1) model. The most common considerations are: (i) the input signal

x (k)

is wide-sense stationary, (ii) all of the signals (i.e.,

x (k)

,

d (k)

, and

w (k)

) have zero means, and (iii) usually, the noise

w (k)

is not correlated with the input signal

x (k)

.

In our scenario, the number of channels was

M = 4

and their impulse responses were chosen from the G168 Recommendation [63]. They were network echo paths of length

L = 500

, as depicted in Figure 3.

As mentioned before, the performance of the Wiener solution is influenced by the value of N and the level of the SNR. This is supported in Figure 7, where the performance of the conventional Wiener filter from (58) is illustrated for different values of N (from 2000 to 10,000 available data samples) and SNR levels (from 0 dB to 30 dB). As we can notice, a larger value of N (i.e.,

N ≫ M L

) is required to obtain reasonable attenuation of the misalignment. Additionally, as expected, a more accurate solution was obtained with higher SNRs.

In this context, let us first compare the performance of the conventional and iterative Wiener filters in “favorable” conditions, using a large amount of data to estimate the statistics (i.e.,

N =

10,000), in a high SNR environment (i.e.,

SNR = 30

dB). The iterative Wiener filter from (77) uses

L_{1} = 25

,

L_{2} = 20

, and different values of the decomposition parameter P (from 3 to 6). These values are much lower than

L_{2}

, which represents an important advantage, as discussed in Section 5. We should also note that for the scenario considered in this first set of experiments (using the setup from Figure 3), the rank of the matrix

H

(or

H_{st}

) was equal to

R = 5

. As we can notice in Figure 8, the iterative Wiener filter was able to outperform the conventional Wiener filter for most of the values of P. Even the case of

P = 3 < R ≪ L_{2}

led to reasonable attenuation of the misalignment. Moreover, all these iterative solutions were obtained in only a few iterations.

The advantages of the iterative Wiener filter became more apparent when less data were available to estimate the statistics. The previous simulation was repeated, but using

N = 2500

(Figure 9). According to the results shown in Figure 7, the performance of the conventional Wiener filter was affected in this case (even for

SNR = 30

dB); its misalignment level was close to

- 15

dB. This result is also confirmed in Figure 9. Most importantly, the iterative Wiener filter outperformed the conventional solution for all the values of P, thereby being more robust in this case due to the low-dimensional data structures used in its development.

For the scenario considered in this first set of experiments, the length of the spatiotemporal impulse response was

M L = 4 \times 500 = 2000

. Therefore, the case

N = 2000

represents a limit in terms of the available amount of data. As we can notice in Figure 7, the conventional Wiener filter could not cope with this limit. On the other hand, the iterative Wiener filter was still able to obtain good performances, for

P ≪ L_{2}

, as supported in Figure 10.

Furthermore, using

N < M L

is a significant challenge in terms of system identification, since apparently we deal with an "incomplete" scenario, when trying to estimate

M L

coefficients using less data. Clearly, the conventional Wiener filter cannot be used in this case. However, the iterative Wiener filter reformulates the original system identification problem (of size

M L

) as a combination of low-dimension solutions of size

P M L_{1}

and

P L_{2}

, with

P ≪ L_{2}

. Hence, it could overcome this limit of

N < M L

. This is supported in Figure 11: only

N = 1000

data samples were available for the estimation of the statistics. Even in this challenging case, the iterative Wiener filter was able to provide a good attenuation of the misalignment, in a relatively small number of iterations. This represents an important feature and a significant advantage when dealing with small amounts of data. In other words, the iterative Wiener filter exploiting the decomposition-based approach can be used to solve system identification problems with highly incomplete information, which is a condition imposed in many important applications.

The SNR is also a critical factor in system identification problems. As shown in Figure 7, the performance of the conventional Wiener filter is highly influenced by the SNR level. In the following simulations from this first set of experiments, we considered a more challenging environment, by setting

SNR = 10

dB. In this case, even for a large amount of data (i.e.,

N =

10,000), the conventional Wiener filter was outperformed by the iterative version, as supported in Figure 12.

The gain becomes more apparent when fewer data are available to estimate the statistics. Such a case is considered in Figure 13, where

SNR = 10

dB and

N = 2500

. As we can see, the conventional Wiener filter could not provide an accurate solution (as also indicated in Figure 7), whereas its iterative counterpart still attenuated the misalignment to an acceptable degree.

As previously explained (related to Figure 10 and Figure 11), using

N \leq M L

represents a critical scenario for the conventional Wiener filter. In this context, a lower SNR level made the situation even more challenging. Nevertheless, the iterative Wiener filter is reasonably robust even in these adverse conditions. This behavior is supported in Figure 14 and Figure 15, where

N = 2000

and 1500, respectively. The same noisy conditions were considered, with

SNR = 10

dB. As we can see in these figures, the best behavior was obtained for

P = 3

, which is significantly lower than

L_{2} = 20

.

Finally, the last simulation of the first set of experiments was performed in "extreme" SNR conditions, using

SNR = 0

dB. As we already know from Figure 7, the conventional Wiener filter cannot provide an accurate solution in this case, despite the value of N. In Figure 16,

N =

10,000, and

SNR = 0

dB. As expected, the conventional Wiener filter failed to provide an accurate estimate. On the other hand, the iterative Wiener filter was able to reach a much lower misalignment level, using

P ≪ L_{2}

. Therefore, it is much more robust in noisy conditions, which are frequent in practice.

The second set of experiments was performed in a more challenging situation that appeared in the context of stereophonic acoustic echo cancellation (SAEC) [65,66,67,68]. There were two acoustic echo paths to identify (for each microphone); i.e.,

M = 2

. Consequently, the reference (or microphone) signal resulted in

\begin{matrix} d (k) & = h_{R}^{T} x_{R} (k) + h_{L}^{T} x_{L} (k) + w (k) \\ = [\begin{matrix} h_{R}^{T} & h_{L}^{T} \end{matrix}] [\begin{matrix} x_{R} (t) \\ x_{L} (k) \end{matrix}] + w (k), \end{matrix}

(83)

where

h_{R}

and

h_{L}

correspond to the loudspeaker-to-microphone acoustic impulse responses (right and left, respectively), and

x_{R} (k)

and

x_{L} (k)

comprise the loudspeaker signal samples (right and left, respectively).

At first glance, from a system identification perspective, we need to identify the global impulse response

{[\begin{matrix} h_{R}^{T} & h_{L}^{T} \end{matrix}]}^{T}

. Nevertheless, in an SAEC scenario, the difficulty is many-fold. One of the main challenges is the so-called nonuniqueness problem [66,67], which comes from the fact that the loudspeaker (input) signals are linearly related. This issue can be addressed by manipulating the signals transmitted to the receiving room, e.g., using a preprocessor on the loudspeaker signals to make them less coherent, without affecting the stereo perception and the signal quality much. A simple but efficient nonlinear method uses positive and negative half-wave rectifiers on each channel, respectively [67]. In this case, the nonlinearly transformed signals become

\begin{matrix} x_{L}^{'} (k) & = x_{L} (k) + α \frac{x_{L} (k) + |x_{L} (k)|}{2}, \end{matrix}

(84)

\begin{matrix} x_{R}^{'} (k) & = x_{R} (k) + α \frac{x_{R} (k) - |x_{R} (k)|}{2}, \end{matrix}

(85)

where

α

is a parameter used to control the amount of nonlinearity; the recommended interval for this parameter is

0 < α \leq 0.5

[67]. The distortion parameter

α

(which controls the amount of nonlinearity) is provided a priori. Clearly, this distortion must be performed in such a way that the quality of the signals and the stereo effect are not degraded. Experiments reported in [67] (and also in many subsequent works) show that stereo perception is not affected even with an

α

as large as 0.5. Additionally, the audible distortion is small because of the psychoacoustic masking effects [69].

However, we should note that other methods can be used to address the nonuniqueness problem; e.g., see [70,71] and the references therein. An analysis of their influences on the overall performance of the decomposition-based approach is beyond the scope of this paper.

In our simulations, the source signal (in the transmission room) was white Gaussian noise. The acoustic impulse responses in the transmission room had 2048 coefficients. The acoustic impulse responses in the receiving room had

L = 1024

coefficients, as depicted in Figure 4. The background noise

w (k)

(in the receiving room) was white and Gaussian, with

SNR = 30

dB.

The influence of the preprocessing technique from (84) and (85) on the loudspeaker signals can be seen in Figure 17, where the performance of the conventional Wiener filter is evaluated for different values of N (i.e., the available amount of data used to estimate the statistics) and using different values of

α

, which acted as a distortion parameter. First, we can notice that the performance was clearly improved when preprocessing the input signals using positive and negative half-wave rectifiers with larger values of

α

. Additionally, even with preprocessing, a large amount of data (i.e.,

N ≫ M L

) is required for the conventional Wiener filter, in order to obtain reasonable misalignment attenuation.

In the following simulations, the identification problem was addressed using the conventional and iterative Wiener filters. The length of the global impulse response was

L = L_{1} L_{2} = 1024

, so that we set

L_{1} = L_{2} = 32

. Due to the nature of acoustic impulse responses, the matrix

H

(or

H_{st}

) was closer to full rank, so a higher value of P was required. In this context, the iterative Wiener filter involved in the experiments used

P = 6, 8, 10

, and 12. Nevertheless, these values are lower than

L_{2} = 32

.

For the results shown in Figure 18,

N =

10,000 data samples were used to estimate the statistics, and the input signals were preprocessed using positive and negative half-wave rectifiers, with

α = 0.5

. These represent favorable conditions for the identification, so the conventional Wiener filter had good accuracy (as also indicated in Figure 17). The performance of the iterative Wiener filter was improved for a larger value of P and even outperformed the conventional filter in the case of

P = 12

.

Reducing the value of the distortion parameter influenced the performance for the conventional and the iterative Wiener filters, as shown in Figure 19, where

N =

10,000 and

α = 0.3

. However, the iterative Wiener filter is more robust to this modification, since for

P = 12

it outperformed the conventional benchmark, and for

P = 8

it reached a misalignment level close to the conventional Wiener solution.

Next, for the simulations shown in Figure 20 and Figure 21, less data were used to estimate the statistics—i.e.,

N = 2500

. The other conditions were the same as in Figure 18 and Figure 19, respectively. As we can notice, the conventional Wiener filter could not obtain an accurate solution in these cases, despite the value of the distortion parameter. On the other hand, the iterative Wiener filter was still able to achieve reasonable results (for

P < L_{2}

), thereby far outperforming the conventional solution. The difference was even more apparent for a lower value of

α

, as supported in Figure 21.

A challenging case

N < M L

was considered in the following two simulations, where we set

N = 2000

(while

M L = 2048

). Two values of the distortion parameter were considered, i.e.,

α = 0.5

and

0.3

, and the results are depicted in Figure 22 and Figure 23, respectively. The conventional Wiener filter was not included in these experiments, since it could not provide an accurate solution (as indicated in Figure 17). As we can notice in both cases, despite the adverse conditions, the iterative Wiener filter was still able to attenuate the misalignment to a reasonable extent and provide a robust solution, for a value of P smaller than

L_{2}

.

As shown in Figure 17, the conventional Wiener filter could not obtain an accurate solution for small values of the distortion parameter

α

(i.e., closer to 0). For example, when using

α = 0.1

, its misalignment could not reach

- 5

dB even with a large amount of data (i.e.,

N =

10,000); and for less data (e.g.,

N = 2500

), it provided a far from accurate solution. These cases are considered in Figure 24 and Figure 25, using

α = 0.1

;

N =

10,000 and 2500, respectively. For N = 10,000 (Figure 24), the iterative Wiener filter outperformed the conventional solution for

P = 10

and 12. The decomposition using

P = 8 ≪ L_{2}

reached the misalignment level provided by the conventional Wiener filter. However, the differences (in favor of the iterative version, for all the values of P) were significant when

N = 2500

, as supported in Figure 25.

Finally, as shown in Figure 26,

N =

10,000 data samples are used to estimate the statistics, but the input signals were not preprocessed (i.e.,

α = 0

). Even if

N ≫ M L

, the conventional Wiener filter could not obtain an accurate solution in this case. On the other hand, the iterative Wiener filter with

P ≪ L_{2}

was able to outperform the conventional Wiener solution, even in this extremely difficult scenario. In other words, the influence of the nonuniqueness problem is less significant for the iterative Wiener filter. This is probably due to the fact that the matrices within the iterative Wiener filter are smaller as compared to the full matrix of size

M L \times M L

within the conventional Wiener filter.

Summarizing, the main feature of the multichannel iterative Wiener filter is that it operates with smaller data structures, i.e., of size

P M L_{1}

and

P L_{2}

, with P values far smaller than

L_{2}

. On the other hand, the conventional Wiener filter addresses a system identification problem of size

L = L_{1} L_{2}

. In our experiments, we tried to cover a wide range of scenarios, in order to properly assess the performance of the multichannel iterative Wiener filter, as compared to its conventional counterpart. Consequently, we used different values of N (the available data to estimate the statistics), different noise levels (SNR), and different values of the decomposition parameter P. Moreover, in the SAEC scenario, we also used different values of the distortion parameter

α

. As an overall conclusion, the iterative version significantly outperformed the conventional Wiener filter, especially in difficult conditions and environments, e.g., when using small amounts of data or low SNRs. These scenarios are of great importance in practice, as in real-world applications, only small amounts of data may be available (to estimate the statistics) or the filters may need to operate in noisy environments.

7. Conclusions and Perspectives

In this review paper, we have addressed the system identification problem from an efficient decomposition-based perspective. The contributions are threefold. First, we have shown how the main categories of linear SISO and MISO systems can be interpreted and related in a unified framework, taking advantage of the bilinear form representation. Second, we have demonstrated that the resulting spatiotemporal impulse response of the MISO system can be efficiently identified using the nearest Kronecker product decomposition, followed by low-rank approximations. Third, we have developed an iterative Wiener filter based on these techniques which outperforms the conventional Wiener filter in terms of accuracy and robustness of the solution. The main feature of the overall approach consists of an efficient (re)formulation of a high-dimension system identification problem (e.g., identification of a long spatiotemporal impulse response) as a combination of low-dimension solutions, which result from the optimization of shorter component filters.

In this study, we have illustrated the performance of the decomposition-based approach only in terms of the Wiener filter, which represents a benchmark tool for system identification problems. Simulation results have indicated that the iterative version is able to outperform the conventional Wiener filter, especially when a small amount of data is available for the estimation of the statistics. This represents an important performance feature, taking into account that in many real-world applications we deal with incomplete information related to the inputs/outputs of the system.

In perspective, improved solutions based on adaptive filtering should also be developed, which could further extend the applicability of the decomposition-based approach. For example, several preliminaries toward this goal can be found in [44,45]. Furthermore, a rigorous convergence analysis of these algorithms could reveal the influence of the decomposition parameters, which could be further exploited in order to improve the overall behavior. In addition, finding a practical method to determine the optimal value of the decomposition parameter P represents one of our main tasks for future works. Nevertheless, this is not a straightforward task, since the decomposition parameter depends on the nature of the system to be identified (which is unknown in practice). However, we can take advantage on some a priori knowledge of the system. For example, in the case of network echo paths (which are usually very sparse), the value of P is much smaller than

L_{2}

. For acoustic impulse responses, the value of P should be increased, but it must still be considerably lower than

L_{2}

. A preliminary study on the influence of the decomposition parameter can be found in [43].

Author Contributions

Conceptualization, J.B.; methodology, C.P.; validation, L.-M.D.; formal analysis, S.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by a grant of the Romanian Ministry of Research and Innovation, CNCS—UEFISCDI, project number PN-III-P1-1.1-PD-2019-0340, within PNCDI III.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ljung, L. System Identification: Theory for the User, 2nd ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 1999. [Google Scholar]
Benesty, J.; Huang, Y. (Eds.) Adaptive Signal Processing–Applications to Real-World Problems; Springer: Berlin, Germany, 2003. [Google Scholar]
Mohler, R.R.; Kolodziej, W.J. An overview of bilinear system theory and applications. IEEE Trans. Syst. Man Cybern. 1980, 10, 683–688. [Google Scholar]
Halawani, T.U.; Mohler, R.R.; Kolodziej, W.J. A two-step bilinear filtering approximation. IEEE Trans. Acoust. Speech Signal Process. 1984, 32, 344–352. [Google Scholar] [CrossRef]
Inagaki, M.; Mochizuki, H. Bilinear system identification by Volterra kernels estimation. IEEE Trans. Autom. Control 1984, 29, 746–749. [Google Scholar] [CrossRef]
Baik, H.K.; Mathews, V.J. Adaptive lattice bilinear filters. IEEE Trans. Signal Process. 1993, 41, 2033–2046. [Google Scholar] [CrossRef]
Forssén, U. Adaptive bilinear digital filters. IEEE Trans. Circuits Syst. II Analog. Digit. Signal Process. 1993, 40, 729–735. [Google Scholar] [CrossRef]
Ma, G.-K.; Lee, J.; Mathews, V.J. A RLS bilinear filter for channel equalization. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, SA, Australia, 19–22 April 1994; pp. III-257–III-260. [Google Scholar]
Lee, J.; Mathews, V.J. Adaptive bilinear predictors. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, SA, Australia, 19–22 April 1994; pp. III-489–III-492. [Google Scholar]
Hu, R.; Hassan, H.M. Echo cancellation in high speed data transmission systems using adaptive layered bilinear filters. IEEE Trans. Commun. 1994, 42, 655–663. [Google Scholar]
Bose, T.; Chen, M.-Q. Conjugate gradient method in adaptive bilinear filtering. IEEE Trans. Signal Process. 1995, 43, 1503–1508. [Google Scholar] [CrossRef]
Lee, J.; Mathews, V.J. Output-error LMS bilinear filters with stability monitoring. In Proceedings of the 1995 International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, USA, 9–12 May 1995; pp. 965–968. [Google Scholar]
Gesbert, D.; Duhamel, P. Robust blind joint data/channel estimation based on bilinear optimization. In Proceedings of the 8th Workshop on Statistical Signal and Array Processing, Corfu, Greece, 24–26 June 1996; pp. 168–171. [Google Scholar]
Stenger, A.; Kellermann, W.; Rabenstein, R. Adaptation of acoustic echo cancellers incorporating a memoryless nonlinearity. In Proceedings of the Proceedings of 8th Workshop on Statistical Signal and Array Processing, Corfu, Greece, 24–26 June 1996. [Google Scholar]
Stenger, A.; Kellermann, W. Adaptation of a memoryless preprocessor for nonlinear acoustic echo cancelling. Signal Process. 2000, 80, 1747–1760. [Google Scholar] [CrossRef]
Zhu, Z.; Leung, H. Adaptive identification of nonlinear systems with application to chaotic communications. IEEE Trans. Circuits Syst. I Fundam. Theory Appl. 2000, 47, 1072–1080. [Google Scholar] [CrossRef]
Kuo, S.M.; Wu, H.-T. Nonlinear adaptive bilinear filters for active noise control systems. IEEE Trans. Circuits Syst. I Regul. Pap. 2005, 52, 617–624. [Google Scholar] [CrossRef]
Abrahamsson, R.; Kay, S.M.; Stoica, P. Estimation of the parameters of a bilinear model with applications to submarine detection and system identification. Digit. Signal Process. 2007, 17, 756–773. [Google Scholar] [CrossRef]
Lopes dos Santos, P.; Ramos, J.A.; Martins de Carvalho, J.L. Identification of bilinear systems with white noise inputs: An iterative deterministic-stochastic subspace approach. IEEE Trans. Control. Syst. Technol. 2009, 17, 1145–1153. [Google Scholar] [CrossRef]
Zhao, H.; Zeng, X.; He, Z. Low-complexity nonlinear adaptive filter based on a pipelined bilinear recurrent neural network. IEEE Trans. Neural Netw. 2011, 22, 1494–1507. [Google Scholar] [CrossRef]
Tan, L.; Jiang, J. Nonlinear active noise control using diagonal-channel LMS and RLS bilinear filters. In Proceedings of the 2014 IEEE 57th International Midwest Symposium on Circuits and Systems (MWSCAS), College Station, TX, USA, 3–6 August 2014; pp. 789–792. [Google Scholar]
Huang, Y.; Skoglund, J.; Luebs, A. Practically efficient nonlinear acoustic echo cancellers using cascaded block RLS and FLMS adaptive filters. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 596–600. [Google Scholar]
Bai, E.-W.; Li, D. Convergence of the iterative Hammerstein system identification algorithm. IEEE Trans. Autom. Control 2004, 49, 1929–1940. [Google Scholar] [CrossRef]
Benesty, J.; Paleologu, C.; Ciochina, S. On the identification of bilinear forms with the Wiener filter. IEEE Signal Process. Lett. 2017, 24, 653–657. [Google Scholar] [CrossRef]
Haykin, S. Adaptive Filter Theory, 4th ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
Diniz, P.S.R. Adaptive Filtering: Algorithms and Practical Implementation, 4th ed.; Springer: New York, NY, USA, 2013. [Google Scholar]
Rupp, M.; Schwarz, S. A tensor LMS algorithm. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, QLD, Australia, 19–24 April 2015; pp. 3347–3351. [Google Scholar]
Paleologu, C.; Benesty, J.; Ciochină, S. Adaptive filtering for the identification of bilinear forms. Digit. Signal Process. 2018, 75, 153–167. [Google Scholar] [CrossRef]
Elisei-Iliescu, C.; Stanciu, C.; Paleologu, C.; Benesty, J.; Anghel, C.; Ciochină, S. Efficient recursive least-squares algorithms for the identification of bilinear forms. Digit. Signal Process. 2018, 83, 280–296. [Google Scholar] [CrossRef]
Dogariu, L.; Paleologu, C.; Ciochină, S.; Benesty, J.; Piantanida, P. Identification of bilinear forms with the Kalman filter. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018; pp. 4134–4138. [Google Scholar]
Cichocki, A.; Zdunek, R.; Pan, A.H.; Amari, S. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multiway Data Analysis and Blind Source Separation; Wiley: Chichester, UK, 2009. [Google Scholar]
Boussé, M.; Debals, O.; de Lathauwer, L. A tensor-based method for large-scale blind source separation using segmentation. IEEE Trans. Signal Process. 2017, 65, 346–358. [Google Scholar] [CrossRef]
Benesty, J.; Cohen, I.; Chen, J. Array Processing–Kronecker Product Beamforming; Springer: Cham, Switzerland, 2019. [Google Scholar]
Ribeiro, L.N.; de Almeida, A.L.F.; Mota, J.C.M. Separable linearly constrained minimum variance beamformers. Signal Process. 2019, 158, 15–25. [Google Scholar] [CrossRef]
Vasilescu, M.A.O.; Kim, E. Compositional hierarchical tensor factorization: Representing hierarchical intrinsic and extrinsic causal factors. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Anchorage, AK, USA, 4–8 August 2019. [Google Scholar]
Vasilescu, M.A.O.; Kim, E.; Zeng, X.S. CausalX: Causal eXplanations and block multilinear factor analysis. In Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 10–15 January 2021. [Google Scholar]
Vervliet, N.; Debals, O.; Sorber, L.; de Lathauwer, L. Breaking the curse of dimensionality using decompositions of incomplete tensors: Tensor-based scientific computing in big data analysis. IEEE Signal Process. Mag. 2014, 31, 71–79. [Google Scholar] [CrossRef]
Cichocki, A.; Mandic, D.; de Lathauwer, L.; Zhou, G.; Zhao, Q.; Caiafa, C.; Phan, A.H. Tensor decompositions for signal processing applications: From two-way to multiway component analysis. IEEE Signal Process. Mag. 2015, 32, 145–163. [Google Scholar] [CrossRef] [Green Version]
Sidiropoulos, N.; de Lathauwer, L.; Fu, X.; Huang, K.; Papalexakis, E.; Faloutsos, C. Tensor decomposition for signal processing and machine learning. IEEE Trans. Signal Process. 2017, 65, 3551–3582. [Google Scholar] [CrossRef]
da Costa, M.N.; Favier, G.; Romano, J.M.T. Tensor modelling of MIMO communication systems with performance analysis and Kronecker receivers. Signal Process. 2018, 145, 304–316. [Google Scholar] [CrossRef] [Green Version]
Dogariu, L.-M.; Stanciu, C.L.; Elisei-Iliescu, C.; Paleologu, C.; Benesty, J.; Ciochină, S. Tensor-based adaptive filtering algorithms. Symmetry 2021, 13, 481. [Google Scholar] [CrossRef]
Dogariu, L.-M.; Paleologu, C.; Benesty, J.; Stanciu, C.L.; Oprea, C.C.; Ciochină, S. A Kalman filter for multilinear forms and its connection with tensorial adaptive filters. Sensors 2021, 21, 3555. [Google Scholar] [CrossRef] [PubMed]
Paleologu, C.; Benesty, J.; Ciochină, S. Linear system identification based on a Kronecker product decomposition. IEEE/ACM Trans. Audio Speech Lang. Process. 2018, 26, 1793–1808. [Google Scholar] [CrossRef]
Elisei-Iliescu, C.; Paleologu, C.; Benesty, J.; Stanciu, C.; Anghel, C.; Ciochină, S. Recursive least-squares algorithms for the identification of low-rank systems. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 903–918. [Google Scholar] [CrossRef]
Dogariu, L.-M.; Paleologu, C.; Benesty, J.; Ciochină, S. An efficient Kalman filter for the identification of low-rank systems. Signal Process. 2020, 166, 107239. [Google Scholar] [CrossRef]
Benesty, J.; Paleologu, C.; Oprea, C.C.; Ciochină, S. An iterative multichannel Wiener filter based on a Kronecker product decomposition. In Proceedings of the 2020 28th European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 18–21 January 2021; pp. 211–215. [Google Scholar]
Elisei-Iliescu, C.; Paleologu, C.; Benesty, J.; Stanciu, C.; Anghel, C.; Ciochină, S. A multichannel recursive least-squares algorithm based on a Kronecker product decomposition. In Proceedings of the 2020 43rd International Conference on Telecommunications and Signal Processing (TSP), Milan, Italy, 7–9 July 2020; pp. 14–18. [Google Scholar]
Cohen, I.; Benesty, J.; Chen, J. Differential Kronecker product beamforming. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 892–902. [Google Scholar] [CrossRef]
Bhattacharjee, S.S.; Kumar, K.; George, N.V. Nearest Kronecker product decomposition based generalized maximum correntropy and generalized hyperbolic secant robust adaptive filters. IEEE Signal Process. Lett. 2020, 27, 1525–1529. [Google Scholar] [CrossRef]
Bhattacharjee, S.S.; George, N.V. Nonlinear system identification using exact and approximate improved adaptive exponential functional link networks. IEEE Trans. Circuits Syst. II Express Briefs 2020, 67, 3542–3546. [Google Scholar] [CrossRef]
Bhattacharjee, S.S.; George, N.V. Fast and efficient acoustic feedback cancellation based on low rank approximation. Signal Process. 2021, 182, 107984. [Google Scholar] [CrossRef]
Bhattacharjee, S.S.; George, N.V. Nearest Kronecker product decomposition based linear-in-the-parameters nonlinear filters. IEEE/ACM Trans. Audio Speech Lang. Process. 2021. accepted for publication. [Google Scholar]
Yang, W.; Huang, G.; Chen, J.; Benesty, J.; Cohen, I.; Kellermann, W. Robust dereverberation with Kronecker product based multichannel linear prediction. IEEE Signal Process. Lett. 2021, 28, 101–105. [Google Scholar] [CrossRef]
Kuhn, E.V.; Pitz, C.A.; Matsuo, M.V.; Bakri, K.J.; Seara, R.; Benesty, J. A Kronecker product CLMS algorithm for adaptive beamforming. Digit. Signal Process. 2021, 111, 102968. [Google Scholar] [CrossRef]
He, H.; Chen, J.; Benesty, J.; Yu, Y. Robust recursive least M-estimate adaptive filter for the identification of low-rank acoustic systems. In Proceedings of the ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6–11 June 2021; pp. 940–944. [Google Scholar]
Van Loan, C.F. The ubiquitous Kronecker product. J. Comput. Appl. Math. 2000, 123, 85–100. [Google Scholar] [CrossRef] [Green Version]
Gay, S.L.; Benesty, J. (Eds.) Acoustic Signal Processing for Telecommunication; Kluwer Academic Publisher: Boston, MA, USA, 2000. [Google Scholar]
Benesty, J.; Gänsler, T.; Morgan, D.R.; Sondhi, M.M.; Gay, S.L. Advances in Network and Acoustic Echo Cancellation; Springer: Berlin, Germany, 2001. [Google Scholar]
Paleologu, C.; Benesty, J.; Ciochină, S. Sparse Adaptive Filters for Echo Cancellation; Morgan & Claypool Publishers: Williston, VT, USA, 2010. [Google Scholar]
Liu, J.; Grant, S.L. Proportionate adaptive filtering for block-sparse system identification. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 24, 623–630. [Google Scholar] [CrossRef] [Green Version]
Golub, G.H.; van Loan, C.F. Matrix Computations, 3rd ed.; The John Hopkins University Press: Baltimore, MD, USA, 1996. [Google Scholar]
Gander, W.; Gander, M.J.; Kwok, F. Scientific Computing–An Introduction Using Maple and MATLAB; Springer: Berlin, Germany, 2014. [Google Scholar]
Digital Network Echo Cancellers; ITU-T Recommendations G.168; ITU: Geneva, Switzerland, 2002.
Bertsekas, D.P. Nonlinear Programming, 2nd ed.; Athena Scientific: Belmont, MA, USA, 1999. [Google Scholar]
Sondhi, M.M.; Morgan, D.R. Acoustic echo cancellation for stereophonic teleconferencing. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 20–23 October 1991. [Google Scholar]
Sondhi, M.M.; Morgan, D.R.; Hall, J.L. Stereophonic acoustic echo cancellation–An overview of the fundamental problem. IEEE Signal Process. Lett. 1995, 2, 148–151. [Google Scholar] [CrossRef]
Benesty, J.; Morgan, D.R.; Sondhi, M.M. A better understanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation. IEEE Trans. Speech Audio Process. 1998, 6, 156–165. [Google Scholar] [CrossRef] [Green Version]
Benesty, J.; Paleologu, C.; Gänsler, T.; Ciochină, S. A Perspective on Stereophonic Acoustic Echo Cancellation; Springer: Berlin, Germany, 2011. [Google Scholar]
Moore, B.C.J. An Introduction to the Psychology of Hearing; Academic Press: London, UK, 1989. [Google Scholar]
Romoli, L.; Cecchi, S.; Peretti, P.; Piazza, F. A mixed decorrelation approach for stereo acoustic echo cancellation based on the estimation of the fundamental frequency. IEEE Trans. Audio Speech Lang. Process. 2012, 20, 690–698. [Google Scholar] [CrossRef]
Schneider, M.; Kellermann, W. Multichannel acoustic echo cancellation in the wave domain with increased robustness to nonuniqueness. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 24, 518–529. [Google Scholar] [CrossRef]

Figure 1. A general block diagram of the LSISO system from (1).

Figure 2. A general block diagram of the LMISO system from (13).

Figure 3. Impulse responses used in the first set of simulations from Section 6 (according to the G168 Recommendation [63]), with

L = 500

and

M = 4

: (a) the first network echo path from [63], (b) the second network echo path from [63], (c) the fifth network echo path from [63], and (d) the sixth network echo path from from [63].

Figure 3. Impulse responses used in the first set of simulations from Section 6 (according to the G168 Recommendation [63]), with

L = 500

and

M = 4

: (a) the first network echo path from [63], (b) the second network echo path from [63], (c) the fifth network echo path from [63], and (d) the sixth network echo path from from [63].

Figure 4. Impulse responses used in the second set of simulations from Section 6, with

L = 1024

and

M = 2

: (a) right acoustic echo path and (b) left acoustic echo path.

Figure 4. Impulse responses used in the second set of simulations from Section 6, with

L = 1024

and

M = 2

: (a) right acoustic echo path and (b) left acoustic echo path.

Figure 5. Normalized misalignment evaluated based on (55) for different values of P, corresponding to (a) the impulse responses from Figure 3, with

M = 4

,

L = 500

,

L_{1} = 25

, and

L_{2} = 20

; and (b) the impulse responses from Figure 4, with

M = 2

,

L = 1024

, and

L_{1} = L_{2} = 32

. For better visualization, the representation is limited to

- 100

dB.

Figure 5. Normalized misalignment evaluated based on (55) for different values of P, corresponding to (a) the impulse responses from Figure 3, with

M = 4

,

L = 500

,

L_{1} = 25

, and

L_{2} = 20

; and (b) the impulse responses from Figure 4, with

M = 2

,

L = 1024

, and

L_{1} = L_{2} = 32

. For better visualization, the representation is limited to

- 100

dB.

Figure 6. Singular values (normalized with respect to the maximum one) of the matrix

H_{st}

, corresponding to (a) the impulse responses from Figure 3, with

M = 4

,

L = 500

,

L_{1} = 25

, and

L_{2} = 20

; and (b) the impulse responses from Figure 4, with

M = 2

,

L = 1024

, and

L_{1} = L_{2} = 32

.

Figure 6. Singular values (normalized with respect to the maximum one) of the matrix

H_{st}

, corresponding to (a) the impulse responses from Figure 3, with

M = 4

,

L = 500

,

L_{1} = 25

, and

L_{2} = 20

; and (b) the impulse responses from Figure 4, with

M = 2

,

L = 1024

, and

L_{1} = L_{2} = 32

.

Figure 7. Performance of the conventional Wiener filter for different values of N (number of available data samples to estimate the statistics) and SNRs. The input signals were independent AR(1) processes,

M = 4

, and

L = 500

.

Figure 7. Performance of the conventional Wiener filter for different values of N (number of available data samples to estimate the statistics) and SNRs. The input signals were independent AR(1) processes,

M = 4

, and

L = 500

.

Figure 8. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 8. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 9. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 9. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 10. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 10. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 11. Performance of the iterative Wiener filter with different values of P, using

N = 1000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 11. Performance of the iterative Wiener filter with different values of P, using

N = 1000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 30

dB.

Figure 12. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 12. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 13. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 13. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 14. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 14. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 15. Performance of the iterative Wiener filter with different values of P, using

N = 1500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 15. Performance of the iterative Wiener filter with different values of P, using

N = 1500

. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 10

dB.

Figure 16. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 0

dB.

Figure 16. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The input signals were independent AR(1) processes,

M = 4

,

L = 500

, and

SNR = 0

dB.

Figure 17. Performance of the conventional Wiener filter for different values of N (number of available data samples to estimate the statistics) and different values of

α

(the distortion parameter). The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 17. Performance of the conventional Wiener filter for different values of N (number of available data samples to estimate the statistics) and different values of

α

(the distortion parameter). The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 18. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 18. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 19. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 19. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 20. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 20. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 21. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 21. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 22. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 22. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.5

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 23. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 23. Performance of the iterative Wiener filter with different values of P, using

N = 2000

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.3

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 24. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.1

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 24. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.1

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 25. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.1

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 25. Performances of the conventional and iterative Wiener filters, using

N = 2500

. The source signal (white Gaussian noise) was preprocessed with positive and negative half-wave rectifiers, using

α = 0.1

. The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 26. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was not preprocessed (no distortion, i.e.,

α = 0

). The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Figure 26. Performances of the conventional and iterative Wiener filters, using

N =

10,000. The source signal (white Gaussian noise) was not preprocessed (no distortion, i.e.,

α = 0

). The numbers of channels were

M = 2

(stereophonic scenario),

L = 1024

, and

SNR = 30

dB.

Table 1. The multichannel iterative Wiener filter.

\begin{matrix} \underset{̲}{Data :} R, p (estimated statistics based on N data samples), \\ P (decomposition parameter), δ > 0 (regularization constant) \\ \underset{̲}{Initialization :} \\ Provide the initial coefficients of the filter {\hat{h}}_{2, p}^{(0)} (for p = 1, 2, \dots, P) : \\ {\hat{h}}_{2, p}^{(0)} = {[\begin{matrix} ϵ & 0 & \dots & 0 \end{matrix}]}^{T}, p = 1, 2, \dots, P (0 < ϵ \leq 1) \\ Compute the initial statistics {\underset{̲}{p}}_{2}^{(0)} and {\underset{̲}{R}}_{2}^{(0)} : \\ {\hat{H}}_{2, p}^{(0)} = {\hat{h}}_{2, p}^{(0)} \otimes I_{M L_{1}}, p = 1, 2, \dots, P \\ {\underset{̲}{p}}_{2}^{(0)} = {[\begin{matrix} p^{T} {\hat{H}}_{2, 1}^{(0)} & \dots & p^{T} {\hat{H}}_{2, P}^{(0)} \end{matrix}]}^{T} \\ {\underset{̲}{R}}_{2}^{(0)} = [\begin{matrix} {({\hat{H}}_{2, 1}^{(0)})}^{T} R {\hat{H}}_{2, 1}^{(0)} & \dots & {({\hat{H}}_{2, 1}^{(0)})}^{T} R {\hat{H}}_{2, P}^{(0)} \\ ⋮ & ⋱ & ⋮ \\ {({\hat{H}}_{2, P}^{(0)})}^{T} R {\hat{H}}_{2, 1}^{(0)} & \dots & {({\hat{H}}_{2, P}^{(0)})}^{T} R {\hat{H}}_{2, P}^{(0)} \end{matrix}] \\ \underset{̲}{For} n = 1, 2, \dots : \\ Compute the coefficients of the filter {\underset{̲}{\hat{h}}}_{1}^{(n)} based on (75) : \\ {\underset{̲}{\hat{h}}}_{1}^{(n)} = {({\underset{̲}{R}}_{2}^{(n - 1)} + δ I_{P M L_{1}})}^{- 1} {\underset{̲}{p}}_{2}^{(n - 1)} = {[\begin{matrix} {({\hat{h}}_{1, 1}^{(n)})}^{T} & \dots & {({\hat{h}}_{1, P}^{(n)})}^{T} \end{matrix}]}^{T} \\ Compute the statistics {\underset{̲}{p}}_{1}^{(n)} and {\underset{̲}{R}}_{1}^{(n)} : \\ {\hat{H}}_{1, p}^{(n)} = I_{L_{2}} \otimes {\hat{h}}_{1, p}^{(n)}, p = 1, 2, \dots, P \\ {\underset{̲}{p}}_{1}^{(n)} = {[\begin{matrix} p^{T} {\hat{H}}_{1, 1}^{(n)} & \dots & p^{T} {\hat{H}}_{1, P}^{(n)} \end{matrix}]}^{T} \\ {\underset{̲}{R}}_{1}^{(n)} = [\begin{matrix} {({\hat{H}}_{1, 1}^{(n)})}^{T} R {\hat{H}}_{1, 1}^{(n)} & \dots & {({\hat{H}}_{1, 1}^{(1)})}^{T} R {\hat{H}}_{1, P}^{(1)} \\ ⋮ & ⋱ & ⋮ \\ {({\hat{H}}_{1, P}^{(n)})}^{T} R {\hat{H}}_{1, 1}^{(n)} & \dots & {({\hat{H}}_{1, P}^{(n)})}^{T} R {\hat{H}}_{1, P}^{(n)} \end{matrix}] \\ Compute the coefficients of the filter {\underset{̲}{\hat{h}}}_{2}^{(n)} based on (76) : \\ {\underset{̲}{\hat{h}}}_{2}^{(n)} = {({\underset{̲}{R}}_{1}^{(n)} + δ I_{P L_{2}})}^{- 1} {\underset{̲}{p}}_{1}^{(n)} = {[\begin{matrix} {({\hat{h}}_{2, 1}^{(n)})}^{T} & \dots & {({\hat{h}}_{2, P}^{(n)})}^{T} \end{matrix}]}^{T} \\ Compute the statistics {\underset{̲}{p}}_{2}^{(n)} and {\underset{̲}{R}}_{2}^{(n)} : \\ {\hat{H}}_{2, p}^{(n)} = {\hat{h}}_{2, p}^{(n)} \otimes I_{M L_{1}}, p = 1, 2, \dots, P \\ {\underset{̲}{p}}_{2}^{(n)} = {[\begin{matrix} p^{T} {\hat{H}}_{2, 1}^{(n)} & \dots & p^{T} {\hat{H}}_{2, P}^{(n)} \end{matrix}]}^{T} \\ {\underset{̲}{R}}_{2}^{(n)} = [\begin{matrix} {({\hat{H}}_{2, 1}^{(n)})}^{T} R {\hat{H}}_{2, 1}^{(n)} & \dots & {({\hat{H}}_{2, 1}^{(n)})}^{T} R {\hat{H}}_{2, P}^{(n)} \\ ⋮ & ⋱ & ⋮ \\ {({\hat{H}}_{2, P}^{(n)})}^{T} R {\hat{H}}_{2, 1}^{(n)} & \dots & {({\hat{H}}_{2, P}^{(n)})}^{T} R {\hat{H}}_{2, P}^{(n)} \end{matrix}] \\ Compute the coefficients of the Wiener filter {\underset{̲}{\hat{h}}}_{W}^{(n)} based on (77) : \\ {\underset{̲}{\hat{h}}}_{W}^{(n)} = \sum_{p = 1}^{P} {\hat{h}}_{2, p}^{(n)} \otimes {\hat{h}}_{1, p}^{(n)} \end{matrix}

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Benesty, J.; Paleologu, C.; Dogariu, L.-M.; Ciochină, S. Identification of Linear and Bilinear Systems: A Unified Study. Electronics 2021, 10, 1790. https://doi.org/10.3390/electronics10151790

AMA Style

Benesty J, Paleologu C, Dogariu L-M, Ciochină S. Identification of Linear and Bilinear Systems: A Unified Study. Electronics. 2021; 10(15):1790. https://doi.org/10.3390/electronics10151790

Chicago/Turabian Style

Benesty, Jacob, Constantin Paleologu, Laura-Maria Dogariu, and Silviu Ciochină. 2021. "Identification of Linear and Bilinear Systems: A Unified Study" Electronics 10, no. 15: 1790. https://doi.org/10.3390/electronics10151790

APA Style

Benesty, J., Paleologu, C., Dogariu, L.-M., & Ciochină, S. (2021). Identification of Linear and Bilinear Systems: A Unified Study. Electronics, 10(15), 1790. https://doi.org/10.3390/electronics10151790

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Identification of Linear and Bilinear Systems: A Unified Study

Abstract

1. Introduction

2. Different Input Output Linear/Bilinear System Models

3. Equivalence among Systems

4. Best Approximation

5. Identification with the Wiener Filter

6. Simulation Results

7. Conclusions and Perspectives

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI