1. Introduction
The adoption of adaptive filtering in wireless communication, speech signal processing, radar signal processing and adaptive control applications, for obtaining fast convergence and stability in estimation, has been studied over many years [
1,
2,
3]. In recent decades, adaptive filtering has been receiving ever increasing attention, thanks to new developments, such as sparse signal processing [
4,
5,
6] and compressive sensing [
7,
8,
9]. The improved adaptive filtering techniques pave the way to new added-value for dealing with practical engineering problems and enable a full exploitation of the sparsity-aware property of the practical signals in our life [
4,
5,
6,
10,
11,
12,
13], such as a broadband multi-path channel. However, traditional adaptive filter algorithms used in sparse channel estimation have unique challenges, both in terms of their convergence and estimation misalignment, thus requiring new algorithms.
The broadband multi-path wireless communication channel is commonly found to be sparse based on the measurement results [
14,
15,
16,
17,
18]. For example, in broad-band wireless communication systems, a “hilly terrain (HT)” delay profile generally takes a sparsely-distributed multipath form, which has been considered for channel estimation and equalization [
15,
16]. Furthermore, the adaptive filter technique has been widely considered to get good channel state information to enhance the performance of a wireless communication system, in which the adaptive filter has been used for channel estimation [
1,
2,
3,
19]. Among the various adaptive filter algorithms, the least mean square (LMS) algorithm has been extensively used for channel estimation. Although the LMS algorithm can obtain good channel estimation performance, it is inappropriate to deal with sparse channels [
10,
11,
12,
13]. As is well-known, the broadband signal transmission technique will be implemented over a wireless channel in the next-generation of wireless communication systems [
11]. As a broadband wireless communication channel, it often incurs a frequency-selective phenomenon, which relates to channel fading behavior. Thus, the broadband channel in such environments can be defined as a sparse channel structure, which is dominated by very few large taps.
To address this problem, sparse adaptive filtering algorithms have been presented and proposed for sparse broadband channel estimation and sparse system identification applications [
4,
5,
6,
10,
11,
12,
13,
20,
21,
22,
23,
24,
25,
26,
27,
28,
29]. In [
11], a broadband sparse multi-path channel is used for the sparse channel estimation application, which is implemented by using sparse normalized LMS (NLMS) algorithms. These sparse adaptive algorithms can be categorized into two groups, namely proportionate-type algorithms [
20,
21,
22] and zero-attracting algorithms [
4,
5,
6,
10,
11,
12,
13,
23,
24,
25,
26,
27,
28]. The proportionate-type algorithms aim to assign different weighting to each coefficient according to the magnitudes of the channel coefficients [
20], which will increase the computational load. Recently, a new kind of sparse adaptive filter algorithm has been proposed on the basis of compressive sensing concepts. One of the most popular sparse adaptive filter algorithms is the zero-attracting LMS (ZA-LMS) algorithm [
4], which is realized by incorporating an
-norm penalty into the cost function of the traditional LMS algorithm to give rise to a zero attractor term. As a result, the proposed ZA-LMS can accelerate the convergence rate because of the designed zero attractor term, which quickly attracts the zero coefficients and near zero coefficients to zero. Furthermore, an enhanced ZA-LMS algorithm named the reweighted ZA-LMS (RZA-LMS) has been reported based on a log-sum penalty [
4], which provides a reweighting factor in comparison with the ZA-LMS algorithm to selectively exert zero attraction on the channel coefficients. After that, several variants of LMS based on zero attracting theory have been reported and used for channel estimation [
5,
6,
23]. Although these sparse LMS algorithms can provide good channel estimation performance for ensuring the stability of the broadband wireless propagation, they are sensitive to the scaling of the inputs [
24,
25,
26].
Consequently, the zero attracting techniques have been utilized to exploit sparse least mean fourth (LMF) [
24], the sparse affine projection (AP) algorithm [
12,
13], sparse least mean square/fourth (LMS/F) [
25,
26,
27] and other sparse adaptive filtering algorithms [
28]. Although these sparse adaptive filter algorithms can effectively improve the performance of the LMS algorithms, some of them are computationally complex, and others should tradeoff the key parameter to give a balance in performance. To take advantage of the zero attraction and to improve the drawbacks of the LMS algorithm, a mixture sparse adaptive filter algorithm denoted as sparse least-mean mixture-norm (LMMN) [
30,
31] has been proposed and used for broadband sparse channel estimation applications, including the zero-attracting (ZA) LMMN (ZA-LMMN) and reweighted ZA-LMMN (RZA-LMMN) algorithms [
32]. As a result, these sparse LMMN algorithms can provide a better performance in comparison with the LMS, LMF, LMS/F and their relevant sparse forms.
In this paper, a robust correntropy-induced metric-constrained LMMN (CIM-LMMN) algorithm is proposed to fully exploit the sparse property for the sparse broadband wireless multi-path channel estimation application. In the proposed CIM-LMMN algorithm, a correntropy-induced metric criterion is implemented within the kernel framework and integrated into the cost function of the traditional LMMN algorithm to create a zero attractor in the update function. The derivation of the proposed CIM-LMMN algorithm is given in detail, and its channel estimation behavior is evaluated over a sparse multi-path wireless communication channel. The channel estimation results obtained from computer simulation are given to show that the proposed CIM-LMMN algorithm is superior to the traditional LMS, LMF, LMS/F and their sparse forms in terms of the convergence speed rate and the steady-state misalignment.
The rest of this paper is organized as follows. In
Section 2, the traditional LMMN algorithm and ZA techniques are addressed and stated within the adaptive filtering framework. In
Section 3, we propose our CIM-LMMN algorithm in the framework of the mixed error criterion, CIM and ZA theories. In
Section 4, the channel estimation behaviors of the proposed CIM-LMMN algorithm are investigated in the context of a sparse finite impulse response (FIR) multi-path channel. Finally, we summarize this work in
Section 5.
3. Proposed Sparse CIM-LMMN Algorithm
Since the LMMN algorithm combines the advantages of both the LMS and LMF algorithms, we propose a CIM-LMMN algorithm to fully exploit the sparsity property of the broadband multi-path wireless channel. Based on the ZA techniques [
4,
5,
6,
10,
11,
12,
13,
22,
23,
24,
25,
26,
27,
28,
32] and CIM theory [
33,
34,
35,
36,
37,
38,
39,
40], we propose a robust sparse LMMN algorithm by exerting a CIM penalty on the channel coefficient vector, and we utilize this constrained term to modify the cost function of the traditional LMMN algorithm. As we know, in the CIM theory, CIM can be used for measuring a similarity in kernel space between two random vectors
and
, which can be described as [
33,
34,
35,
36,
37,
38,
39,
40]:
where
, and:
where
σ represents the kernel width,
denotes a shift-invariant Mercer kernel [
38,
39,
40,
41] and
is a joint distribution. In practical engineering, the distribution of the data is unknown. Thus, the correntropy is rewritten as
. Here, we consider a typical kernel in correntropy, which is a Gaussian kernel defined as:
where
. The CIM can be used for accounting for the number of non-zero channel coefficients, and hence, we have:
It also can be seen that the CIM is a nonlinear metric in the input space.
Then, we present our proposed CIM-LMMN algorithm by using the ZA technique and CIM theory. Herein, we integrate the Gaussian kernel-based CIM into the cost function of the traditional LMMN algorithm. As a result, we can get the cost function of the proposed CIM-LMMN algorithm, which is given by:
where
is the instantaneous error at instant
n and
. Here,
, which is used for controlling the mixture of
and
. Furthermore, we employ the stochastic gradient approximation to form the final term in Equation (
11). The stochastic gradient of Equation (
11), which defines the search direction, is illustrated as:
By introducing a step size
η, the updating equation of the proposed CIM-LMMN algorithm for each coefficient can be expressed as:
where
is a regularization parameter used for balancing the estimation error and sparsity penalty. The matrix-vector form of Equation (13) converts to:
Compared with the traditional LMMN algorithm, there is a ZA term in the iterations of the proposed CIM-LMMN algorithm. The ZA ability is controlled by parameter . Moreover, the computational complexity is still low, which needs only additions, multiplications and N exponential calculations.
From the deduction of the proposed CIM-LMMN algorithm, we can see that the
is close to the
-norm when
[
41] if
,
. Thus, the CIM given in (
10) can provide an approximation for the
-norm, which has been widely used for sparsity exploitation. By using this CIM in the cost function of the traditional LMMN algorithm, a new cost function
has been created. Similarly, the updating equation of the proposed CIM-LMMN algorithm is obtained by using a stochastic gradient minimization method, which is shown in (
14). It is found that there is an additional term
, which is an extra zero attractor. Thus, we can conclude that the proposed CIM-LMMN algorithm can be denoted as:
Thus, the proposed CIM-LMMN is also a zero-attracting adaptive filtering algorithm, which is implemented by integrating a CIM-based zero attractor term into the update equation of the traditional LMMN algorithm. Moreover, the designated zero attractor can also be realized by using
-norm [
6,
22], combined
and
norms [
4,
5,
10,
27] and smooth approximated
-norm [
5,
11].
4. Channel Estimation Performance Investigation
On the basis of the previous research, we give the channel estimation behaviors of the newly-developed CIM-LMMN algorithm over a broadband sparse multi-path wireless communication channel, which is similar to the experiment setup in [
6,
11,
13,
22,
23,
24,
25,
26,
27,
28,
32]. To verify the effectiveness of the proposed CIM-LMMN algorithm, we also compare its performance with the traditional LMS, LMF, LMS/F and their relevant sparsity-aware algorithms. In the experiments, each point for all of the used adaptive filtering algorithms is set to 500 Monte Carlo runs. In this paper, we use a multi-path FIR channel with 16 taps to evaluate the channel estimation behaviors of the CIM-LMMN algorithm. Moreover, the number of dominant channel coefficients is defined as the sparsity level, and it is marked as
K. Herein, we investigate the sparsity level for
,
and
, which is similar to [
4,
5,
6,
10,
11,
12,
13,
22,
23,
24,
25,
26,
27,
28,
32]. In fact, as a sparse channel,
K taps are non-zeros, while the other
channel taps are set to zeros. In all of the simulations,
K dominant channel coefficients are created under a random distribution, and the positions of the
K taps are also randomly distributed within the length of the designated sparse channel, which is subjected to
. A white Gaussian random signal is used as the training signal, and the noise signal
is assumed to be independent of
. The received signal is normalized, and the noise power is set to be
in all of the experiments. We use the mean square error (MSE) to measure the channel estimation behavior, and the MSE is defined as
.
Firstly, we will investigate the convergence of the proposed CIM-LMMN algorithm. To the best of our knowledge, the previously-presented mixed LMS/F algorithm converges faster than the LMS and LMF algorithm. Thus, we compare the convergence of the CIM-LMMN algorithm with the LMS, LMS/F, LMMN, ZA-LMS/F, reweighted ZA-LMS/F (RZA-LMS/F), ZA-LMMN and reweighted ZA-LMMN (RZA-LMMN) algorithms. In this experiment, the simulation parameters are:
,
,
,
,
,
,
,
,
,
,
,
,
,
,
, where
,
,
,
and
are the step sizes of the LMS, LMS/F, ZA-LMS/F, RZA-LMS/F and RZA-LMMN algorithms, while
,
and
are the regularization parameters of the ZA-LMS/F, RZA-LMS/F and RZA-LMMN algorithms. The results of the convergence study of the proposed CIM-LMMN algorithm are reported in
Figure 1. In this simulation, step sizes are chosen to obtain the same MSE for each algorithm. From this figure, one can foresee that our proposed CIM-LMMN algorithm converges fastest with the same channel estimation error floor. Compared with the previously-proposed ZA-LMMN and RZA-LMMN algorithms, the CIM-LMMN algorithm achieves a stable MSE floor in 100 fewer iterations.
Secondly, we will investigate the channel estimation performance with respect to the MSE floor to show the superior behavior of the CIM-LMMN algorithm. Furthermore, we will compare its channel estimation behavior with the LMS, LMS/F, LMMN, ZA-LMS/F, ZA-LMMN, RZA-LMS/F and RZA-LMMN algorithms. Furthermore, we also investigate the effects of the sparsity levels for
,
and
. The CIM-LMMN adaptive filtering algorithm is affected by the step size, regularization parameter, kernel width,
χ and sparsity level. Thus, in this experiment, the simulation parameters related to the above-mentioned algorithms are set to give nearly same initial convergence to compare the MSE at different sparsity levels, and these parameters are:
,
,
,
,
,
,
,
,
,
. The channel estimation performance of the proposed CIM-LMMN algorithm for
,
and
are illustrated in
Figure 2,
Figure 3 and
Figure 4, respectively. We can see that the proposed CIM-LMMN algorithm has the lowest MSE for
. Our proposed CIM-LMMN algorithm can achieve a 3-dB gain in comparison with the RZA-LMMN algorithm. The sparsity level effects on the CIM-LMMN algorithm from
to
given in
Figure 3 and
Figure 4 show that our proposed CIM-LMMN algorithm is still superior to the mentioned adaptive filtering algorithm for sparse channel estimation. It is found that the CIM-LMMN algorithm has about 2.5-dB and 1-dB gains compared with the RZA-LMMN algorithm for
and
, respectively.
As we know, the LMMN algorithm can be used as the traditional LMS algorithm for
. Thus, we herein compare the channel estimation behavior with the LMS, ZA-LMS and RZA-LMS algorithms. Furthermore, the ZA-LMMN and RZA-LMMN are included to further verify the effectiveness and the superiority of the CIM-LMMN algorithm. In this experiment, the simulation parameters for the used algorithms are:
,
,
,
,
,
,
. The channel estimation behaviors for different sparsity levels, namely
,
and
, are reported in
Figure 5,
Figure 6 and
Figure 7, respectively. When
, the steady-state error floor of our proposed CIM-LMMN algorithm is much lower than those of previously-reported algorithms. Such a low MSE is more and more significant for achieving good channel state information, which can potentially enhance the quality of the wireless communication system in practical engineering applications. With an increment of
K ranging from
to
, the sparsity of the channel is reduced, and hence, the MSE floor is increased. However, our proposed CIM-LMMN algorithm still outperforms the previously-reported ZA-LMMN and RZA-LMMN algorithms for handling sparse channel estimation.
On the other hand, as for the LMMN algorithm, it can also be used as an LMF algorithm when
χ is set to zero. To quantify the robust performance of the CIM-LMMN algorithm, the channel estimation is also investigated and compared with the LMF, ZA-LMF and RZA-LMF algorithms. Herein, to obtain nearly the same initial convergence rate, the parameters used in this experiment are set to be:
,
,
,
,
,
,
,
. The channel estimation performance compared with the LMF algorithms is shown in
Figure 8,
Figure 9 and
Figure 10. As expected, the proposed CIM-LMMN algorithm is superior to the earlier reported LMF, LMS/F, LMMN, ZA-LMF, RZA-LMF, ZA-LMMN and RZA-LMMN algorithms. Furthermore, we can see that our proposed CIM-LMMN algorithm is better than the mentioned algorithms with respect to both the initial convergence and the steady-state performance. Compared to the RZA-LMMN algorithm, the convergence of the proposed CIM-LMMN algorithm is much faster for
, which is because the CIM-LMMN integrates a CIM penalty term to account for the non-zero channel coefficients. It is worth noting that the CIM-LMMN algorithm can achieve much gain in comparison with the RZA-LMMN for
without sacrificing the convergence.
From the above discussions and the previous studies [
26,
27], we know that the RZA-LMS/F algorithm is better than the RZA-LMS and RZA-LMF algorithms for sparse channel estimation. Moreover, the RZA-LMMN algorithm outperforms the ZA-LMMN and traditional LMMN algorithms for dealing with sparse signals, which has been investigated in [
32]. Thus, we will investigate the tracking behavior of the proposed CIM-LMMN algorithm in comparison with the RZA-LMS/F and RZA-LMMN algorithms. In this simulation, the simulation parameters are set to obtain the desired final MSE of each algorithm with the same convergence rate at the initial stage, and these parameters are:
,
,
,
,
,
,
,
. Herein, the signal-to-noise ratio is set to 30 dB. The simulation result is shown in
Figure 11. It is found that the proposed algorithm can track the varying of the channel well, and it still outperforms the RZA-LMS/F and RZA-LMMN algorithms with respect to the MSE floor at the same initial convergence rate.
We next draw conclusions in terms of the considered channel estimation performance. The proposed CIM-LMMN algorithm can provide the fastest convergence rate and lowest channel estimation misalignment compared with traditional LMS/F, LMS, LMF and their related ZA algorithms. However, as the goal is to get both fast convergence and low misalignment, a novel technique named as CIM has been employed to utilize the sparsity property of the broadband multi-path wireless channel. This is because the CIM measure can account for the dominant channel taps and is used to force the non-dominant channel coefficients to zero quickly. Moreover, the ZA ability can be controlled by to provide good channel estimation performance. Thus, we can say that the CIM-LMMN algorithm is robust and effective for sparse adaptive channel estimation.