Iterative Joint Estimation Procedure of Channel and PDP for OFDM Systems

The power-delay profile (PDP) estimation of wireless channels is an important step to generate a channel correlation matrix for channel linear minimum mean square error (LMMSE) estimation. Estimated channel frequency response can be used to obtain time dispersion characteristics that can be exploited by adaptive orthogonal frequency division multiplexing (OFDM) systems. In this paper, a joint estimator for PDP and LMMSE channel estimation is proposed. For LMMSE channel estimation, we apply a candidate set of frequency-domain channel correlation functions (CCF) and select the one that best matches the current channel to construct the channel correlation matrix. The initial candidate set is generated based on the traditional CCF calculation method for different scenarios. Then, the result of channel estimation is used as an input for the PDP estimation whereas the estimated PDP is further used to update the candidate channel correlation matrix. The enhancement of LMMSE channel estimation and PDP estimation can be achieved by the iterative joint estimation procedure. Analysis and simulation results show that in different communication scenarios, the PDP estimation error of the proposed method can approach the Cramér–Rao lower bound (CRLB) after a finite number of iterations. Moreover, the mean square error of channel estimation is close to the performance of accurate PDP-assisted LMMSE.


Introduction
The OFDM systems based on cyclic prefix (CP) can resist the frequency selective channel and allow for simple one-tap equalization to be performed [1]. However, the performance of the receiver mainly depends on the accuracy of channel estimation. Among the channel estimation methods, least square (LS) estimation is one of the most widely used methods. Although the implementation of LS estimation has low computation complexity, the performance is greatly affected by noise [2]. LMMSE estimation is the theoretically optimal solution when minimizing the MSE. In addition to the noise power, LMMSE also features a channel correlation matrix which depends entirely on channel statistical information. However, in a wide range of systems, the statistical characteristics are unknown at the receiver [3]. Therefore, it is very challenging to achieve high-performance LMMSE estimation.
The channel PDP is often used to describe the statistical characteristics of frequency selective channels. The time dispersion or delay spread of wireless channels is one of the most important parameters in adaptive OFDM systems [4]. For example, the adaptation of OFDM systems such as adaptive modulation and coding, adaptive CP, pilot pattern, and equalization technology [5][6][7] can adjust the transceiver parameters according to the changes of channel time dispersion. These methods enhance system throughput and transmission reliability [8]. When the channel characteristics of time dispersion are available, the inter-symbol interference can be reduced by adjusting the length of the CP [9].
The LMMSE channel estimation also requires the time dispersion information of the channel [10].
The statistics of channel time dispersion include the mean delay spread and the maximum delay spread, which can be estimated by applying cross-correlation of differentially correlated received signals or setting a threshold for the estimated channel impulse response (CIR) [11,12]. However, these condensed parameters may be oversimplified and cannot fully represent the time dispersion characteristics of the channel. To obtain PDP information, the estimation method based on the maximum likelihood criterion of CP is proposed [13]. Although this class of method can obtain better estimation accuracy, the computational complexity is prohibitively high. In addition, the data-assisted methods using a training sequence or pilot can achieve better estimation performance [14]. To solve the problem of generating channel correlation matrix, one approach is to utilize simple PDP models such as the exponential decay or uniform-model. The channel correlation matrix for these PDPs can be simply calculated by estimating the mean delay and the rootmean-squared (RMS) delay spread [15]. However, how to choose an appropriate PDP model is still an open question. The mismatch of a PDP-model and the estimation error of delay parameters will degrade the performance of LMMSE. In response to this problem, a large number of works have proposed to use a pilot to estimate channel PDP, or to achieve LMMSE estimation through approximate PDP [16,17]. Although these methods can effectively avoid the mismatch in frequency domain and the distortion caused by the zero subcarrier, they still cannot guarantee the optimal performance of LMMSE estimation. To solve the problem of PDP model mismatch, the LMMSE estimator with CCF selection function is considered in [18]. The selected CCF is used to construct the channel correlation matrix to avoid the performance loss.
Due to the fact that the PDP is the inverse Fourier transform of the channel frequency correlation, precise channel frequency response can be utilized to estimate exact PDPs. It can be seen that most recent research uses matrix approximation or an inaccurate PDP model to generate channel correlation matrices. Thus, they cannot avoid the performance deterioration of LMMSE. Furthermore, applying independent PDP and LMMSE estimation algorithms is inefficient. This paper proposes an iterative joint estimation procedure for channel LMMSE and PDP. Motivated by [18], the best-matched CCF vector is selected to construct the channel correlation matrix for channel LMMSE estimation. Then, the estimated channel frequency response (CFR) is used for PDP estimation. Iteratively, the estimated PDP can be exploited to update the CCF candidate set. In this way, the estimation performance of both channel and PDP can be improved.
The rest of this paper is organized as follows: Section 2 gives a brief overview of the related work in this field. Section 3 presents the system model and the conventional channel estimation. The proposed algorithm is described in Section 4. Simulations in Section 5 validate the performance of the method. Section 6 concludes this paper and envisages the future work.
Notation: In the following, lower case letter a is used for scalar variable, capital letter A is used for vector variable, the bold letter A is used for matrix, A[m, n] denotes the (m, n)th element of A. N represents the set of natural numbers. (·) T , (·) * and (·) −1 denote transpose, complex conjugate, and inverse operation, respectively. E{·} represents the expectation operator, |·| and ∥·∥ denotes the modulus of a variable and the Euclidean norm of a vector, respectively.

Related Work
The methods to obtain the statistical characteristics of time dispersion of channels can be roughly divided into CP-based methods and pilot-based methods. Authors of [19] suggest that the delay and power of multipath components can be determined by the kneepoint locations and the gradients of the curve of the CP correlation function. Cui [20] estimates PDP using the maximum likelihood (ML) estimation based on the CP of the OFDM signal. Furthermore, an approximate ML estimation algorithm of PDP and noise variance is proposed in [21] through a simplified joint parameter estimation model. Although the proposed algorithm reduces the complexity of ML estimation, only the suboptimal solution can be obtained. In [13], the LS solution of channel tap-power and its relationship with the correlation coefficient of CP are derived, along with the hypothesis test for identifying the correct path. The proposed method can approach the CRLB of tap-power estimation in quasi-stationary fading channels. However, the existing CP-based methods above may require a large number of OFDM symbols and lead to prohibitively high computational complexity. Instead of using CP, another class of PDP estimation is the data-aided method that exploits the training sequences or pilots. In [22], the level-crossing rate (LCR) in the frequency domain was used for studying the characterization of frequency selectivity channel. It indicates that LCR is proportional to RMS delay spread, and scaling factor depends on the threshold level of observed LCR. However, accurate estimation of LCR requires intensive frequency sampling of the channel response and is highly sensitive to noise, which leads to low spectrum efficiency and reliability. Manhattan distance is used as a symbol-bysymbol correlator to estimate the channel delay parameters in [23]. This method has low computation complexity but limits estimation accuracy. In [24], the Fourier transform of the channel frequency correlation is used to estimate the PDP, and the channel-magnitudebased algorithm is proposed to overcome the sensitivity to timing errors. However, the estimation performance is greatly limited by the accuracy of the channel estimation.
For LMMSE estimation, the channel correlation matrix can be generated by using the simplified PDP model. However, the mismatch of the correlation function will cause estimation performance loss. To reduce the mismatch, Kim and Im [25] suggest using the pilot symbols of all transmit antenna ports to estimate the PDP for multiple-input multipleoutput (MIMO) systems. The proposed method alleviates the spectral leakage and the residual noise caused by insufficient samples of the estimated CIR. In [10], the PDP is approximately represented by the curvature of the channel amplitude, and then the Wiener filter coefficients corresponding to the subcarriers are generated by the estimation results. The vector quantization method [26] avoids real-time matrix inversion calculation by calculating the LMMSE filter matrices offline. The filter matrices can be stored in a look-up table and be used for the mapping from the code vector to the quantized LMMSE filter coefficient. To reduce the computational complexity of the channel correlation matrix, discrete cosine transform (DCT)-based methods [27,28] approximate the channel correlation matrix as a diagonal matrix, and the singular value decomposition (SVD) method [29] divides the channel correlation matrix into small submatrix products. However, the approximation in these methods results in performance deterioration. Additionally, Mei [18] proposes to design a candidate set of CCF vectors in advance and then selects the optimal one to construct the channel correlation matrix. This scheme can still achieve good MSE performance when the channel knowledge is completely unknown.

System Model
The wide-sense stationary-uncorrelated scattering (WSSUS) model is often used to represent time-varying fading channels, in which the instantaneous channel impulse response for baseband can be written by where L denotes the number of channel paths, α l (t) and τ l represent the complex amplitude and time delay of the lth multipath component, respectively. For different multipath component l, α l (t) is a wide-sense stationary narrow-band complex Gaussian process that is mutually independent. The mean delay τ and RMS delay spread τ RMS of the channel are expressed as By the Fourier transform of the channel impulse response h(t, τ) relative to τ, the channel frequency response at instant t can be obtained as Assuming that the number of subcarriers in the OFDM system is N, the CFR of the mth OFDM block and the kth subcarrier in an OFDM data frame are expressed as where k = 0, 1, · · · , N − 1, T b and ∆ f represent symbol duration and subcarrier spacing, respectively. Using X[m, k] to represent the transmitted data, and removing the CP before the discrete Fourier transform (DFT), the received symbol can be written as where d m is timing error, and W[m, k] is additive white Gaussian noise (AWGN).
Assuming that the number of pilot subcarriers used for estimation is N P and omitting the variable m, the LS channel estimation criterion is given bŷ LMMSE is the best linear estimation method in terms of MSE performance. Essentially, it involves a weighting matrix Φ to correct the influence of noise on the LS estima- Omitting the derivation process, it can be derived as In order to reduce the number of matrix inversions, the simplified LMMSE channel estimation is given by [30] where β = E |x[n]| 2 E |1/x[n]| 2 , n = 0, 1, · · · , N − 1 is a constant and depends on the modulation type. SNR is the average signal-to-noise ratio. R HĤ LS is the cross-correlation matrix of the accurate channel vector and the LS estimation vector, and R HH is the autocorrelation matrix of the channel. The elements in R HĤ LS and R HH satisfy [31] is the zeroth-order Bessel function of the first kind. The R f H can be obtained by the Fourier transform of PDP. In addition, if the statistical information of the channel is available, and the PDP decays exponentially, the CCF can be obtained by R f [31]. However, the PDP information is usually unknown to the receiver. Therefore, it is challenging to achieve accurate LMMSE estimation.

Iterative Joint Estimation Algorithm for Channel and PDP
Each frame of the transmitted signal consists of a training sequence and a certain number of data symbols. The CFR of LS estimationĤ LS can be obtained according to (7) by using the training sequence. According to (8), LMMSE estimation needs the channel correlation matrix R HH and SNR. We also assume that the signal power is normalized. Noise power is generally easy to be estimated [32,33]. Therefore, the difficulty of LMMSE estimation mainly depends on the acquisition of the R HH . Enabling the estimator with the ability to select CCFs can effectively solve this problem, and we propose a looped algorithmic architecture that enables joint PDP and channel estimation. The execution block diagram of the algorithm at the receiver is shown in Figure 1. R k k f [31]. However, the PDP information is usually unknown to the receiver. Therefore, it is challenging to achieve accurate LMMSE estimation.

Iterative Joint Estimation Algorithm for Channel and PDP
Each frame of the transmitted signal consists of a training sequence and a certain number of data symbols. The CFR of LS estimation LŜ H can be obtained according to (7) by using the training sequence. According to (8), LMMSE estimation needs the channel correlation matrix HH R and SNR. We also assume that the signal power is normalized.
Noise power is generally easy to be estimated [32,33]. Therefore, the difficulty of LMMSE estimation mainly depends on the acquisition of the HH R . Enabling the estimator with the ability to select CCFs can effectively solve this problem, and we propose a looped algorithmic architecture that enables joint PDP and channel estimation. The execution block diagram of the algorithm at the receiver is shown in Figure 1. Obviously, the proposed algorithm architecture can be divided into two loops: LMMSE estimation loop and PDP estimation loop. In the following, we describe the specific implementation process of the two loops. Firstly, we pre-generate a CCF candidate set ℜ, and the CCF vectors are calculated according to different scenarios. Secondly, the CCF selection algorithm is executed to select the best-matched CCF vector R f q * . Then, the R f q * is used to calculate the channel correlation matrixR opt and complete the LMMSE estimation. Further, we use the CFRĤ LMMSE as the input to PDP estimation, and the time dispersion parameters can be estimated. In addition, Fourier transform is applied to the output of PDP estimation, and the result R f Est is used to update the CCF set to improve the LMMSE estimation performance. Obviously, the proposed algorithm architecture can be divided into two loops: LMMSE estimation loop and PDP estimation loop. In the following, we describe the specific implementation process of the two loops.

LMMSE Estimation Loop
where N o = ⌈∆ f τ max N⌉ = ⌈τ max N/T s ⌉, T s , and τ max are the system sampling interval (OFDM symbol duration) and the possible maximum delay spread, respectively, and τ max usually takes the length of CP. Moreover, when the timing error is considered, the effective channel frequency-domain correlation function (E-CCF) [31] can be calculated from the typical value of the timing error or the statistical distribution of the timing error. The specific expression can be given as where P(d m ) is the probability density function of the timing error d m . The R f [∆k] of (11) can be obtained by the Fourier transform of the PDP. When the channel statistics are completely unknown, the R f [∆k] can be substituted by the R f obtained by (10).
After generating ℜ, the key step is CCF selection. The training sequence is divided into two groups X 1 and X 2 at equal intervals in the order of subcarrier index, i.e., 1] ]. The CFR vectors corresponding to X 1 and X 2 are denoted as H 1 and H 2 , respectively. The corresponding LS estimation vectors are denoted asĤ LS_1 andĤ LS_2 , respectively. Then, we use X 1 to perform MMSE interpolation to estimate H 2 , and use X 2 to perform MMSE interpolation to estimate H 1 . The interpolation resultsĤ INT_1 andĤ INT_2 are obtained by The evaluation index of the CCF selection algorithm is given by In Equation (12),R is the corresponding channel correlation matrix, whose elements come from the candidate CCF vector R f q . By evaluating each candidate CCF vector according to Equation (13), the CCF that best matches the current channel can be selected as Finally, the CFR of the LMMSE estimator is obtained bŷ Different ℜ may produce different estimation results. Therefore, with updated ℜ, we perform Equations (12)- (15) to complete a new LMMSE estimation loop.

PDP Estimation Loop
The PDP estimation can rely on the principle that the channel frequency correlation and the channel PDP are Fourier transform pairs. The instantaneous channel frequency correlation can be calculated by using the estimated CFRĤ LMMSE as where u is the index of the frame. The E u,k {·} is the expectation over u and k. The expectation of multiple frames can effectively reduce the impact of noise. Then, the inverse discrete Fourier transform (IDFT) is performed on the obtained channel correlation value to calculate the PDP where P l = E |α l | 2 is the power of the lth channel tap. In addition, the taps whose power is less than 25 dB of the strongest tap power are omitted. Then, τ and τ RMS are calculated according to Equations (2) and (3), respectively. Moreover, it is worth mentioning that when using LS-estimated CFRĤ LS , the channel frequency correlation obtained according to (16) can be simplified into the sum of effective channel frequency correlation and noise correlation values where σ 2 Z is the variance of channel estimation error caused by noise, and δ(·) is Dirac delta function. To remove the influence of noise term, we use the correlation value of nonzero lag to fit the parabola by the LS method. Then, replace the correlation value R fĤ LS (0) of zero lag with the polynomial coefficients obtained by fitting. By this means, we can approximate the value of R f H (∆k) as R f H (∆k) and the PDP can be estimated by the inverse Fourier transform of R f H (∆k). It should be noted that the performance of the proposed PDP estimation algorithm is limited to the accuracy of channel estimation. Figure 2 shows the amplitude curve of normalized channel frequency correlation calculated according to (17) for accurate CFR, LS estimation, and the proposed LMMSE estimation under the ITU-VA channel when the SNR is 5 dB. The dotted lines represent the simulation curves when STO obeys the uniform distribution of [−10, 0]. As can be observed, in the case of perfect synchronization, the channel frequency correlation amplitude of the LMMSE is lower than the accurate CFR but higher than that of the LS channel estimate. In the presence of STO, the channel frequency correlation amplitude of LMMSE and LS shows an obvious reduction. However, the correlation amplitude of LMMSE is still higher than that of LS. That is to say, the proposed LMMSE estimator is able to alleviate the noise effects compared with LS estimation and do not need any prior channel knowledge. the channel frequency correlation amplitude of the LMMSE is lower than the accurate CFR but higher than that of the LS channel estimate. In the presence of STO, the channel frequency correlation amplitude of LMMSE and LS shows an obvious reduction. However, the correlation amplitude of LMMSE is still higher than that of LS. That is to say, the proposed LMMSE estimator is able to alleviate the noise effects compared with LS estimation and do not need any prior channel knowledge. Therefore, we consider using the CFR obtained by the proposed LMMSE estimator as the input of the PDP estimation algorithm instead of the LS estimation value. In addition, with the output of the new LMMSE estimation loop, we can perform a new PDP estimation loop to get a more accurate PDP.

Iterative Joint Estimation Procedure
It is evident that the LMMSE estimator can obtain a better CFR in terms of MSE performance than the LS estimator when the channel information is unavailable. Moreover, the performance of the proposed LMMSE estimator depends on the candidate CCF vectors. Thus, we can update the candidate CCF set according to the latest PDP estimation. In this case, the performance of the LMMSE estimation is improved by the updated CCF set. Since the results of each estimation process can be applied to the other one, the proposed algorithm is performed in an iterative manner, as shown in Figure 3.
We denote the number of iterations as ∈  , i i and max i is the maximum number of iterations with = 0 i representing the initial process. Firstly, the initial CCF set is generated according to (10) and (11) for different scenarios, Therefore, we consider using the CFR obtained by the proposed LMMSE estimator as the input of the PDP estimation algorithm instead of the LS estimation value. In addition, with the output of the new LMMSE estimation loop, we can perform a new PDP estimation loop to get a more accurate PDP.

Iterative Joint Estimation Procedure
It is evident that the LMMSE estimator can obtain a better CFR in terms of MSE performance than the LS estimator when the channel information is unavailable. Moreover, the performance of the proposed LMMSE estimator depends on the candidate CCF vectors. Thus, we can update the candidate CCF set according to the latest PDP estimation. In this case, the performance of the LMMSE estimation is improved by the updated CCF set. Since the results of each estimation process can be applied to the other one, the proposed algorithm is performed in an iterative manner, as shown in Figure 3.
We denote the number of iterations as i, i ∈ N and i max is the maximum number of iterations with i = 0 representing the initial process. Firstly, the initial CCF set ℜ 0 = R f Inl_1 , R f Inl_2 , · · · , R f Inl_Q is generated according to (10) and (11) for different scenarios, and the LS estimation valueĤ LS is calculated by Equation (7). The ℜ 0 andĤ LS are the inputs of the iterative algorithm. According to the principle of the proposed LMMSE estimator, the CCF selection algorithm is executed to select the CCF vector, R f q * , that best matches the current channel by Equations (12)- (14). Using R f q * to construct channel correlation matrixR opt according to (16), the initial valueĤ i=0 LMMSE can be obtained by using Equation (15). According to (17),Ĥ i=0 LMMSE is used to calculate the channel frequency correlation, and the results of multiple frames are counted to obtain the expectation. Then, the PDP estimation is completed by Equation (18). On this basis, we can calculateτ i=0 RMS according to (3), and a CCF vector R f Est can be obtained by performing DFT on the result of PDP estimation. We add i by 1 after each LMMSE estimation and PDP estimation, which means that a new estimation process will begin. Then, we add R f Est,i−1 to the CCF set to obtain a new CCF vector set Entropy 2022, 24, 1664 9 of 16 according to (3), and a CCF vector Est R can be obtained by performing DFT on the result of PDP estimation. We add i by 1 after each LMMSE estimation and PDP estimation, which means that a new estimation process will begin. Then, we add −   Based on the above procedure, the channel LMMSE and PDP estimation algorithms are executed iteratively to obtain the channel estimationĤ i LMMSE and the estimated RMS delayτ i RMS of the ith iteration. Because ℜ i has more accurate CCF vectors than ℜ i−1 , we can ensure that the performance of each iteration is improved. After each execution of the PDP estimation algorithm, the value of i is judged. The algorithm ends when i > i max . The proposed scheme is summarized as Algorithm 1.
UsingĤ i E−LMMSE to estimate PDP and get the result R f Est,i ,τ i RMS (17) and (18)

Analysis of Computational Complexity
Generally, the LMMSE-based algorithm has significantly higher computational complexity than the one based on the LS estimation proposed in [24]. The process of determining q * needs to calculate the evaluation index ξ q from all candidate CCFs. Therefore, the complexity of the proposed LMMSE estimation loop is proportional to the number of the candidate CCF vectors. Using Equations (12)- (14) to calculate ξ q requires two N/2order matrix inversion operations. For the candidate CCF set ℜ, we need to calculate Q times of ξ q according to (13), corresponding to 2Q times of N/2-order matrix inversions. When the SNR and q * are determined, the LMMSE estimator needs to calculatē , once in Equation (15) with N 3 multiplications and in the Norder matrix inversion. Meanwhile, the LS resultĤ LS also needs N times of multiplications. Finally, considering the number of iterations i max , the multiplication and matrix inversion times are counted independently. The results are shown in Table 1, where L r is the length of CP. It can be seen that, compared with the matrix approximation method [27] and the singular value decomposition (SVD) method [29], the LMMSE estimator with CCF selection [18] has a relatively large amount of computation. It is evident that the amount of calculation of algorithm suggested in [24] is minimal since it only performs PDP estimation and needs no matrix inversion operations. Compared with [18], the proposed algorithm only slightly increases the computational complexity when performing both channel LMMSE and PDP estimation.

Simulation Results and Analysis
In this section, the performance of the proposed iterative joint estimation algorithm is analyzed by simulation. We build an OFDM simulation platform in which the environment parameters are shown in Table 2, and the rest of the system parameters are set according to the IEEE 802.11ax standard. In this paper, we consider using an OFDM symbol-length Zadoff-Chu sequence as the training sequence for channel estimation. The transmitted frame contains 50 data symbols. The results are obtained through 1000 Monte Carlo simulations. In order to evaluate the performance of channel PDP estimation, we use the normalized mean square error (NMSE) of channel second-order statistics τ RMS as the performance metric which can accurately reflect the deviation of the estimator from the theoretical value.
The NMSE of τ RMS is defined as NMSE(τ RMS ) ≜ E |τ RMS −τ RMS | 2 /τ 2 RMS , in whichτ RMS represents the estimated values. In addition, the CRLB of the τ RMS estimator is used as the comparison [24] CRLB(τ RMS ) where M u is the number of frames used for estimation. The performance of channel estimation is evaluated by the mean square error (MSE) of CFR: MSE(H) = E ∥H −Ĥ∥ 2 and compared with three typical LMMSE estimation methods [18,27,29].
Since the proposed algorithm is based on the LMMSE estimator, the initial estimation valueĤ i=0 LMMSE directly affects the performance of the proposed algorithm. The key to the LMMSE estimator lies in the design of candidate set ℜ 0 . In order to comprehensively investigate the performance of the proposed algorithm, we divide the possible communication scenarios into three types. Scenario 1: It is assumed that all possible channel models' statistics are available. For example, ITU-R defines the four models shown in Table 3. In this scenario, ℜ 0 can be composed of all possible PDPs. In addition, in order to improve the robustness of the estimator, N o takes the value of {4, 16, 64}, and the R-CCFs are also added to ℜ 0 according to (10). The simulated channel of each transmission is randomly selected among four models. according to (10). In addition, in order to expand the number of CCF, the E-CCF with the value of d m = [−5, −1] is also added to ℜ 0 . In the simulation of this scenario, the channel is fixed as the ITU-VA model. Scenario 3: On the basis of scenario 2, this scenario further considers the non-ideal factor: STO. It is assumed that d m obeys the uniform distribution of [−10, 0]. In this scenario, ℜ 0 and other simulation parameters are the same as in scenario 2. Figure 4 shows the relationship between the NMSE of τ RMS and the number of frames used to estimate in Scenario 1 and Scenario 2. The label "Yuceks's Method" represents the algorithm based on the LS estimation proposed in [24]. The "i = 0" means the initial result when the algorithm does not start an iteration. That is to say, the candidate CCF set of the LMMSE estimator is ℜ 0 . In this case, the estimated result of PDP is calculated withĤ i=0 LMMSE as the input. Because the algorithm in [24] is performed based on LS channel estimation, the results are the same in two scenarios, and the MSE are significantly higher than those of the proposed algorithm. The initial result of the proposed algorithm in scenario 1 is better than that in scenario 2. In scenario 1, after one iteration, the NMSE of τ RMS is close to CRLB, whereas in scenario 2, it takes four iterations to get a similar result. It is because the ℜ 0 of scenario 1 contains all possible PDPs and it make the algorithm efficient.   M is the number of frames. It can be seen that when the SNR is lower than 4 dB, the performance of the algorithms is improved with the increase of SNR, and the performance of the proposed algorithm is better than the algorithm proposed in [24]. However, when SNR rises to around 5 dB, the NMSE of RMS is SNR-independent. This is due to the limitation of the number of statistical frames.  Figure 5 shows the NMSE of τ RMS as a function of the SNR in Scenario 1 and Scenario 2, where M u is the number of frames. It can be seen that when the SNR is lower than 4 dB, the performance of the algorithms is improved with the increase of SNR, and the performance of the proposed algorithm is better than the algorithm proposed in [24]. However, when SNR rises to around 5 dB, the NMSE of RMS is SNR-independent. This is due to the limitation of the number of statistical frames. Figure 6 simulates the performance of the LMMSE estimation in Scenario 1 and Scenario 2. It should be noted that the results of [24] are also the results of the proposed algorithm when i = 0. It can be seen that, compared with the DCT based LMMSE and the SVD based LMMSE in [27,29], the LMMSE with CCF selection is more robust and the proposed iterative algorithm can further improve the performance of LMMSE, especially in scenario 2. The proposed algorithm approaches the accurate LMMSE result after 1 and 4 iterations, respectively, in the two scenarios. Figures 7 and 8 show the simulation results in scenario 3. It can be seen that the performance of PDP estimation is greatly affected by STO, compared with scenarios 1 and 2. The NMSE of τ RMS is similar compared to the third iteration and the second iteration. It indicates that the performance of the algorithm cannot be significantly improved after the   Figure 5 shows the NMSE of τ RMS as a function of the SNR in Scenario 1 and Scenario 2, where u M is the number of frames. It can be seen that when the SNR is lower than 4 dB, the performance of the algorithms is improved with the increase of SNR, and the performance of the proposed algorithm is better than the algorithm proposed in [24]. However, when SNR rises to around 5 dB, the NMSE of RMS is SNR-independent. This is due to the limitation of the number of statistical frames.  Figure 6 simulates the performance of the LMMSE estimation in Scenario 1 and Scenario 2. It should be noted that the results of [24] are also the results of the proposed algorithm when = 0 i . It can be seen that, compared with the DCT based LMMSE and the SVD based LMMSE in [27,29], the LMMSE with CCF selection is more robust and the proposed iterative algorithm can further improve the performance of LMMSE, especially in scenario 2. The proposed algorithm approaches the accurate LMMSE result after 1 and 4 iterations, respectively, in the two scenarios. The NMSE of τ RMS is similar compared to the third iteration and the second iteration. It indicates that the performance of the algorithm cannot be significantly improved after the third iteration. In addition, the performance of the proposed LMMSE estimator still has great advantages over the other three algorithms in this scenario. The NMSE of τ RMS is similar compared to the third iteration and the second iteration. It indicates that the performance of the algorithm cannot be significantly improved after the third iteration. In addition, the performance of the proposed LMMSE estimator still has great advantages over the other three algorithms in this scenario.

Conclusions and Future Work
In this paper, we propose a joint estimation algorithm for channel and PDP, including the LMMSE estimate loop and PDP estimate loop. The proposed LMMSE estimator has the ability to select the best-matched CCF, which solves the problem of channel correlation matrix generation. The CFR obtained is used for PDP estimation, and the results can be applied to update the CCF set. Since both PDP and channel estimation requires the result from the other one, an iterative joint algorithm is proposed. The proposed algorithm is evaluated by the NMSE of the second-order statistic of PDP and the channel MSE. Simulation results show that the performance of PDP estimation is better than that of existing LS-based methods in different scenarios. The NMSE of RMS can approach CRLB after a limited number of iterations. The results of this paper can be used to obtain accurate PDP results and improve the performance of channel estimators in OFDM systems. As for future extensions of this work, we will consider further performance improvement by the joint estimation process of channel response and Doppler spread. By generating the candidate set of channel correlation function and choosing the best-matched correlation function vector, the channel frequency response estimated by LMSME can be used to estimate the Doppler spread. Therefore, the estimated results can update the channel correlation matrix. A joint estimation process of channel and the Doppler spectrum can also be obtained. This approach is appealing especially in adaptive OFDM systems.

Conclusions and Future Work
In this paper, we propose a joint estimation algorithm for channel and PDP, including the LMMSE estimate loop and PDP estimate loop. The proposed LMMSE estimator has the ability to select the best-matched CCF, which solves the problem of channel correlation matrix generation. The CFR obtained is used for PDP estimation, and the results can be applied to update the CCF set. Since both PDP and channel estimation requires the result from the other one, an iterative joint algorithm is proposed. The proposed algorithm is evaluated by the NMSE of the second-order statistic of PDP and the channel MSE. Simulation results show that the performance of PDP estimation is better than that of existing LS-based methods in different scenarios. The NMSE of RMS can approach CRLB after a limited number of iterations. The results of this paper can be used to obtain accurate PDP results and improve the performance of channel estimators in OFDM systems. As for future extensions of this work, we will consider further performance improvement by the joint estimation process of channel response and Doppler spread. By generating the candidate set of channel correlation function and choosing the best-matched correlation function vector, the channel frequency response estimated by LMSME can be used to estimate the Doppler spread. Therefore, the estimated results can update the channel correlation matrix. A joint estimation process of channel and the Doppler spectrum can also be obtained. This approach is appealing especially in adaptive OFDM systems.