Abstract
Asymmetric windows are of increasing interest to researchers because of the nonlinear and adjustable phase response, as well as alterable time delay. Short-time phase distortion can provide an essential improvement in speech coding, and also has better performance in speech recognition. The merits of asymmetric windows in the aspect of spectral behaviors have an important function in frequency component detection and parameter estimation. In this paper, the phase response of windows were further studied, and the phase characteristics of symmetric and asymmetric windows are described. The relationship between the barycenter of windows in the time domain, and the phase characteristic at the center of the main lobe in the frequency domain, was established. In light of the relationship, an improved version of the asymmetric window- based frequency estimation algorithm was proposed. The improved algorithm has advantages of straightforward implementation and computational efficiency. The numeric simulation results also indicate that the improved approach is more robust than the traditional method against additive random noise.
1. Introduction
Windows are a kind of weighting function that are introduced to reduce the spectral leakage in spectrum analysis []. Classic windows have constant time delay and linear phase response due to the symmetry and time shifting causality-imposed in the time domain []. In recent years, asymmetric windows have attracted increasing concerns due to the nonlinear and adjustable phase response, as well as alterable time delay []. Short-time phase distortion can lead to better recognition in speech processing and bring a lot of advantages in speech coding [,,,,]. These asymmetric windows with extended discrimination capacity are very good candidates for the detection of closely spaced frequency components []. The characteristics of the adjustable spectral phase in asymmetric windows also play an important role in parameter estimation []. However, it should be pointed out that most of the past literature, so far, has been mainly focused on the construction of asymmetric windows or the amplitude response [,,]. Although the alterable phase response of asymmetric windows is of great benefit to scientific fields or engineering applications, such as speech recognition and parameter estimation, there have been relatively few studies devoted to the theoretical analysis of phase response.
For example, when asymmetric windows are applied in phase difference-based frequency estimation implemented by discrete Fourier Transform (DFT), an accurate frequency estimate can be obtained through only one sample frame []. The critical point is to find the abscissa where the phase difference is zero, because the intersection of the phase lines of a symmetric window and its corresponding asymmetric window is always at the center of the main lobe []. Since it is difficult to find an analytical solution, it becomes a numeric root-finding problem. In [], the secant method was suggested to approximate the normalized theoretical frequency. However, it is required to compute additional Fourier coefficients in each iteration, where intensive computational effort is involved and, thus, poses a heavy burden on the overall cost of the algorithm. This problem extremely weakens the application of the estimation algorithm in the presence of noise, especially in the situation where real-time computation is demanded. If the slope of the secant lines can be computed steadily in advance, the amount of calculation can be sharply decreased because of fast convergence. For a symmetric window, the phase response can be easily worked out, while, for an asymmetric window, it is difficult due to the nonlinearity and alterability. In this paper, our aim was to study the phase characteristics of asymmetric windows and deduce the inner relationship between samples of window in the time domain and its slope of the phase response in the frequency domain.
This paper is organized as follows. In Section 2, the phase response of symmetric windows, in the form of continuous time, is explored, and the DFT-even window and true-even window in the form of discrete time are discussed. In Section 3, the phase characteristics of asymmetric windows are studied, and the relationship between the barycenter of windows and the phase characteristic is established. In Section 4, an improved version of the phase difference-based frequency estimation algorithm is proposed by taking advantage of the phase characteristic of asymmetric windows. A numerical comparison study between the improved approach and the traditional approach is also performed in this section. In Section 5, the main conclusions are summarized.
2. Phase Response of Symmetric Windows
2.1. Phase Response of Symmetric Windows
Considering a window function , the time interval of is T. The Fourier transform of the window can be expressed as []
Since is even and real, is a real even function. Obviously, the phase is 0 or , depending on the sign of . If a time shift on the window function is T/2, the Fourier transform of the time-shifted window function would be []
Accordingly, the phase is
It is indicated that a time shift on a window function would introduce a linear phase shift.
2.2. Phase Response of Sampled Symmetric Windows
As a symmetric window in the form of continuous time is an even function about the origin, the discrete window with an odd number of points N is also presented as symmetrical. The overall points include (N − 1)/2 points on each side, and a peak point in the middle of the window. For an even number of points, a discrete window is obtained after the right endpoint is discarded. This is DFT-even, referred to in []. In that paper, Harris also mentioned that the window with the absolute even symmetry, rather than that with the implied symmetry with the missing endpoint, was designed by much of the literature and incorporated in many software libraries []. However, whether all the DFT-even windows have a strictly linear phase is still unknown, and so far, the differences and relationships between the phase of DFT-even windows and that of absolute-even windows have not been analyzed. To answer these questions, a new and uniform perspective of the process from a continuous symmetric window to a discrete window will be presented in the latter part of this section. We used the triangle window as an example. The triangle window in the form of continuous time can be written as
the period of which is T. The interval can be separated into N slices by N + 1 points. A discrete window with an odd or even number of points can be obtained by keeping or removing the last point. This is the routine operation. Now, we consider a sampling train , which can be shifted by a coefficient . It is described as
where is sample interval; and is the shifting factor between zero and . By making equal to and equal to , a discrete triangle window sampled from the continuous one with points is obtained. Given the radix-2 operations were implemented in fast Fourier Transform (FFT), we only discuss the discrete window with an even number of points in detail. The discrete triangle window can be described as
where . The value of each point in the discrete triangle window is dependent upon the parameter . In this paper, special cases with and were considered. When , it is exactly the same as the DFT-even window. When , the window is absolutely symmetric. Two cases of a triangle window with 32 samples are shown in Figure 1.

Figure 1.
(a) DFT-even triangle window; (b) True-even triangle window.
In the two cases, the results of DFT were all real, because of the symmetry and real in the time domain. As a result, after the windows were shifted, both of them had a linear phase, with the difference being that the slope for the DFT-even window was (see Appendix A) and the slope for the true-even window was (see Appendix B). This is because the number of shifting points was N/2 for the DFT-even window and was (N − 1)/2 for the true-even window. Figure 2, Figure 3 and Figure 4 show the normalized log-magnitude response (NLMR), the normalized phase response (NPR), and the derivative of normalized phase response (DNPR). The results coincided well with the theoretical analysis.

Figure 2.
Normalized log-magnitude response of triangle window.

Figure 3.
Normalized phase response of triangle window.

Figure 4.
Derivative of normalized phase of triangle window.
When it comes to windows with non-zero endpoints, things get more complex. For example, the Hamming window can be written as
When = 0.5, the phase slope for the true-even window is still . However, when = 0, the DFT-even window has a nonlinear, but approximately linear phase, due to loss of symmetry in the time domain. For an enough large N, the slope of the phase gets close to . For the two cases of Hamming window with 32 samples, the normalized log-magnitude response, the normalized phase response, and the derivative of normalized phase are shown in Figure 5, Figure 6, Figure 7 and Figure 8, respectively. It is clear that all absolute-even ( = 0.5) windows have a strictly linear phase. The phase for some DFT-even ( = 0) windows are strictly linear, and some are not, which is decided by the value of the endpoint.

Figure 5.
(a) DFT-even Hamming window; (b) True-even Hamming window.

Figure 6.
Normalized log-magnitude response of Hamming window.

Figure 7.
Normalized phase response of Hamming window.

Figure 8.
Derivative of normalized phase of Hamming window.
3. Phase Response of Asymmetric Windows
3.1. Phase Response of Asymmetric Windows
For a symmetric window, the symmetric center is the peak point. The slope of phase is determined by the causality-imposed time shift. Since all symmetric windows have a constant time delay T/2, they have the same phase response []. Before further discussion, we introduced an indicator, , called the normalized time delay factor (NTDF), to evaluate the time delay of different lengths of windows. This is defined as the time delay of a window scaled by the interval that the window covers.
Obviously, we have and the NTDF of all symmetric windows is 1/2. For an asymmetric window, we would also like to find the NTDF. If we denote as the phase response of a time-shifting asymmetric window , according to the Taylor series expansion, one can have
The phase response in the main lobe, and especially the phase response near the main lobe center is of more significance, as it has a decisive influence on the phase distortion of the tapered signal [,,]. The problem is more notable for the FFT of a discrete time signal, due to the picket fence effect [,]. The phase distortion is determined by the frequency deviation and the variation of phase response near the main lobe center of the adopted window []. As a result, expanding around the center in the main lobe with , one can reach
Since the weighting function is real, . Equation (10) can be simplified as
Equation (11) indicates that the phase near the center of the main lobe can be approximated by two parts: the linear part and the higher order part. According to a similar concept used in symmetric windows, we introduced the general time delay (GTD) in asymmetric windows. Assuming denotes the general time delay, and considering the relationship between time delay and the linear part of phase, we have
The GTD of an asymmetric window can be computed by
Considering the substitution (See Appendix C), the GTD can be finally expressed as
Equation (14) indicates that the general time delay is solely determined by the barycenter of the weighting factors and that window function can be adjusted in the time domain to change the time delay. Specifically, the time delay of all symmetric windows is T/2, as the barycenter is located at the symmetric center. Equation (14) not only gives a mathematical analysis of the time delay, but also implies that the phase characteristics around the center of the main lobe can be quantitatively adjusted. According to Appendix C, the derivative of the phase in the main lobe center can be computed by
3.2. Phase Response of Sampled Asymmetric Windows
Similar to the asymmetric windows in the form of continuous time, the discrete time asymmetric windows have the same property. If denotes the general time delay, for the asymmetric windows , the derivative of the phase at the center of the main lobe (DPCM) can be calculated by
where denotes the normalized GTD. According to the conclusion in Appendix D, the normalized GTD can be finally expressed as
It should be pointed out that for an asymmetric window, the asymmetry due to the discrete sampling discussed in Section 2.2 is much less critical when compared with the asymmetry of itself, and Equation (16) is valid for all sampling cases.
4. Applications and Numerical Simulations
4.1. Application in Frequency Estimation
It has been demonstrated that if asymmetric windows are introduced in frequency estimation, some inherent defects in the traditional parameter estimation algorithms can be overcome []. For example, by replacing symmetric windows with proper asymmetric windows, frequency estimation can be achieved by only one segment of samples, rather than two segments in the traditional phase difference-based algorithms, thus yielding better performance in the estimation of closely spaced frequency components []. Furthermore, the resistance of traditional algorithms against additive noise can be improved with the proper selection of asymmetric windows. However, it should also be stressed that the improvement is at the expense of a remarkable increase in computational burden as the simple relationship between the normalized frequency deviation and the phase difference disappears, due to the nonlinear phase of asymmetric windows. The calculation of estimate becomes a root-finding problem, and the secant method was suggested in [] to avoid the computation of a derivative in Newton’s method. The secant method is defined by the recurrence relation
where denotes the normalized frequency estimate in each iteration; denotes the phase difference at frequency ; and is defined as
It should be pointed out that we have the relationship that
where , and are the DPCM of the two adopted windows, respectively. Moreover, and vary slightly around the center in the main lobe and the frequency estimate returned by coarse search is also near the center of the modulated main lobe. Accordingly, we can rewrite the recurrence relation by replacing with .
Equation (21) avoids the calculation of the finite difference . While, in Equation (18) is required to be computed in each iteration and more importantly, may become unstable once intensive random noise is involved. In contrast, is only dependent on the adopted windows and is, consequently, a constant value in each iteration. The frequency components and random noise have no influence on , and thus can be computed in advance once the window is determined. As a result, Equation (21) becomes more robust and more efficient than Equation (18) in the presence of interference or random noise. In addition, the requirement of memory for the implementation of Equation (21) is also less than that of Equation (18). All those features make it well suited for application in practice engineering. Finally, the improved estimation algorithm can be summarized as follows:
- (1)
- Obtain the first discrete-time signal , where is the raw sequence with N samples and is a symmetric window.
- (2)
- Obtain the second sequence , where is the corresponding asymmetric window.
- (3)
- Perform a coarse search on , where is FFT of , to determine the bin number l with the largest magnitude.
- (4)
- Calculate the window-related coefficient as well as the phase difference between the two sequences , where is the DFT coefficient of .
- (5)
- Return the coarse estimate .
- (6)
- Obtain the further refined estimate by the iterations .
4.2. Simulation and Results
In order to evaluate the effectiveness of Equation (21) as well as compare it with the traditional algorithm, some simulations were designed. For simplicity but without loss of generality, a single discrete complex exponential was considered, which can be generated according to Equation (22).
In Equation (22), the amplitude was set to 1. Both the sampling rate and the number of analyzed samples N was set to 256, resulting in a frequency resolution of . First, we considered the cases free of noise. The frequency scanned from 63.5 to 64.5 in steps of 0.05. For each frequency, the phase scanned from to in steps of , and the maximum absolute estimation error between the estimate and the theoretical value was selected as the result. All frequency estimates were scaled by frequency resolution and expressed in frequency bin. The adopted asymmetric windows were [t]2-windows defined in [], which was constructed by the truncation method. The remaining errors for the Kaiser–Bessel window, Gaussian window, Hanning window, Hamming window, and Blackman window were reported in Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13 as a function of frequency deviation. As can be seen from these figures, the asymmetric windows-based phase difference method had good compatibility with different window functions. The maximum absolute estimation errors for the classic method with m = 0 were about 10−3~10−2. By performing an iteration, the level decreased to 10−5~10−4. Without iteration, the improved method had a similar performance to the classic method but attained a worst case at . However, after an iteration, the errors sharply decreased and were all below 10−5. The remaining errors could be completely removed after two iterative operations even for the case where . As a result, in the absence of noise, an iteration in the proposed algorithm was enough to achieve a high accurate result.

Figure 9.
Frequency errors of the classic and the improved secant method (Kaiser–Bessel window, ).

Figure 10.
Frequency errors of the classic and the improved secant method (Gaussian window, ).

Figure 11.
Frequency errors of the classic and the improved secant method (Hanning window).

Figure 12.
Frequency errors of the classic and the improved secant method (Hamming window).

Figure 13.
Frequency errors of the classic and the improved secant method (Blackman window).
Now, we considered that Equation (22) was contaminated with an additive Gaussian random noise z(n). The parameter settings were the same as Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13, except that the phase was randomly selected. The variance of additive white Gaussian noise was set to , indicating signal-to-noise ratio (SNR) equal to 10 dB. SNR expressed in dB was defined as
An amount of 100,000 instances of such test signals were generated for each frequency, and the root-mean-square error (RMSE) was evaluated. The simulation results for the five considered windows are shown in Figure 14, Figure 15, Figure 16, Figure 17 and Figure 18. For m = 0, the classic secant method had good performance in the vicinity of , however, it showed a much worse performance in the rest of the cases. After an iteration, the RMSEs of the classic secant method declined notably, but the fluctuation could be still observed. If m = 2, the values can be maintained at the same level for the whole range of frequency deviation. In contrast, the improved secant method had a more rapid convergence and the errors stayed steady for different frequency deviations after only one iteration. The proposed approach did not seem to capture a substantial improvement by performing another iteration after the first iteration. The benefit from more iterations was limited, but each iteration required a calculation of two DFT coefficients, which would increase the overall computational effort. Hence, it can be concluded that if a moderate noise is involved, performing one iteration after the coarse compensation in step (5) is enough for the improved secant method, as the errors due to different frequency deviation can be eliminated.

Figure 14.
RMSEs of the classic and the improved secant method (SNR = 10 dB, Kaiser–Bessel window, ).

Figure 15.
RMSEs of the classic and the improved secant method (SNR = 10 dB, Gaussian window, ).

Figure 16.
RMSEs of the classic and the improved secant method (SNR = 10 dB, Hanning window).

Figure 17.
RMSEs of the classic and the improved secant method (SNR = 10 dB, Hamming window).

Figure 18.
RMSEs of the classic and the improved secant method (SNR = 10 dB, Blackman window).
We also investigated the RMSEs of the classic and the improved secant method for a wide range of SNRs from 0 dB to 180 dB with a step of 10 dB. The normalized frequency, as well as the phase, were randomly chosen. The other simulation parameters were kept invariant. As can be seen in Figure 19, Figure 20, Figure 21, Figure 22 and Figure 23, for some conditions, the RMSEs may become flat after certain SNR as the bias becomes the dominant source of error. In the case where enough iterations were performed, the remaining bias could be completely eliminated, and the RMSEs were linear for the entire range of SNR. The figures were very informative regarding the trade-off between the bias and variance for different iterations. Without iterations, the classic method was first saturated at SNR ≈ 10 dB, followed by the improved method. If an iteration was performed, the values for the improved method sharply decreased, and the RMSEs were much smaller and less than 10−6. However, a high level of RMSEs remained for the classic method. If two iterations were performed, the RMSEs of the improved version would be merely dependent on the random noise, while for the classic method, the bias may still be significant when a high SNR was encountered. In most practical situations, the influence of noise is usually more important than the bias. As a result, one iteration was sufficient for the proposed algorithm to produce satisfactory results even in the presence of noise.

Figure 19.
RMSEs of the classic and the improved secant method versus SNR (Kaiser–Bessel window, ).

Figure 20.
RMSEs of the classic and the improved secant method versus SNR (Gaussian window, ).

Figure 21.
RMSEs of the classic and the improved secant method versus SNR (Hanning window).

Figure 22.
RMSEs of the classic and the improved secant method versus SNR (Hamming window).

Figure 23.
RMSEs of the classic and the improved secant method versus SNR (Blackman window).
5. Conclusions
In this paper, the phase characteristics of symmetric and asymmetric windows were studied. It was shown that the phase characteristics were closely related to the time delay of the windows. The quantitative relationship between the barycenter of the windows in the time domain and the phase slope at the main lobe center in the frequency domain was derived. On the basis of the relationship, the traditional phase difference method was improved by a simplified secant method. This was illustrated through simulations where the improved algorithm not only had the merits of simple implementation and computational efficiency, but also had the advantage of robustness against additive random noise as the proposed algorithm with only one additional iteration was sufficient to guarantee an accurate estimate even in the presence of noise.
Author Contributions
J.L., X.L. and S.F. wrote the manuscript. H.X. and X.L. analyzed and interpreted the data. After the manuscript was finished, K.Z. and S.F. reviewed the manuscript.
Funding
This research was funded by the National Natural Science Foundation of China (Grant No. 51705059), funded by the Basic Research and Frontier Technology of Chongqing Science and Technology Commission (Grant Nos. cstc2017jcyjAX0033 and cstc2017jcyjAX0126) and also partly supported by the Project of Science and Technology Research Program of Chongqing Education Commission of China (Grant No. KJ1600428).
Acknowledgments
The authors would like to thank the anonymous reviewers for their valuable comments and suggestions that helped improve the quality of this manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
According to the definition of discrete Fourier transform, we have
Using Euler’s formula , can be rewritten into two parts
where the real part is
and the imaginary part is
The derivatives are, respectively,
and
The phase can be computed by
and its derivative is
The numerator can be rewritten as
and the denominator can be rewritten as
Introducing
Equation (A8) can be simplified to
The second derivative of the phase
If
and
Equation (A13) is simply written as
After some algebraic operations, can be rewritten as
For the DFT-even symmetric windows, it is already known . Assuming
we can obtain
Since
and
Equation (A19) can be further simplified as
As a result, can be simplified to
For those windows with an endpoint equal to zero, it is easy to obtain
Accordingly, we have
and, thus, is a constant. Substituting into Equation (A12) yields
Because of and , is
Appendix B
For the true-even symmetric windows, we have . Similar to the derivations in Appendix A, because of and it is obtained that
With , can be simplified to
Accordingly, we have the same conclusions that and is a constant. Substituting into yields the same result as Equation (A26) and considering the property , we finally obtain
Appendix C
According to the definition of Fourier transform, we have
Using Euler’s formula , can be written into two parts
The real part is
and we have
The imaginary part is
and we have
Taking the derivative of yields
and one can reach
Taking the derivative of yields
and one can reach
The phase can be computed by
Taking the derivative of yields
After some manipulations, we obtain
Finally, we have by taking the limit of with
where stands for the barycenter of .
Appendix D
According to the derivations in Appendix A, we already know that
Taking the limit of with yields
where stands for the barycenter of .
References
- Harris, F.J. On the use of harmonic analysis with the discrete Fourier transform. Proc. IEEE 1978, 66, 51–83. [Google Scholar] [CrossRef]
- Luo, J.; Xie, Z.; Li, X. Asymmetric windows and their application in frequency estimation. J. Algorithms Comput. Technol. 2015, 9, 389–412. [Google Scholar] [CrossRef]
- Rozman, R.; Kodek, D.M. Using asymmetric windows in automatic speech recognition. Speech Commun. 2007, 49, 268–276. [Google Scholar] [CrossRef]
- Florencio, D.A.F. On the use of asymmetric windows for reducing the time delay in real-time spectral analysis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Toronto, AN, Canada, 14–17 April 1991; Volume 3265, pp. 3261–3264. [Google Scholar]
- Florencio, D.A.F. Investigating the use of asymmetric windows in CELP vocoders. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, MN, USA, 27–30 April 1993; pp. 427–430. [Google Scholar]
- Rozman, R.; Kodek, D.M. Improving speech recognition robustness using non-standard windows. In Proceedings of the IEEE Region 8 Eurocon 2003: Computer as a Tool, Ljubljana, Slovenia, 22–24 September 2003; Volume 172, pp. 171–174. [Google Scholar]
- Shannon, B.J.; Paliwal, K.K. Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition. Speech Commun. 2006, 48, 1458–1485. [Google Scholar] [CrossRef]
- Zivanovic, M.; Carlosena, A. On asymmetric analysis windows for detection of closely spaced signal components. Mech. Syst. Signal Process. 2006, 20, 702–717. [Google Scholar] [CrossRef]
- Luo, J.; Xie, M. Phase difference methods based on asymmetric windows. Mech. Syst. Signal Process. 2015, 54–55, 52–67. [Google Scholar] [CrossRef]
- Rabiner, L.R.; Gold, B.; Yuen, C. Theory and Application of Digital Signal Processing; Prentice-Hall: Englewood Cliffs, NJ, USA, 1975. [Google Scholar]
- Huang, D. Phase error in fast fourier transform analysis. Mech. Syst. Signal Process. 1995, 9, 113–118. [Google Scholar]
- Xie, M.; Ding, K. Corrections for frequency, amplitude and phase in a fast fourier transform of a harmonic signal. Mech. Syst. Signal Process. 1996, 10, 211–221. [Google Scholar]
- Luo, J.; Xie, Z.; Xie, M. Frequency estimation of the weighted real tones or resolved multiple tones by iterative interpolation DFT algorithm. Digit. Signal Process. 2015, 41, 118–129. [Google Scholar] [CrossRef]
- Proakis, J.G.; Manolakis, D.G. Digital Signal Processing Principles, Algorithms, and Applications, 3rd ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 1996. [Google Scholar]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).