Fast Computation of LSP Frequencies Using the Bairstow Method
Abstract
1. Introduction
- Avoiding evaluation of trigonometric functions,
- Avoiding complex operations and a fine grid,
- Extracting the roots rapidly and accurately.
2. Line Spectral Pair (LSP) Frequencies
2.1. LPC Polynomial
2.2. General-Form Polynomial Transformation for the Computation of LSP Frequencies
3. Polynomial Solution Based on the Bairstow Method
4. Experiment and Results
4.1. Experimental Environment
4.2. Selection and Update of Initial Values
4.3. Solving the 10th-Order LSP Using the Proposed Method
4.4. Performance of the Proposed Method
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Rabiner, L.R.; Schafer, R.W. Theory and Applications of Digital Speech Processing; Pearson Education: London, UK, 2011; p. 473.
- Makhoul, J. Linear Prediction: A Tutorial Review. Proc. IEEE 1975, 63, 561–580.
- Chowdhury, A.; Ross, A. Fusing MFCC and LPC Features Using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals. IEEE Trans. Inf. Forensics Secur. 2019, 15, 1616–1629.
- Alku, P.; Saeidi, R. The Linear Predictive Modeling of Speech From Higher-Lag Autocorrelation Coefficients Applied to Noise-Robust Speaker Recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 2017, 25, 1606–1617.
- Ramalho, L.; Fonseca, M.N.; Klautau, A.; Lu, C.; Berg, M.; Trojer, E.; Höst, S. An LPC-Based Fronthaul Compression Scheme. IEEE Commun. Lett. 2017, 21, 318–321.
- Itakura, F. Line Spectrum Representation of Linear Predictive Coefficients of Speech Signals. J. Acoust. Soc. Am. 1975, 57, 535.
- Soong, F.K.; Juang, B.H. Optimal quantization of LSP parameters. IEEE Trans. Speech Audio Process. 1993, 1, 15–24.
- Soong, F.K.; Juang, B.H. Line Spectrum Pair (LSP) and Speech Data Compression. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), San Diego, CA, USA, 19–21 March 1984.
- Kang, G.; Fransen, L. Application of line-spectrum pairs to low-bit-rate speech encoders. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Tampa, FL, USA, 26–29 April 1985; p. 8857.
- Kabal, P.; Ramachandran, R.P. The Computation of Line Spectral Frequencies Using Chebyshev Polynomials. IEEE Trans. Acoust. Speech Signal Process. 1986, 34, 1419–1426.
- Wu, C.H.; Chen, J.H. A Novel Two-Level Method for the Computation of the LSP Frequencies Using a Decimation-in-Degree Algorithm. IEEE Trans. Speech Audio Process. 1997, 5, 106–115.
- Chen, S.H.; Chang, Y.; Ruan, J.C. An Efficient Computation of LSP Frequencies Using Modified Complex-Free Ferrari Formula. Signal Process. Syst. 2008, 52, 153–163.
- Chen, S.H.; Chang, Y.; Syuan, C.J.Y. The Computation of Line Spectrum Pair Frequencies Using Tschirnhaus Transform. In Proceedings of the International Symposium on Circuits and Systems, Taipei, Taiwan, 24–27 May 2009; pp. 288–291.
- Chang, C.H.; Chen, B.W.; Chen, S.H.; Wang, J.F.; Chiu, Y.H. Low-Complexity Hardware Design for Fast Solving LSPs With Coordinated Polynomial Solution. IEEE Trans. Large Scale Integr. Syst. 2015, 23, 230–243.
- Luk, W.S. Finding roots of real polynomial simultaneously by means of Bairstow’s method. Bit Numer. Math. 1996, 36, 302–308.
- Hsiao, C.C.; Brodersen, R. A multi-rate root LPC speech synthesizer. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), San Diego, CA, USA, 19–21 March 1984; pp. 41–44.
- Coding of Speech at 8 kbit/s Using Conjugate-Structure Algebraic-Code-Excited Linear Prediction (CS-ACELP); International Telecommunication Union: Geneva, Switzerland, 1996.
- Stoer, J.; Bulirsch, R. Introduction to Numerical Analysis, 2nd ed.; Springer Science & Business Media: Berlin, Germany, 2013; pp. 333–335.
- O’Donnell, J. A System for very low data rate speech communication. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Atlanta, GA, USA, 30 March–1 April 1981; pp. 8–11.
- Rothweiler, J. On polynomial reduction in the computation of LSP frequencies. IEEE Trans. Speech Audio Process. 1999, 7, 592–594.
- Hildebrand, F.B. Introduction to Numerical Analysis, 2nd ed.; McGraw-Hill: New York, NY, USA, 1974; pp. 613–618.
- Wikipedia. Available online: https://en.wikipedia.org/wiki/Insertion_sort (accessed on 15 January 2020).
- Aurora Corpus. Available online: http://portal.elda.org/en/catalogues/free-resources/free-lrs-set-1/ (accessed on 15 January 2020).
- International Telecommunication Union. Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-End Speech Quality Assessment of Narrow-Band Telephone Networks and Speech Codecs; International Telecommunication Union: Geneva, Switzerland, 2001.


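Algorithm listing (19 numbered steps; the mathematical symbols are not recoverable). Steps 1–2 set the initial values and initialize the working variables; steps 3–16 form a while loop that contains a for loop (steps 8–11); steps 17–19 finish the computation and return two values.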
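The symbols of the listing above did not survive, so the following is only a minimal sketch of the textbook Bairstow iteration, not a reconstruction of the authors' algorithm. It assumes the quadratic factor is written as x² − r·x − s (a common textbook convention that may differ in sign from the paper's), and the names `bairstow_factor`, `quadratic_roots`, `r`, `s`, `tol`, and `max_iter` are hypothetical. Each pass performs two synthetic divisions and one 2×2 Newton update of the factor coefficients.

```python
import math


def _synthetic_division(coeffs, r, s):
    """Divide a polynomial by x^2 - r*x - s.

    coeffs are in ascending order (coeffs[i] multiplies x**i).
    Returns b of the same length: b[2:] is the quotient (ascending),
    and the remainder is b[1] * (x - r) + b[0].
    """
    m = len(coeffs) - 1
    b = [0.0] * (m + 1)
    b[m] = coeffs[m]
    b[m - 1] = coeffs[m - 1] + r * b[m]
    for i in range(m - 2, -1, -1):
        b[i] = coeffs[i] + r * b[i + 1] + s * b[i + 2]
    return b


def bairstow_factor(coeffs, r, s, tol=1e-10, max_iter=50):
    """Refine (r, s) until x^2 - r*x - s divides the polynomial.

    Returns (r, s, quotient) with the quotient in ascending order.
    Assumes degree >= 3 and initial values close enough to a true factor.
    """
    if len(coeffs) < 4:
        raise ValueError("polynomial degree must be at least 3")
    for _ in range(max_iter):
        b = _synthetic_division(coeffs, r, s)
        # Second division yields the partial derivatives of the remainder.
        c = _synthetic_division(b[1:], r, s)
        c1, c2, c3 = c[0], c[1], c[2]
        det = c2 * c2 - c1 * c3
        if det == 0.0:
            raise ZeroDivisionError("singular Jacobian; try other initial values")
        # Newton step: solve c2*dr + c3*ds = -b1 and c1*dr + c2*ds = -b0.
        dr = (-b[1] * c2 + b[0] * c3) / det
        ds = (-b[0] * c2 + b[1] * c1) / det
        r += dr
        s += ds
        if abs(dr) + abs(ds) < tol:
            quotient = _synthetic_division(coeffs, r, s)[2:]
            return r, s, quotient
    raise RuntimeError("Bairstow iteration did not converge")


def quadratic_roots(r, s):
    """Real roots of x^2 - r*x - s, larger root first."""
    disc = r * r + 4.0 * s
    if disc < 0.0:
        raise ValueError("quadratic factor has complex roots")
    root = math.sqrt(disc)
    return (r + root) / 2.0, (r - root) / 2.0


if __name__ == "__main__":
    # Illustrative polynomial (x-1)(x-2)(x-3)(x-4), not taken from the paper.
    poly = [24.0, -50.0, 35.0, -10.0, 1.0]
    r, s, quotient = bairstow_factor(poly, 2.9, -1.9)
    print(r, s, quotient, quadratic_roots(r, s))
```

Because the update is a Newton step, the iteration converges quickly only when the initial values are already near a true factor, which is why the choice of initial values matters for the iteration counts. The returned quotient can be passed back to `bairstow_factor` to extract the next pair of roots by deflation.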
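| Frame | Parameter 1 (frame k) | Parameter 1 (frame k+1) | Absolute difference | Parameter 2 (frame k) | Parameter 2 (frame k+1) | Absolute difference |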
|---|---|---|---|---|---|---|
| 1 | −0.244103 | −0.255179 | 0.011077 | −0.311074 | −0.301109 | 0.009965 | 
| 2 | −0.255179 | −0.241979 | 0.013201 | −0.301109 | −0.279433 | 0.021676 | 
| 3 | −0.241979 | −0.238239 | 0.003739 | −0.279433 | −0.276074 | 0.003359 | 
| 4 | −0.238239 | −0.258420 | 0.020180 | −0.276074 | −0.299796 | 0.023722 | 
| 5 | −0.258420 | −0.320360 | 0.061940 | −0.299796 | −0.267679 | 0.032117 | 
| 6 | −0.320360 | −0.350183 | 0.029824 | −0.267679 | −0.246816 | 0.020863 | 
| Statistic | ||
|---|---|---|
| Mean | 0.0419 | 0.0305 | 
| Standard Deviation | 0.0078 | 0.0038 | 
| Initial Method | Initial Value | Average Iteration Numbers: n1 | n2 | n3 | n4 | Total |
|---|---|---|---|---|---|---|
| Fixed | 0 | 4.9 | 5.6 | 10.2 | 6.9 | 27.6 |
| Fixed | 4 | 11.5 | 10.6 | 11.3 | 9.3 | 42.7 |
| Fixed | 8 | 14.1 | 11.5 | 14.0 | 10.3 | 49.9 |
| Statistic | | 4.7 | 5.4 | 5.2 | 6.4 | 21.7 |
| Adaptive | | 3.7 | 3.4 | 5.2 | 3.7 | 16 |
| Stage | Iteration | Parameter 1 (current) | Parameter 1 (updated) | Parameter 2 (current) | Parameter 2 (updated) | Change in parameter 1 | Change in parameter 2 |
|---|---|---|---|---|---|---|---|
| First stage | 1 | −0.702538 | −0.666299 | 0.088680 | 0.071163 | 0.036240 | 0.017517 |
| First stage | 2 | −0.666299 | −0.664132 | 0.071163 | 0.070034 | 0.002167 | 0.001129 |
| First stage | 3 | −0.664132 | −0.664117 | 0.070034 | 0.070034 | 0.000015 | 0.000000 |
| First stage | 4 | −0.664117 | −0.664117 | 0.070034 | 0.070034 | 0.000000 | 0.000000 |
| Second stage | 1 | −0.652314 | −0.623734 | −0.256786 | −0.282222 | 0.028580 | 0.025436 |
| Second stage | 2 | −0.623734 | −0.621934 | −0.282222 | −0.283870 | 0.001801 | 0.001648 |
| Second stage | 3 | −0.621934 | −0.621919 | −0.283870 | −0.283885 | 0.000015 | 0.000015 |
| Second stage | 4 | −0.621919 | −0.621919 | −0.283885 | −0.283885 | 0.000000 | 0.000000 |
| i | Accurate | Approximate | Absolute Error |
|---|---|---|---|
| 1 | −0.792042 | −0.792041 | 0.000001 |
| 2 | 0.928270 | 0.928265 | 0.000005 |
| 3 | 0.641682 | 0.641686 | 0.000004 |
| 4 | −0.395398 | −0.395388 | 0.000001 |
| 5 | 0.175723 | 0.175723 | 0.000000 |
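| i | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| First stage, initial value of parameter 1 | −0.702538 | | | | |
| First stage, initial value of parameter 2 | 0.088680 | | | | |
| Second stage, initial value of parameter 1 | −0.652314 | | | | |
| Second stage, initial value of parameter 2 | −0.256786 | | | | |
| Approximate root | 0.928265 | 0.641686 | 0.175723 | −0.395388 | −0.792041 |
| LSP frequency (rad) | 0.381076 | 0.874102 | 1.394156 | 1.977287 | 2.484941 |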
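The last two rows of the table above are numerically consistent with the usual substitution x = cos ω, so that each frequency is the arccosine of the corresponding root. The snippet below is a small sketch of that final step under this assumption; the function name `roots_to_lsp_frequencies` is hypothetical, and the root values are copied from the table.

```python
import math


def roots_to_lsp_frequencies(roots):
    """Map real roots x_i in [-1, 1] to angular frequencies arccos(x_i).

    The result is sorted in ascending order, in radians within [0, pi].
    """
    return sorted(math.acos(x) for x in roots)


# Approximate roots as listed in the table.
table_roots = [0.928265, 0.641686, 0.175723, -0.395388, -0.792041]
lsp_frequencies = roots_to_lsp_frequencies(table_roots)

if __name__ == "__main__":
    for i, w in enumerate(lsp_frequencies, start=1):
        print(i, round(w, 6))
```

Since arccos is decreasing on [−1, 1], sorting the roots in descending order gives the frequencies in ascending order, which matches the ordering of the two rows.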
| Methods | Test Speech | PESQ |
|---|---|---|
| The proposed method | testa_clean1 | 3.35 |
| The proposed method | testa_n1_snr15 | 3.46 |
| Birge-Vieta method | testa_clean1 | 2.74 |
| Birge-Vieta method | testa_n1_snr15 | 3.36 |
| Methods | Time (ms) | Normalized Clock Numbers | Environment | 
|---|---|---|---|
| The proposed method | 0.0016 | 5920 | 3.7 GHz | 
| Birge-Vieta method | 0.0018 | 6660 | 3.7 GHz | 
| Tschirnhaus Transform | 0.0614 | 104380 | 1.7 GHz |
| Modified Ferrari’s | 0.0802 | 136340 | 1.7 GHz | 
| Original Ferrari’s | 0.1640 | 278800 | 1.7 GHz | 
| Chen and Wu | 0.1651 | 280670 | 1.7 GHz | 
| Soong and Juang | 0.5316 | 903720 | 1.7 GHz | 
| Full Search | 700.32 | 1190540 | 1.7 GHz | 
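The normalized clock numbers in the table above appear to be the run time multiplied by the clock frequency of the test machine, which makes results from the 3.7 GHz and 1.7 GHz environments comparable. The sketch below assumes that relation; it is inferred from the table values rather than stated in the surviving text, and the function name `normalized_clock_count` is hypothetical.

```python
def normalized_clock_count(time_ms, clock_ghz):
    """Convert a run time in milliseconds to clock cycles at clock_ghz.

    time_ms * 1e-3 s  times  clock_ghz * 1e9 cycles/s  =  time_ms * clock_ghz * 1e6.
    """
    return round(time_ms * clock_ghz * 1e6)


if __name__ == "__main__":
    # (method, time in ms, clock in GHz) copied from the table.
    rows = [
        ("The proposed method", 0.0016, 3.7),
        ("Birge-Vieta method", 0.0018, 3.7),
        ("Tschirnhaus Transform", 0.0614, 1.7),
    ]
    for name, time_ms, clock_ghz in rows:
        print(name, normalized_clock_count(time_ms, clock_ghz))
```

This relation reproduces the listed values for every row from the proposed method through Soong and Juang; the Full Search row as printed does not fit it.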
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Xue, Y.; Zhu, Z.; Jiang, J.; Zhan, Y.; Yu, Z.; Fan, X.; Qiao, S. Fast Computation of LSP Frequencies Using the Bairstow Method. Electronics 2020, 9, 387. https://doi.org/10.3390/electronics9030387