Sound Source Localization for Unmanned Aerial Vehicles in Low Signal-to-Noise Ratio Environments
Abstract
1. Introduction
- Aiming at the poor performance of the traditional wideband MUSIC algorithm in estimating the DOA of UAV sound sources at low signal-to-noise ratios, we propose the MUSIC pseudo-spectral normalized mean processing technique to improve the DOA performance of the wideband MUSIC algorithm.
- Aiming at the problem that the DOA resolution of traditional time delay estimation algorithms is limited with poor performance under low signal-to-noise ratio conditions, we design a DOA estimation algorithm for UAV sound sources based on a time delay estimation neural network.
- In the experimental part, we compare the DOA algorithm performance with the localization simulation results through a large number of simulation experiments and actual experiments, respectively, as a way to verify the feasibility and robustness of the proposed method in this paper.
2. Related Works
3. Microphone Array Signal Processing Technology
3.1. Microphone Array Signal Model
3.2. Conventional DOA Algorithm for Microphone Arrays
3.2.1. Intercorrelation-Based Delay Estimation
3.2.2. Beam Fouling
3.2.3. Super-Resolution Estimation
4. Our Methods
4.1. Problems Description and Analysis
4.1.1. Sound Field Modeling
4.1.2. Microphone Array Construction
4.2. Doa Algorithm Based on Normalized Mean Pseudo-Spectrum MUSIC
- Input an audio matrix, where M is the number of microphones and T is the length of a single channel of audio.
- For each column vector of the audio matrix in the first step is calculated;
- For each column corresponding to , the eigenvalues are used to decompose the noise subspace ;
- Using the noise subspace from step 3 and Equation (21), compute ;
- For , the omnidirectional power vector is computed first, then ;
- Repeat step 5 for ;
- The results obtained in steps 5 and 6 are concatenated into vectors to obtain the total pseudo-spectrum;
- The frequency axis of the total pseudo-spectrum is converted to an angular axis using for spectral peak search.
4.3. DOA Algorithm for UAV Sound Source Based on Time Delay Estimation Neural Network
4.3.1. Modeling the Delay Estimation Problem
4.3.2. Delay Estimation Network Construction
4.3.3. Network Training
- The UAV noise has a total of 1332 audios, which are divided into training and test sets at a ratio of about 0.7, 0.3;
- The were set to 0:0.1:1 for a total of 11, traversing angles 0°–180°, and 932 training set audios were used at each angle to generate the corresponding (of size 11 × 49) and used as labels. There are a total of 168,692 and ;
- Repeating step 2 on the test set yields a total of 72,400 and ;
- Completed.
5. Analysis of Experimental Results
5.1. Simulation Experiment Analysis
5.1.1. DOA Algorithm Performance Comparison
- Simulation of each DOA estimation algorithm for microphone arrays shooting in different directions
- Simulation of each DOA estimation algorithm with different signal-to-noise ratios
- Effect of audio length on noise immunity of DOA estimation algorithm
5.1.2. Localization Simulation
5.2. Practical Experimental Analysis
5.2.1. Experimental Environment and System Construction
5.2.2. Localization Experiments and Analysis
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Hilal, A.A.; Mismar, T. Drone Positioning System Based on Sound Signals Detection for Tracking and Photography. In Proceedings of the 2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Vancouver, BC, Canada, 4–7 November 2020; pp. 8–11. [Google Scholar]
- Yamada, T.; Itoyama, K.; Nishida, K.; Nakadai, K. Outdoor evaluation of sound source localization for drone groups using microphone arrays. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; pp. 9296–9301. [Google Scholar]
- Zhang, H.; Li, T.; Li, Y.; Li, J.; Dobre, O.A.; Wen, Z. RF-Based Drone Classification Under Complex Electromagnetic Environments Using Deep Learning. IEEE Sens. J. 2023, 23, 6099–6108. [Google Scholar] [CrossRef]
- Coluccia, A.; Fascista, A.; Schumann, A.; Sommer, L.; Dimou, A.; Zarpalas, D.; Akyon, F.C.; Eryuksel, O.; Ozfuttu, K.A.; Altinuc, S.O.; et al. Drone-vs-Bird Detection Challenge at IEEE AVSS2021. In Proceedings of the 2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Washington, DC, USA, 16–19 November 2021; pp. 1–8. [Google Scholar]
- Chiper, F.-L.; Martian, A.; Vladeanu, C.; Marghescu, I.; Craciunescu, R.; Fratu, O. Drone Detection and Defense Systems: Survey and a Software-Defined Radio-Based Solution. Sensors 2022, 22, 1453. [Google Scholar] [CrossRef] [PubMed]
- Ezuma, M.; Erden, F.; Anjinappa, C.K.; Ozdemir, O.; Guvenc, I. Micro-UAV Detection and Classification from RF Fingerprints Using Machine Learning Techniques. In Proceedings of the 2019 IEEE Aerospace Conference, Big Sky, MT, USA, 2–9 March 2019; pp. 1–13. [Google Scholar]
- Medaiyese, O.O.; Syed, A.; Lauf, A.P. Machine Learning Framework for RF-Based Drone Detection and Identification System. In Proceedings of the 2021 2nd International Conference On Smart Cities, Automation & Intelligent Computing Systems (ICON-SONICS), Tangerang, Indonesia, 12–13 October 2021; pp. 58–64. [Google Scholar]
- Semkin, V.; Yin, M.; Hu, Y.; Mezzavilla, M.; Rangan, S. Drone Detection and Classification Based on Radar Cross Section Signatures. In Proceedings of the 2020 International Symposium on Antennas and Propagation (ISAP), Osaka, Japan, 25–28 January 2021; pp. 223–224. [Google Scholar]
- Ezuma, M.; Anjinappa, C.K.; Funderburk, M.; Guvenc, I. Radar Cross Section Based Statistical Recognition of UAVs at Microwave Frequencies. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 27–46. [Google Scholar] [CrossRef]
- Nalamati, M.; Kapoor, A.; Saqib, M.; Sharma, N.; Blumenstein, M. Drone Detection in Long-Range Surveillance Videos. In Proceedings of the 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan, 18–21 September 2019; pp. 1–6. [Google Scholar]
- Shi, X.; Yang, C.; Xie, W.; Liang, C.; Shi, Z.; Chen, J. Anti-Drone System with Multiple Surveillance Technologies: Architecture, Implementation, and Challenges. IEEE Commun. Mag. 2018, 56, 68–74. [Google Scholar] [CrossRef]
- Svanström, F.; Englund, C.; Alonso-Fernandez, F. Real-Time Drone Detection and Tracking With Visible, Thermal and Acoustic Sensors. In Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 10–15 January 2021; pp. 7265–7272. [Google Scholar]
- Ganti, S.R.; Kim, Y. Implementation of detection and tracking mechanism for small UAS. In Proceedings of the 2016 International Conference on Unmanned Aircraft Systems (ICUAS), Arlington, VA, USA, 7–10 June 2016; pp. 1254–1260. [Google Scholar]
- Behera, D.K.; Raj, A.B. Drone Detection and Classification using Deep Learning. In Proceedings of the 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 13–15 May 2020; pp. 1012–1016. [Google Scholar]
- Guvenc, I.; Koohifar, F.; Singh, S.; Sichitiu, M.L.; Matolak, D. Detection, Tracking, and Interdiction for Amateur Drones. IEEE Commun. Mag. 2018, 56, 75–81. [Google Scholar] [CrossRef]
- Aydın, İ.; Kızılay, E. Development of a new light-weight convolutional neural network for acoustic-based amateur drone detection. Appl. Acoust. 2022, 193, 108773. [Google Scholar] [CrossRef]
- Chung, M.A.; Chou, H.C.; Lin, C.W. Sound localization based on acoustic source using multiple microphone array in an indoor environment. Electronics 2022, 11, 890. [Google Scholar] [CrossRef]
- Al-Emadi, S.; Al-Ali, A.; Mohammad, A.; Al-Ali, A. Audio Based Drone Detection and Identification using Deep Learning. In Proceedings of the 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco, 24–28 June 2019; pp. 459–464. [Google Scholar]
- Knapp, C.; Carter, G. The generalized correlation method for estimation of time delay. IEEE Trans. Acoust. Speech Signal Process. 1976, 24, 320–327. [Google Scholar] [CrossRef]
- DiBiase, J.H.; Silverman, H.F.; Brandstein, M. Robust Localization in Reverberant Rooms; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar]
- Dmochowski, J.P.; Benesty, J.; Affes, S. Broadband Music: Opportunities and Challenges for Multiple Source Localization. In Proceedings of the 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, 21–24 October 2007; pp. 18–21. [Google Scholar]
- Chakrabarty, S.; Habets, E.A.P. Broadband doa estimation using convolutional neural networks trained with noise signals. In Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 15–18 October 2017; pp. 136–140. [Google Scholar]
- Perotin, L.; Serizel, R.; Vincent, E.; Guérin, A. CRNN-based Joint Azimuth and Elevation Localization with the Ambisonics Intensity Vector. In Proceedings of the 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC), Tokyo, Japan, 17–20 September 2018; pp. 241–245. [Google Scholar]
- Adavanne, S.; Politis, A.; Virtanen, T. Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network. In Proceedings of the 2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, 3–7 September 2018; pp. 1462–1466. [Google Scholar]
- Vera-Diaz, J.M.; Pizarro, D.; Macias-Guarasa, J. Towards end-to-end acoustic localization using deep learning: From audio signals to source position coordinates. Sensors 2018, 18, 3418. [Google Scholar] [CrossRef] [PubMed]
- Salvati, D.; Drioli, C.; Foresti, G.L. Time delay estimation for speaker localization using cnn-based parametrized gcc-phat features. In Proceedings of the Conference of the International Speech Communication Association, Brno, Czechia, 30 August–3 September 2021. [Google Scholar]
- Diaz-Guerra, D.; Miguel, A.; Beltran, J.R. Robust sound source tracking using srp-phat and 3d convolutional neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 2020, 29, 300–311. [Google Scholar] [CrossRef]
- Case, E.E.; Zelnio, A.M.; Rigling, B.D. Low-cost acoustic array for small uav detection and tracking. In Proceedings of the 2008 IEEE National Aerospace and Electronics Conference, Dayton, OH, USA, 16–18 July 2008; pp. 110–113. [Google Scholar]
- Yang, B.; Matson, E.T.; Smith, A.H.; Dietz, J.E.; Gallagher, J.C. Uav detection system with multiple acoustic nodes using machine learning models. In Proceedings of the 2019 Third IEEE International Conference on Robotic Computing (IRC), Naples, Italy, 25–27 February 2019. [Google Scholar]
- Shi, Z.; Chang, X.; Yang, C.; Wu, Z.; Wu, J. An acoustic-based surveillance system for amateur drones detection and localization. IEEE Trans. Veh. Technol. 2020, 69, 2731–2739. [Google Scholar] [CrossRef]
- Carter, G.C. Coherence and time delay estimation. Proc. IEEE 1987, 75, 236–255. [Google Scholar] [CrossRef]
- Schmidt, R. Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag. 1986, 34, 276–280. [Google Scholar] [CrossRef]
- Tao, T.; Zheng, H.; Yang, J.; Guo, Z.; Zhang, Y.; Ao, J.; Chen, Y.; Lin, W.; Tan, X. Sound Localization and Speech Enhancement Algorithm Based on Dual-Microphone. Sensors 2022, 22, 715. [Google Scholar] [CrossRef] [PubMed]
- Yousaf, J.; Zia, H.; Alhalabi, M.; Yaghi, M.; Basmaji, T.; Shehhi, E.A.; Gad, A.; Alkhedher, M.; Ghazal, M. Drone and controller detection and localization: Trends and challenges. Appl. Sci. 2022, 12, 12612. [Google Scholar] [CrossRef]
- Salvati, D.; Drioli, C.; Foresti, G.L. Incoherent Frequency Fusion for Broadband Steered Response Power Algorithms in Noisy Environments. IEEE Signal Process. Lett. 2014, 21, 581–585. [Google Scholar] [CrossRef]
- Scheibler, R.; Bezzam, E.; Dokmanić, I. Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018; pp. 351–355. [Google Scholar]
























| Input Size | Initial Learning Rate | Optimizer | Gradient Decay Factor | Squared GDF | 
|---|---|---|---|---|
| [129,8] | 0.00001 | Adam | 0.9 | 0.999 | 
| Operating System | Central Processing Unit (CPU) | Graphics Processing Unit (GPU) | CUDA Version | 
|---|---|---|---|
| Windows Server 2019 | Intel Xeon 8255C | Tesla T4 | 11.6 | 
| Total Params | Total Memory | Total MAdd | Total Flops | Total MemR + W | 
|---|---|---|---|---|
| 1,853,633 | 7.29 MB | 210.78 MMAdd | 105.82 MFlops | 17.65 MB | 
| Azimuthal Axis | X-Axis | Y-Axis | Z-Axis | ||
|---|---|---|---|---|---|
| SNR = 0 dB | Standard deviation | GCC-PHAT | 0.148 | 0.291 | 0.258 | 
| PCNN | 0.056 | 0.102 | 0.092 | ||
| Average value | GCC-PHAT | 0.443 | 0.482 | 0.424 | |
| PCNN | 0.351 | 0.131 | 0.387 | ||
| t-test | t | −5.493 | −10.707 | −1.3 | |
| p-val | |||||
| SNR = −10 dB | Standard deviation | GCC-PHAT | 0.131 | 0.213 | 0.183 | 
| PCNN | 0.052 | 0.144 | 0.073 | ||
| Average value | GCC-PHAT | 0.539 | 0.381 | 0.369 | |
| PCNN | 0.376 | 0.256 | 0.383 | ||
| t-test | t | −8.09 | −3.399 | 0.499 | |
| p-val | |||||
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, S.; Zheng, Y.; Ye, K.; Cao, H.; Zhang, X.; Sun, H. Sound Source Localization for Unmanned Aerial Vehicles in Low Signal-to-Noise Ratio Environments. Remote Sens. 2024, 16, 1847. https://doi.org/10.3390/rs16111847
Wu S, Zheng Y, Ye K, Cao H, Zhang X, Sun H. Sound Source Localization for Unmanned Aerial Vehicles in Low Signal-to-Noise Ratio Environments. Remote Sensing. 2024; 16(11):1847. https://doi.org/10.3390/rs16111847
Chicago/Turabian StyleWu, Sheng, Yijing Zheng, Kun Ye, Hanlin Cao, Xuebo Zhang, and Haixin Sun. 2024. "Sound Source Localization for Unmanned Aerial Vehicles in Low Signal-to-Noise Ratio Environments" Remote Sensing 16, no. 11: 1847. https://doi.org/10.3390/rs16111847
APA StyleWu, S., Zheng, Y., Ye, K., Cao, H., Zhang, X., & Sun, H. (2024). Sound Source Localization for Unmanned Aerial Vehicles in Low Signal-to-Noise Ratio Environments. Remote Sensing, 16(11), 1847. https://doi.org/10.3390/rs16111847
 
        




 
       