Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (2)

Search Parameters:
Keywords = two-channel immersive audio

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 1461 KiB  
Article
Two-Stage Unet with Gated-Conv Fusion for Binaural Audio Synthesis
by Wenjie Zhang, Changjun He, Yinghan Cao, Shiyun Xu and Mingjiang Wang
Sensors 2025, 25(6), 1790; https://doi.org/10.3390/s25061790 - 13 Mar 2025
Viewed by 636
Abstract
Binaural audio is crucial for creating immersive auditory experiences. However, due to the high cost and technical complexity of capturing binaural audio in real-world environments, there has been increasing interest in synthesizing binaural audio from monaural sources. In this paper, we propose a [...] Read more.
Binaural audio is crucial for creating immersive auditory experiences. However, due to the high cost and technical complexity of capturing binaural audio in real-world environments, there has been increasing interest in synthesizing binaural audio from monaural sources. In this paper, we propose a two-stage framework for binaural audio synthesis. Specifically, monaural audio is initially transformed into a preliminary binaural signal, and the shared common portion across the left and right channels, as well as the distinct differential portion in each channel, are extracted. Subsequently, the POS-ORI self-attention module (POSA) is introduced to integrate spatial information of the sound sources and capture their motion. Based on this representation, the common and differential components are separately reconstructed. The gated-convolutional fusion module (GCFM) is then employed to combine the reconstructed components and generate the final binaural audio. Experimental results demonstrate that the proposed method can accurately synthesize binaural audio and achieves state-of-the-art performance in phase estimation (Phase-l2: 0.789, Wave-l2: 0.147, Amplitude-l2: 0.036). Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

17 pages, 1584 KiB  
Article
Immersive Two-Channel Recordings Based on Personalized BRIRs and Their Applications in Industry
by Kaja Kosmenda and Witold Mickiewicz
Appl. Sci. 2024, 14(24), 11724; https://doi.org/10.3390/app142411724 - 16 Dec 2024
Viewed by 1132
Abstract
The realm of immersive sound technologies in modern industries is evolving every day. Two-channel recordings using personalized HRIRs or BRIRs, which are tailored to the unique anatomical features of individual listeners, significantly enhance the spatial accuracy and naturalness of sound, providing a highly [...] Read more.
The realm of immersive sound technologies in modern industries is evolving every day. Two-channel recordings using personalized HRIRs or BRIRs, which are tailored to the unique anatomical features of individual listeners, significantly enhance the spatial accuracy and naturalness of sound, providing a highly immersive auditory experience. This paper discusses the importance of immersive sound and the externalization effect in recreating the acoustic environment. The paper also presents techniques for obtaining two-channel immersive renderings in a few different ways. The main focus is the integration of immersive audio in new technologies in the wide-ranging audio industry, from telecommunication, through applications for musicians, virtual reality scenarios, and hearing devices. In summary, this paper highlights the huge potential of personalized BRIRs in creating immersive two-channel recordings, offering substantial benefits across various industries by improving the realism and effectiveness of each auditory experience on its own. Full article
(This article belongs to the Special Issue Digital Technologies Enabling Modern Industries)
Show Figures

Figure 1

Back to TopTop