This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Deep Learning-Based Multi-Label Classification for Forest Soundscape Analysis: A Case Study in Shennongjia National Park
by
Caiyun Yang
Caiyun Yang 1
,
Xuanxin Liu
Xuanxin Liu 1,
Yiyang Li
Yiyang Li 1 and
Xinwen Yu
Xinwen Yu 1,2,*
1
Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, China
2
Key Laboratory of Forestry Remote Sensing and Information System, National Forestry and Grassland Administration (NFGA), Beijing 100091, China
*
Author to whom correspondence should be addressed.
Forests 2025, 16(6), 899; https://doi.org/10.3390/f16060899 (registering DOI)
Submission received: 24 April 2025
/
Revised: 18 May 2025
/
Accepted: 23 May 2025
/
Published: 27 May 2025
Abstract
Forest soundscapes contain rich ecological information that reflects the composition, structure, and dynamics of biodiversity within forest ecosystems. The effective monitoring of these soundscapes is essential for forest conservation and wildlife management. However, traditional manual annotation methods are time-consuming and limited in scalability, while commonly used acoustic indices such as the Normalized Difference Soundscape Index (NDSI) lack the capacity to resolve overlapping or complex sound sources often encountered in dense forest environments. To overcome these limitations, this study applied a deep learning-based multi-label classification approach to long-term field recordings collected from Shennongjia National Park, a typical subtropical forest ecosystem in China. The model automatically classifies sound sources into biophony, geophony, and anthrophony. Compared to the NDSI, the model demonstrated higher precision and robustness, especially under low-signal-to-noise-ratio conditions. While the NDSI provides an efficient overview of soundscape disturbances, it demonstrates limitations in differentiating geophonic components and detecting subtle variations. This study supports a complementary “macro–micro” analytical framework that enables capturing broad, time-averaged soundscape trends through the NDSI, while achieving fine-grained, label-specific detection of biophony, geophony, and anthrophony through the multi-label classification model. This integration enhances analytical resolution, enabling the scalable, automated monitoring of complex forest soundscapes. This study contributes a novel and adaptable approach for real-time biodiversity assessment and long-term forest conservation.
Share and Cite
MDPI and ACS Style
Yang, C.; Liu, X.; Li, Y.; Yu, X.
Deep Learning-Based Multi-Label Classification for Forest Soundscape Analysis: A Case Study in Shennongjia National Park. Forests 2025, 16, 899.
https://doi.org/10.3390/f16060899
AMA Style
Yang C, Liu X, Li Y, Yu X.
Deep Learning-Based Multi-Label Classification for Forest Soundscape Analysis: A Case Study in Shennongjia National Park. Forests. 2025; 16(6):899.
https://doi.org/10.3390/f16060899
Chicago/Turabian Style
Yang, Caiyun, Xuanxin Liu, Yiyang Li, and Xinwen Yu.
2025. "Deep Learning-Based Multi-Label Classification for Forest Soundscape Analysis: A Case Study in Shennongjia National Park" Forests 16, no. 6: 899.
https://doi.org/10.3390/f16060899
APA Style
Yang, C., Liu, X., Li, Y., & Yu, X.
(2025). Deep Learning-Based Multi-Label Classification for Forest Soundscape Analysis: A Case Study in Shennongjia National Park. Forests, 16(6), 899.
https://doi.org/10.3390/f16060899
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.