Next Article in Journal
Controllable Synthesis of Special Reed-Leaf-Like Carbon Nanostructures Using Copper Containing Catalytic Pyrolysis for High-Performance Field Emission
Previous Article in Journal
An Internet of Things-Based Environmental Quality Management System to Supervise the Indoor Laboratory Conditions

Automatic Segmentation of Ethnomusicological Field Recordings

Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, 1000 Ljubljana, Slovenia
Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(3), 439;
Received: 30 December 2018 / Revised: 18 January 2019 / Accepted: 21 January 2019 / Published: 28 January 2019
The article presents a method for segmentation of ethnomusicological field recordings. Field recordings are integral documents of folk music performances captured in the field, and typically contain performances, intertwined with interviews and commentaries. As these are live recordings, captured in non-ideal conditions, they usually contain significant background noise. We present a segmentation method that segments field recordings into individual units labelled as speech, solo singing, choir singing, and instrumentals. Classification is based on convolutional deep networks, and is augmented with a probabilistic approach for segmentation. We describe the dataset gathered for the task and the tools developed for gathering the reference annotations. We outline a deep network architecture based on residual modules for labelling short audio segments and compare it to the more standard feature based approaches, where an improvement in classification accuracy of over 10% was obtained. We also present the SeFiRe segmentation tool that incorporates the presented segmentation method. View Full-Text
Keywords: audio segmentation; field recordings; deep learning; music information retrieval audio segmentation; field recordings; deep learning; music information retrieval
Show Figures

Graphical abstract

MDPI and ACS Style

Marolt, M.; Bohak, C.; Kavčič, A.; Pesek, M. Automatic Segmentation of Ethnomusicological Field Recordings. Appl. Sci. 2019, 9, 439.

AMA Style

Marolt M, Bohak C, Kavčič A, Pesek M. Automatic Segmentation of Ethnomusicological Field Recordings. Applied Sciences. 2019; 9(3):439.

Chicago/Turabian Style

Marolt, Matija, Ciril Bohak, Alenka Kavčič, and Matevž Pesek. 2019. "Automatic Segmentation of Ethnomusicological Field Recordings" Applied Sciences 9, no. 3: 439.

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Back to TopTop