- Article
Target Speaker Extraction Using Attention-Enhanced Temporal Convolutional Network
- Jian-Hong Wang
- Yen-Ting Lai
- Tzu-Chiang Tai
- Phuong Thi Le
- Tuan Pham
- Ze-Yu Wang
- Yung-Hui Li
- Jia-Ching Wang
- Pao-Chi Chang
When recording conversations, multiple people may talk at once. Human ears can filter out unwanted sounds, but overlapping speech is challenging for automatic speech recognition (ASR) systems and reduces their accuracy. To address this issu...