- Review
A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face
- Hailun Lian,
- Cheng Lu,
- Sunan Li,
- Yan Zhao,
- Chuangao Tang and
- Yuan Zong
Multimodal emotion recognition (MER) refers to the identification and understanding of human emotional states by combining different signals, including—but not limited to—text, speech, and face cues. MER plays a crucial role in the human&...

