Wed-3-8-8 Meta Multi-task Learning for Speech Emotion Recognition

Ruichu Cai (GDUT), Kaibin Guo (Guangdong University of Technology), Boyan Xu (Faculty of Computer, Guangdong University of Technology), Xiaoyan Yang (YITU) and Zhenjie Zhang (Yitu Technology)

Abstract: Most existing Speech Emotion Recognition (SER) approaches ignore the relationship between the categorical emotional labels and the dimensional labels in valence, activation or dominance space. Although multi-task learning has recently been introduced to explore such auxiliary tasks of SER, existing approaches only share the feature extractor under the traditional multi-task learning framework and cannot efficiently transfer knowledge from the auxiliary tasks to the target task. To address these issues, we propose a Meta Multi-task Learning method for SER that combines multi-task learning with meta learning. Our contributions include: 1) to model the relationship among auxiliary tasks, we extend the task generation of meta learning to the form of multiple tasks, and 2) to transfer knowledge from the auxiliary tasks to the target task, we propose a tuning-based transfer training mechanism in the meta learning framework. Experiments on IEMOCAP show that our approach outperforms the state-of-the-art solution (UA: 70.32%, WA: 76.64%).
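The abstract reports results as UA and WA without defining them. In SER work, WA (weighted accuracy) usually means overall accuracy across all utterances, and UA (unweighted accuracy) usually means the average of per-class recalls. The sketch below illustrates that common convention; the function names and the toy labels are hypothetical and are not taken from the paper, which may compute the metrics differently.

```python
from collections import defaultdict


def weighted_accuracy(y_true, y_pred):
    """Overall accuracy: fraction of all utterances classified correctly."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so every emotion class counts equally."""
    total = defaultdict(int)  # utterances per true class
    hit = defaultdict(int)    # correctly classified utterances per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        hit[t] += int(t == p)
    return sum(hit[c] / total[c] for c in total) / len(total)


# Hypothetical, imbalanced toy labels (not data from the paper).
y_true = ["neutral", "neutral", "neutral", "neutral",
          "happy", "happy", "sad", "angry"]
y_pred = ["neutral", "neutral", "neutral", "happy",
          "happy", "sad", "sad", "angry"]

wa = weighted_accuracy(y_true, y_pred)    # 6 of 8 utterances correct
ua = unweighted_accuracy(y_true, y_pred)  # mean of recalls 0.75, 0.5, 1.0, 1.0
print(f"WA = {wa:.4f}, UA = {ua:.4f}")
```

Because emotion classes in corpora such as IEMOCAP are imbalanced, the two numbers can differ noticeably, which is why both are commonly reported.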

