Mon-1-9-9 Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition

Zheng Lian(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing), Jianhua Tao(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing), Bin Liu(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing), Jian Huang(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing), Zhanlei Yang(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing) and Rongjun Li(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing)
Abstract: Emotion recognition remains a complex task due to speaker variations and low-resource training samples. To address these difficulties, we focus on the domain adversarial neural networks (DANN) for emotion recognition. The primary task is to predict emotion labels. The secondary task is to learn a common representation where speaker identities can not be distinguished. By using this approach, we bring the representations of different speakers closer. Meanwhile, through using the unlabeled data in the training process, we alleviate the impact of low-resource training samples. In the meantime, prior work found that contextual information and multimodal features are important for emotion recognition. However, previous DANN based approaches ignore these information, thus limiting their performance. In this paper, we propose the context-dependent domain adversarial neural network for multimodal emotion recognition. To verify the effectiveness of our proposed method, we conduct experiments on the benchmark dataset IEMOCAP. Experimental results demonstrate that the proposed method shows an absolute improvement of 3.48% over state-of-the-art strategies.
Student Information

Student Events

Travel Grants