Wed-2-12-9 Adversarial Domain Adaptation for Speaker Verification using Partially Shared Network

Zhengyang Chen(MoE Key Lab of Artificial Intelligence SpeechLab, Department of Computer Science and EngineeringShanghai Jiao Tong University, Shanghai), Shuai Wang(Shanghai Jiao Tong University) and Yanmin Qian(Shanghai Jiao Tong University)

Abstract: Speaker verification systems usually suffer from large performance degradation when applied to a new dataset from another different domain. In this work, we will study the domain adaption strategy between datasets with different languages using domain adversarial training. We introduce a partially shared network based domain adversarial training architecture to learn an asymmetric mapping for source and target domain embedding extractor. This architecture can help the embedding extractor learn domain invariant feature without sacrificing the ability on speaker discrimination. When doing the evaluation on cross-lingual domain adaption, the source domain data is in English from NIST SRE04-10 and Switchboard, and the target domain data is in Cantonese and Tagalog from NIST SRE16. Our results show that the usual adversarial training mode will indeed harm the speaker discrimination when the source and target domain embedding extractors are fully shared, and in contrast the newly proposed architecture solves this problem and achieves ∼25.0% relative average Equal Error Rate (EER) improvement on SRE16 Cantonese and Tagalog evaluation.

Paper

prev Wed-2-12-8 Speaker-Aware Linear Discriminant Analysis in Speaker Verification

next No More

About

About the Conference

Welcome from the Chair

Conference Committees

Calls