Wed-2-4-6 Conv-TasSAN: Separative Adversarial Network based on Conv-TasNet

Chengyun Deng (Didi Chuxing), Yi Zhang (Didi Chuxing), Shiqian Ma (Didi Chuxing), Yongtao Sha (Didi Chuxing), Hui Song (Didi Chuxing) and Xiangang Li (Didi Chuxing)
Abstract: Conv-TasNet has shown competitive performance on single-channel speech source separation. In this paper, we investigate how to further improve separation performance by optimizing the training mechanism while keeping the same network structure. Motivated by the successful application of generative adversarial networks (GANs) to speech enhancement tasks, we propose a novel Separative Adversarial Network called Conv-TasSAN, in which the separator is realized using the Conv-TasNet architecture. A discriminator is used to optimize the separator with respect to a specific objective speech metric. This lets the separator network capture the distribution of speech sources more accurately and also prevents over-smoothing. Experiments on the WSJ0-2mix dataset confirm the superior performance of the proposed method over Conv-TasNet in terms of SI-SNR and PESQ improvement.
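The abstract reports results in SI-SNR without defining it. As a point of reference, the following is a minimal sketch of the commonly used scale-invariant signal-to-noise ratio, not the authors' evaluation code. The function name `si_snr`, the `eps` stabilizer, and the sample values are assumptions made for illustration.

```python
import math


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between two equal-length sample sequences.

    Hypothetical helper for illustration; `eps` guards against division by zero.
    """
    if len(estimate) != len(target):
        raise ValueError("estimate and target must have the same length")
    n = len(target)

    # Remove the mean of each signal so a DC offset does not affect the score.
    est_mean = sum(estimate) / n
    tgt_mean = sum(target) / n
    est = [x - est_mean for x in estimate]
    tgt = [x - tgt_mean for x in target]

    # Project the estimate onto the target to get the "target" component.
    dot = sum(e * t for e, t in zip(est, tgt))
    tgt_energy = sum(t * t for t in tgt) + eps
    scale = dot / tgt_energy
    s_target = [scale * t for t in tgt]

    # Whatever is left after the projection is treated as noise.
    e_noise = [e - s for e, s in zip(est, s_target)]

    num = sum(s * s for s in s_target)
    den = sum(e * e for e in e_noise) + eps
    return 10.0 * math.log10(num / den + eps)


# Made-up sample values, for illustration only.
reference = [1.0, -1.0, 0.5, -0.5]
close_estimate = [0.9, -1.1, 0.6, -0.4]
rough_estimate = [0.5, -1.5, 1.0, 0.0]

close_score = si_snr(close_estimate, reference)
rescaled_score = si_snr([3.0 * x for x in close_estimate], reference)
rough_score = si_snr(rough_estimate, reference)
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the estimate leaves the score essentially unchanged, while a less accurate estimate scores lower.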