Mon-SS-2-6-5 The TalTech Systems for the Short-duration Speaker Verification Challenge 2020

Tanel Alumäe(Tallinn University of Technology) and Jörgen Valk(Tallinn University of Technology)

Abstract: This paper presents the Tallinn University of Technology systems submitted to the Short-duration Speaker Verification Challenge 2020. The challenge consists of two tasks, focusing on text-dependent and text-independent speaker verification with some cross-lingual aspects. We used speaker embedding models that consist of squeeze-and-attention based residual layers, multi-head attention and either cross-entropy-based or additive angular margin based objective function. In order to encourage the model to produce language-independent embeddings, we trained the models in a multi-task manner, using dataset specific output layers. In the text-dependent task we employed a phrase classifier to reject trials with non-matching phrases. In the text-independent task we used a language classifier to boost the scores of trials where the language of the test and enrollment utterances does not match. Our final primary metric score was 0.075 in Task 1 (ranked as 6th) and 0.118 in Task 2 (rank 8).

Paper

prev Mon-SS-2-6-4 Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020

next Mon-SS-2-6-6 Investigation of NICT submission for short-duration speaker verification challenge 2020

About

About the Conference

Welcome from the Chair

Conference Committees

Calls