Wed-SS-3-11-3 STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020

Aleksei Gusev (STC-innovations/ITMO), Vladimir Volokhov (STC-innovations), Alisa Vinogradova (STC-innovations/ITMO), Tseren Andzhukaev (STC-innovations), Andrey Shulipa (ITMO), Sergey Novoselov (STC-innovations/ITMO), Timur Pekhovsky (STC-innovations) and Alexander Kozlov (STC-innovations)

Abstract: This paper presents the speaker recognition (SR) systems submitted by the Speech Technology Center (STC) team to the Far-Field Speaker Verification Challenge 2020. The SR tasks of the challenge focus on far-field text-dependent speaker verification from a single microphone array (Track 1), far-field text-independent speaker verification from a single microphone array (Track 2), and far-field text-dependent speaker verification from distributed microphone arrays (Track 3). In this paper, we present the techniques and ideas underlying our best-performing models. Experiments on x-vector-based and ResNet-based architectures show that the ResNet-based networks outperform the x-vector-based systems. The submitted systems are fusions of ResNet34-based extractors trained on 80 log Mel-filter bank energies (MFBs) post-processed with a U-net-based voice activity detector (VAD). The best systems for Track 1, Track 2 and Track 3 achieved 5.08% EER and 0.500 minDCF, 5.39% EER and 0.541 minDCF, and 5.53% EER and 0.458 minDCF, respectively, on the challenge evaluation sets.
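
The results above are reported as equal error rate (EER), the operating point at which the false acceptance rate equals the false rejection rate. As a rough illustration of that metric only, the following minimal sketch estimates EER from two lists of verification scores. The function name `equal_error_rate` and the toy scores are hypothetical; this is not the challenge's official scoring tool, and it does not compute minDCF.

```python
import numpy as np


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER from verification scores (hypothetical helper).

    Higher scores are assumed to mean "more likely the same speaker".
    """
    target_scores = np.asarray(target_scores, dtype=float)
    nontarget_scores = np.asarray(nontarget_scores, dtype=float)

    # Candidate thresholds: every observed score, in ascending order.
    thresholds = np.sort(np.concatenate([target_scores, nontarget_scores]))

    # False rejection rate: target trials scored below the threshold.
    frr = np.array([(target_scores < t).mean() for t in thresholds])
    # False acceptance rate: non-target trials scored at or above it.
    far = np.array([(nontarget_scores >= t).mean() for t in thresholds])

    # Pick the threshold where the two rates are closest and average them.
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2.0


# Toy scores (illustrative only, not data from the paper).
eer = equal_error_rate([1.0, 3.0], [0.0, 2.0])
print(f"EER = {100 * eer:.2f}%")
```

With real trial lists, a finer estimate would interpolate between neighbouring thresholds; the nearest-point average used here is a simplification.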
