Speech Translation and Multilingual/Multimodal Learning

Tue-1-1-2 Efficient Wait-k Models for Simultaneous Machine Translation

Maha Elbayad(INRIA / LIG), Laurent Besacier(LIG) and Jakob Verbeek(INRIA)
Abstract: Simultaneous machine translation consists in starting output generation before the entire input sequence is available. Wait-k decoders offer a simple but efficient approach for this problem.They first read k source tokens, after which they alternate be-tween producing a target token and reading another source token.We investigate the behavior of wait-k decoding in low resource settings for spoken corpora using IWSLT datasets. We improve training of these models using unidirectional encoders, and train-ing across multiple values of k. Experiments with Transformer and 2D-convolutional architectures show that our wait-k models generalize well across a wide range of latency levels. We also show that the 2D-convolution architecture is competitive with Transformers for simultaneous translation of spoken language.
Student Information

Student Events

Travel Grants