Mon-1-5-7 Fast and lightweight on-device TTS with Tacotron2 and LPCNet

Vadim Popov(Huawei Technologies Co. Ltd.), Stanislav Kamenev(Huawei Technologies Co. Ltd.), Mikhail Kudinov(Huawei Technologies Co. Ltd.), Sergey Repyevsky(Huawei Technologies Co. Ltd.), Tasnima Sadekova(Huawei Technologies Co. Ltd.), Vitalii Bushaev(Huawei Technologies Co. Ltd.), Vladimir Kryzhanovskiy(Huawei Technologies Co. Ltd.) and Denis Parkhomenko(Huawei Technologies Co. Ltd.)
Abstract: We present a fast and lightweight on-device text-to-speech system based on state-of-art methods of feature and speech generation i.e. Tacotron2 and LPCNet. We show that modification of the basic pipeline combined with hardware-specific optimizations and extensive usage of parallelization enables running TTS service even on low-end devices with faster than realtime waveform generation. Moreover, the system preserves high quality of speech without noticeable degradation of Mean Opinion Score compared to the non-optimized baseline. While the system is mostly oriented on low-to-mid range hardware we believe that it can also be used in any CPU-based environment.
Student Information

Student Events

Travel Grants