Wed-1-2-2 A low latency ASR-free end to end spoken language understanding system

Mohamed Mhiri (fluent.ai), Samuel Myer (fluent.ai) and Vikrant Singh Tomar (fluent.ai)

Abstract: In recent years, developing a speech understanding system that maps a waveform directly to structured data, such as intents and slots, without first transcribing the speech to text has emerged as an interesting research problem. This work proposes such a system under the additional constraint that its footprint be small enough to run on small microcontrollers and embedded systems with minimal latency. Given a streaming input speech signal, the proposed system processes it segment by segment, without needing the entire stream to be available at the moment of processing. The proposed system is evaluated on the publicly available Fluent Speech Commands dataset. Experiments show that it yields state-of-the-art performance, with the advantages of low latency and a much smaller model than other published works on the same task.
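
The segment-by-segment processing described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the names `segments`, `RunningEnergy`, and `process_stream` are hypothetical, and the running mean energy is only a stand-in for whatever state a real streaming network would carry between segments. The point is that each segment is consumed as it arrives and only a small state is kept, so the full stream is never held in memory.

```python
from typing import Iterable, Iterator, List


def segments(stream: Iterable[float], size: int) -> Iterator[List[float]]:
    """Yield fixed-size segments as samples arrive; the last one may be shorter."""
    buf: List[float] = []
    for sample in stream:
        buf.append(sample)
        if len(buf) == size:
            yield buf
            buf = []
    if buf:
        yield buf


class RunningEnergy:
    """Hypothetical stand-in for a streaming model with a small carried state."""

    def __init__(self) -> None:
        self.total = 0.0  # sum of squared samples seen so far
        self.count = 0    # number of samples seen so far

    def update(self, segment: List[float]) -> float:
        """Fold one segment into the state and return the mean energy so far."""
        self.total += sum(x * x for x in segment)
        self.count += len(segment)
        return self.total / self.count


def process_stream(stream: Iterable[float], size: int) -> List[float]:
    """Produce one output per segment without buffering the whole stream."""
    model = RunningEnergy()
    return [model.update(seg) for seg in segments(stream, size)]
```

Because `segments` is a generator, `process_stream` also accepts a live iterator (for example, samples read from an audio device), and an output is available as soon as each segment completes.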
