Wed-2-5-9 Bidirectional LSTM Network with Ordered Neurons for Speech Enhancement

Xiaoqi Li(Wuhan University of Technology), Yaxing Li(School of Computer Science and Technology, Wuhan University of Technology), Yuanjie Dong(Wuhan University of Technology), Shan Xu(Wuhan University of Technology), Zhihui Zhang(Wuhan University of Technology), Dan Wang(Wuhan University of Technology) and Shengwu Xiong(Wuhan University of Technology)
Abstract: Speech enhancement aims to reduce the noise and improve the quality and intelligibility of noisy speech. Long short-term memory (LSTM) network frameworks have achieved great success on many speech enhancement applications. In this paper, the ordered neurons long short-term memory (ON-LSTM) network with a new inductive bias to differential the long/short-term information in each neuron is proposed for speech enhancement. Comparing the low-ranking neurons with short-term or local information, the high-ranking neurons which contain the long-term or global information always update less frequently for a wide range of influence. Thus, the ON-LSTM can automatically learn the clean speech information from noisy input and show better expressive ability. We also propose a rearrangement concatenation rule to connect the ON-LSTM outputs of forward and backward layers to construct the bidirectional ON-LSTM (Bi-ONLSTM) for further performance improvement. The experimental results reveal that the proposed ON-LSTM schemes produce better enhancement performance than the vanilla LSTM baseline. And visualization result shows that our proposed model can effectively capture clean speech components from noisy inputs.
Student Information

Student Events

Travel Grants