Thu-3-1-9 Raw speech waveform based classification of patients with ALS, Parkinson’s Disease and healthy controls using CNN-BLSTM

Jhansi Mallela(Indian Institute of Science), Aravind Illa(PhD Student, Indian Institute of Science, Bangalore), Yamini Belur(National Institute of Mental Health and Neurosciences), Nalini Atchayaram(National Institute of Mental Health and Neurosciences), Pradeep Reddy(National Institute of Mental Health and Neurosciences), Dipanjan Gope(National Institute of Mental Health and Neurosciences) and Prasanta Kumar Ghosh(Indian Institute of Science)
Abstract: Analysis of speech waveform through automated methods in patients with Amyotrophic Lateral Sclerosis (ALS), and Parkinson's disease (PD) can be used for early diagnosis and monitoring disease progression. Many works in the past have used different acoustic features for the classification of patients with ALS and PD with healthy controls (HC). In this work, we propose a data-driven approach to learn representations from raw speech waveforms. Our model comprises of 1-D CNN layer to extract representations from raw speech followed by BLSTMlayers for the classification tasks. We consider 3 different classification tasks (ALS vs HC), (PD vs HC), and (ALS vs PD). We perform each classification task using four different speech stimuli in two scenarios: i) trained and tested in a stimulus-specific manner, ii) trained on data pooled from all stimuli, and test on each stimulus separately. Experiments with 60 ALS,60 PD, and 60 HC show that the frequency responses of the learned 1-D CNN filters are low pass in nature, and the center frequencies lie below 1kHz. The learned representations from raw speech perform better than MFCC which is considered as baseline. Experiments with pooled models yield a better result compared to the task-specific models.
Student Information

Student Events

Travel Grants