Mon-3-4-8 Attention-Driven Projections for Soundscape Classification

Dhanunjaya Varma Devalraju(Indian Institute of Technology, Mandi), Muralikrishna H(Indian Institute of Technology Mandi), Padmanabhan Rajan(Indian Institute of Technology Mandi) and Dileep Aroor Dinesh(Indian Institute of Technology Mandi)
Abstract: Acoustic soundscapes can be made up of background sound events and foreground sound events. Many times, either the background (or the foreground) may provide useful cues in discriminating one soundscape from another. A part of the background or a part of the foreground can be suppressed by using subspace projections. These projections can be learnt by utilising the framework of robust principal component analysis. In this work, audio signals are represented as embeddings from a convolutional neural network, and meta-embeddings are derived using an attention mechanism. This representation enables the use of class-specific projections for effective suppression, leading to good discrimination. Our experimental evaluation demonstrates the effectiveness of the method on standard datasets for acoustic scene classification.
Student Information

Student Events

Travel Grants