Back to Search
Journal ArticleUnknown

Bangla Short Speech Commands Recognition Using Convolutional Neural Networks

Author Affiliations
North South University
Year2018
Citations32

Abstract

Despite being one of the most widely spoken languages of the world, no significant efforts have been made in Bangla speech recognition. Speech recognition is a difficult task, particularly if the demand is to do so in noisy real-life conditions. In this study, Bangla short speech commands data set has been reported, where all the samples are taken in the real-life setting. Three different convolutional neural network (CNN) architectures have been designed to recognize those short speech commands. Mel-frequency cepstral coefficients (MFCC) features have been extracted from the audio files in one approach whereas only the raw audio files have been used in another CNN architecture. Lastly, a pre-trained model which is trained on a large English short speech commands…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.