Journal ArticleUnknown

Speech Emotion Recognition using Machine Learning

Authors

Ajmeera Kiran, Mudassir Khan, J. Chinna Babu, B. P. Santosh Kumar, …

Author Affiliations

King Khalid University, Yogi Vemana University, Presidency University

Year2023

Citations3

DOI10.1109/iccams60113.2023.10526124

Abstract

Humans communicate most naturally and quickly through speech. Speech Emotion Recognition (SER) attempts to identify human emotions and affective states from speech. This capitalizes on voice tone and pitch, reflecting underlying emotion. The voice signal is a rich source of information and a powerful medium for communicating approach and eliciting emotions. Audio emotion recognition needs feature extraction and classifier training. The feature vector must contain audio signal elements that characterize speaker-specific properties, including tone, pitch, and energy, to train the classifier model to classify an emotion effectively. RAVDESS, an English language open source dataset of male and female acted speech corpus, is carefully trained and tested. We retrieved MFCC coefficients from training dataset audio samples to reflect speaker vocal tract…

View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.

Fields & Keywords

Social Sciences Psychology Experimental and Cognitive Psychology Emotion and Mood Recognition Speech and Audio Processing Music and Audio Processing Speech recognition Artificial intelligence Psychiatry