Speech emotion recognition using spectrogram based neural structured learning

Sivan, Dawn and Haripriya, P. H. and Jose, Rajan (2022) Speech emotion recognition using spectrogram based neural structured learning. In: The 6th National Conference for Postgraduate Research (NCON-PGR 2022), 15 November 2022, Virtual Conference, Universiti Malaysia Pahang, Malaysia. p. 80.

Abstract

Human emotions play a crucial role in daily life. Emotion analysis based solely on audio is difficult because the visual cues available from human faces are absent. This paper therefore reports an emotion recognition system that combines robust features with machine learning on speech audio. Speech audio is the input to the person-independent emotion recognition system, and spectrogram values are extracted from it as features. The extracted features are then used to train a model that recognizes emotions via Neural Structured Learning (NSL), a fast and accurate deep learning approach. In experiments on a speech emotion dataset, the proposed combination of spectrogram features and NSL produced higher recognition rates than other known models. The system can be used in smart environments such as homes or clinics to support effective healthcare, music recommendation, customer support, and marketing, among other applications. As a result, decisions can be made close to where the data resides rather than by processing data from distant sources. The Toronto Emotional Speech Set (TESS) dataset, which contains 7 emotions, was used for this research. Tested on this dataset, the algorithm achieves an accuracy of approximately 97%.
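
The feature-extraction step described in the abstract (spectrogram values computed from speech audio) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `log_spectrogram`, the frame length, hop size, Hann window, log scaling, and the synthetic 16 kHz test tone are all assumptions chosen for illustration, since the abstract does not state the settings used.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128, eps=1e-10):
    """Return a log-magnitude spectrogram of shape (n_frames, frame_len // 2 + 1).

    frame_len, hop, and eps are illustrative assumptions, not values from the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        # Zero-pad very short clips so that at least one full frame exists.
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping frames and taper each one.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude of the one-sided FFT of each frame.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # A small eps avoids log(0) on silent frames.
    return np.log(magnitude + eps)


# Hypothetical input: one second of a 440 Hz tone at an assumed 16 kHz sample rate.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spec = log_spectrogram(tone)
# Frequency bin with the highest average energy across frames.
peak_bin = int(spec.mean(axis=0).argmax())
```

Each row of `spec` is one time frame and each column one frequency bin, so the array can be passed to a classifier as a 2-D feature map. For real recordings such as TESS, the audio would first have to be loaded and resampled, which this sketch leaves out.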

Item Type: Conference or Workshop Item (Lecture)
Uncontrolled Keywords: Deep learning; Human computer interface; Neural structured learning; Spectrogram; Speech emotion recognition.
Subjects: H Social Sciences > HD Industries. Land use. Labor > HD28 Management. Industrial Management
Q Science > Q Science (General)
T Technology > T Technology (General)
Faculty/Division: Faculty of Industrial Sciences And Technology
Institute of Postgraduate Studies
Depositing User: Mr Muhamad Firdaus Janih@Jaini
Date Deposited: 07 Feb 2023 02:37
Last Modified: 07 Feb 2023 02:37
URI: http://umpir.ump.edu.my/id/eprint/36833