Proceedings of the International Conference on Computational Innovations and Emerging Trends (ICCIET- 2024)

Comparative Analysis of Machine Learning Models for Emotion Classification in Speech Data

Authors
N. Siva1, B. Venkata Sivaiah2, *, G. Sai Kumar3, G. Jaya Vardhan Raju3, V. Sushvitha3, G. Chaithanya3, Sam Goundar4
1Assistant Professor, Department of Artificial Intelligence and Machine Learning, Annamacharya Institute of Technology and Sciences, Tirupati, Rajampet, India
2Assistant Professor, Department of Data Science, Mohan Babu University (Erstwhile Sree Vidyanikethan Engineering College), Tirupati, India
3UG Scholar, Department of Computer Science and Systems Engineering, Sree Vidyankethan Engineering College, Tirupati, India
4RMIT University, Melbourne, Australia
*Corresponding author. Email: siva.bheem@hotmail.com
Corresponding Author
B. Venkata Sivaiah
Available Online 30 July 2024.
DOI
10.2991/978-94-6463-471-6_45How to use a DOI?
Keywords
Convolutional Neural Networks (CNN); Decision Trees; Emotion Recognition; LSTM Networks; Machine Learning Algorithms
Abstract

Understanding emotions is critical to many fields, including psychology, medicine, and human-computer interaction. The study uses datasets from RAVDESS, SAVEE, CREMA, and TESS which cover a wide spectrum of emotions, including neutral, surprise, happiness, sadness, disgust, anger, and fear to thoroughly investigate machine learning algorithms for emotion identification in audio data. Long short-term memory (LSTM) networks, decision trees, and convolutional neural networks (CNNs) are the three different models that are investigated. Decision trees provide simple classification, LSTMs extract temporal correlations from the data, and CNNs are excellent at extracting features from audio signals. Performance indicators like recall, F1, precision, and accuracy score are used in performance evaluation. Significantly, the CNN model outperforms Decision Trees and LSTM networks with 72% and 77%, respectively, in emotion categorization accuracy, reaching a remarkable 91%. This work offers insightful information about how well different machine learning models perform when it comes to audio-based emotion recognition. These realizations will have a big impact on developing trustworthy emotion detection systems for emotional computing, human-robot interaction, and mental health assessment. Future studies could investigate ensemble approaches or hybrid models to improve emotion detection capabilities and progress the creation of increasingly intricate and accurate emotion recognition systems.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the International Conference on Computational Innovations and Emerging Trends (ICCIET- 2024)
Series
Advances in Computer Science Research
Publication Date
30 July 2024
ISBN
978-94-6463-471-6
ISSN
2352-538X
DOI
10.2991/978-94-6463-471-6_45How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - N. Siva
AU  - B. Venkata Sivaiah
AU  - G. Sai Kumar
AU  - G. Jaya Vardhan Raju
AU  - V. Sushvitha
AU  - G. Chaithanya
AU  - Sam Goundar
PY  - 2024
DA  - 2024/07/30
TI  - Comparative Analysis of Machine Learning Models for Emotion Classification in Speech Data
BT  - Proceedings of the International Conference on Computational Innovations and Emerging Trends (ICCIET- 2024)
PB  - Atlantis Press
SP  - 464
EP  - 474
SN  - 2352-538X
UR  - https://doi.org/10.2991/978-94-6463-471-6_45
DO  - 10.2991/978-94-6463-471-6_45
ID  - Siva2024
ER  -