Proceedings of the 7th International Conference on Applied Engineering (ICAE 2024)

Enhancing Sentiment Analysis Performance Using SMOTE and Majority Voting in Machine Learning Algorithms

Authors
Fadli Suandi1, M. Khairul Anam2, *, Muhammad Bambang Firdaus3, Sofiansyah Fadli4, Lathifah Lathifah5, Eva Yumami2, Alfa Saleh6, Ade Zulkarnain Hasibuan2
1Politeknik Negeri Batam, Batam, 29461, Indonesia
2Universitas Samudra, Langsa, 24416, Indonesia
3Universitas Mulawarman, Samarinda, 75119, Indonesia
4STMIK Lombok, Lombok Tengah, 83511, Indonesia
5Universitas Teknokrat Indonesia, Bandar Lampung, 35123, Indonesia
6Politeknik Negeri Bengkalis, Bengkalis, 28711, Indonesia
*Corresponding author. Email: khairulanam@unsam.ac.id
Corresponding Author
M. Khairul Anam
Available Online 25 December 2024.
DOI
10.2991/978-94-6463-620-8_10How to use a DOI?
Keywords
Machine Learning; Majority Voting; SMOTE; Sentiment Analysis
Abstract

In the digital era, sentiment analysis on social media has become increasingly important in understanding public perception of various issues. However, one of the main challenges in sentiment analysis is the issue of data imbalance, where one class (such as positive sentiment) may significantly outnumber another (such as negative or neutral sentiment). This imbalance can lead to biased predictions in machine learning models, where the majority class is favored over the minority class. To address this, Synthetic Minority Over-sampling Technique (SMOTE) is used to artificially balance the dataset by creating synthetic samples from the minority class. SMOTE generates new instances by interpolating between existing minority instances, improving the distribution of the data and enhancing model performance. In this research, various machine learning algorithms are utilized to perform sentiment analysis on tweets collected with the hashtag “online learning”. The SMOTE oversampling technique is applied and compared with models that do not use SMOTE. This research focuses mainly on the Majority Voting algorithm, which combines predictions from multiple models to improve overall accuracy. The test results show that using SMOTE significantly improves the model’s performance, especially in terms of recall and F1-Score. The Majority Voting+SMOTE algorithm achieved the highest accuracy of 97%, demonstrating the effectiveness of this approach in handling data imbalance and producing more reliable predictions. These results confirm that SMOTE effectively improves model performance under imbalanced data conditions, especially in sentiment analysis.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 7th International Conference on Applied Engineering (ICAE 2024)
Series
Advances in Engineering Research
Publication Date
25 December 2024
ISBN
978-94-6463-620-8
ISSN
2352-5401
DOI
10.2991/978-94-6463-620-8_10How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Fadli Suandi
AU  - M. Khairul Anam
AU  - Muhammad Bambang Firdaus
AU  - Sofiansyah Fadli
AU  - Lathifah Lathifah
AU  - Eva Yumami
AU  - Alfa Saleh
AU  - Ade Zulkarnain Hasibuan
PY  - 2024
DA  - 2024/12/25
TI  - Enhancing Sentiment Analysis Performance Using SMOTE and Majority Voting in Machine Learning Algorithms
BT  - Proceedings of the  7th International Conference on Applied Engineering (ICAE 2024)
PB  - Atlantis Press
SP  - 126
EP  - 138
SN  - 2352-5401
UR  - https://doi.org/10.2991/978-94-6463-620-8_10
DO  - 10.2991/978-94-6463-620-8_10
ID  - Suandi2024
ER  -