Siti, Mujilahwati and Noor Zuraidin, Mohd Safar and Ku Muhammad Naim, Ku Khalif and Nasyitah, Ghazalli (2024) Optimizing sentiment analysis of Indonesian texts: Enhancing deep learning models with genetic algorithm-based feature selection. Journal of Soft Computing and Data Mining, 5 (2). pp. 208-222. ISSN 2716-621X. (Published)
PDF: Optimizing sentiment analysis of indonesian texts.pdf (1MB). Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
Automatic text classification techniques are used in many real-world applications, including filtering unsolicited messages, sentiment analysis, and news categorization. The primary challenge in text representation is high dimensionality, which increases model complexity and the risk of overfitting. To address this challenge, feature selection (FS) is performed during data pre-processing to improve the model's learning accuracy and efficiency. This study examines the optimization of Indonesian text sentiment analysis by integrating feature selection based on a genetic algorithm (GA) with deep learning models. Using GA, with fitness evaluated by a support vector machine (SVM), to reduce the data from 41,140 to 20,769 features increased accuracy by 8.10% for SVM, 36.1% for Naïve Bayes, 7.82% for LSTM, 5.47% for DNN, and 6.25% for CNN. Of the three deep learning models, LSTM achieved the highest accuracy, at 91.41%, and its computation time fell by nearly 50%.
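The GA-based feature selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`ga_select`, `toy_fitness`), the population size, the crossover and mutation rates, and the tournament selection scheme are all assumptions chosen for illustration. The study evaluates fitness with an SVM; here a simple stand-in scoring function is used instead, so the sketch runs with the Python standard library alone.

```python
import random


def ga_select(n_features, fitness, pop_size=20, generations=30,
              crossover_rate=0.8, mutation_rate=0.02, seed=0):
    """Search for a binary feature mask (1 = keep) that maximizes `fitness`."""
    rng = random.Random(seed)
    # Initial population: random binary masks over the feature set.
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        scored = [(fitness(ind), ind) for ind in pop]

        def tournament():
            # Pick two individuals at random and keep the fitter one.
            a, b = rng.sample(scored, 2)
            return (a if a[0] >= b[0] else b)[1]

        next_pop = [best[:]]  # elitism: carry the best mask forward
        while len(next_pop) < pop_size:
            p1, p2 = tournament(), tournament()
            if rng.random() < crossover_rate:
                cut = rng.randrange(1, n_features)  # single-point crossover
                child = p1[:cut] + p2[cut:]
            else:
                child = p1[:]
            # Bit-flip mutation: each gene flips with a small probability.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            next_pop.append(child)
        pop = next_pop
        best = max(pop, key=fitness)
    return best


# Hypothetical stand-in for the SVM-based fitness used in the study:
# reward a few "informative" features and lightly penalize mask size.
INFORMATIVE = {0, 3, 5}


def toy_fitness(mask):
    hits = sum(1 for i in INFORMATIVE if mask[i])
    return hits - 0.1 * sum(mask)


mask = ga_select(12, toy_fitness)
print(sum(mask), "of", len(mask), "features kept")
```

To move closer to the setup in the abstract, `toy_fitness` would be replaced by a function that trains an SVM on the columns selected by the mask and returns its validation accuracy. For scale, reducing 41,140 features to 20,769 removes roughly 49.5% of them.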
Item Type: | Article |
---|---|
Additional Information: | Indexed by Scopus |
Uncontrolled Keywords: | Automatic text classification; Deep learning models; Feature selection; Genetic algorithms; Sentiment analysis |
Subjects: | Q Science > Q Science (General); Q Science > QA Mathematics |
Faculty/Division: | Center for Mathematical Science; Centre of Excellence for Artificial Intelligence & Data Science |
Depositing User: | Mr Muhamad Firdaus Janih@Jaini |
Date Deposited: | 20 Feb 2025 08:53 |
Last Modified: | 20 Feb 2025 08:53 |
URI: | http://umpir.ump.edu.my/id/eprint/43886 |