Hazim, Mohamad and Zainal Abidin, Ahmad Firdaus and Firdaus, Afifi and Zaki, Faiz and Adewole, Kayode Sakariyah and Anuar, Nor Badrul (2019) Domain-Independent Reviews' Sentiment Polarity Classification using Shallow Word2Seq Convolutional Neural Network. [Research Data]
x_test_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.
Download (7MB)
x_train_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.
Download (7MB)
y_test_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.
Download (25kB)
y_train_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.
Download (25kB)
Abstract
Reviews and comments are perceptions about specific services or products. They are embedded with hidden sentiments the reviewer has towards certain subjects. Business owners use customer reviews to understand customers’ perceptions about specific services or products. The ability to understand reviews’ sentiment from different domains or areas give decision makers and business owners the opportunity to make critical business decisions which can help them to increase profits of their businesses. Previous studies had focussed on classifying sentiment polarity by using traditional machine learning and deep learning methods. However, these suffered from low model generalization ability, causing the models to perform better only on single domain datasets rather than multiple domain datasets. The problem is the inability of the classification model to learn domain-restricted knowledge from multi-domain datasets. Aiming to improve the accuracy of the cross-domain classification, this paper proposes a method which uses Word2Seq Convolutional Neural Network (CNN) to classify reviews’ sentiment across multiple domain datasets (i.e. digital worker, movie, product, hotel and restaurant reviews). The evaluation showed that the proposed method had achieved the state-of-the-art performance. The high classification performance also promoted the reliability and effectiveness of implementing the Word2Seq CNN to classify reviews’ sentiment across different domains and learn domain restricted knowledge while improving the model generalization ability.
The uploaded dataset is a sampled dataset with 5000 observations for both training and testing sets.
| Item Type: | Research Data |
|---|---|
| Additional Information: | Cite dataset: Mohamad Hazim, Ahmad Firdaus, Firdaus Afifi, Faiz Zaki, Kayode Sakariyah Adewole, & Nor Badrul Anuar. (2019). Domain-Independent Reviews' Sentiment Polarity Classification using Shallow Word2Seq Convolutional Neural Network [Data set]. Zenodo. https://doi.org/10.5281/zenodo.2537798 |
| Uncontrolled Keywords: | Sentiment Analysis, Review Sentiment Classification, Convolutional Neural Networks |
| Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
| Faculty/Division: | Faculty of Computing |
| Depositing User: | Ms Affida Abu Bakar |
| Date Deposited: | 04 Dec 2025 02:28 |
| Last Modified: | 04 Dec 2025 02:28 |
| URI: | https://umpir.ump.edu.my/id/eprint/45499 |
| Statistic Details: | View Download Statistic |

