Domain-Independent Reviews' Sentiment Polarity Classification using Shallow Word2Seq Convolutional Neural Network

Hazim, Mohamad and Zainal Abidin, Ahmad Firdaus and Firdaus, Afifi and Zaki, Faiz and Adewole, Kayode Sakariyah and Anuar, Nor Badrul (2019) Domain-Independent Reviews' Sentiment Polarity Classification using Shallow Word2Seq Convolutional Neural Network. [Research Data]

[thumbnail of Sampled dataset with 5000 observations for both training and testing sets] Spreadsheet (Sampled dataset with 5000 observations for both training and testing sets)
x_test_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.

Download (7MB)
[thumbnail of Sampled dataset with 5000 observations for both training and testing sets] Spreadsheet (Sampled dataset with 5000 observations for both training and testing sets)
x_train_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.

Download (7MB)
[thumbnail of Sampled dataset with 5000 observations for both training and testing sets] Spreadsheet (Sampled dataset with 5000 observations for both training and testing sets)
y_test_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.

Download (25kB)
[thumbnail of Sampled dataset with 5000 observations for both training and testing sets] Spreadsheet (Sampled dataset with 5000 observations for both training and testing sets)
y_train_5k_sampled.csv - Supplemental Material
Available under License Creative Commons Attribution.

Download (25kB)

Abstract

Reviews and comments are perceptions about specific services or products. They are embedded with hidden sentiments the reviewer has towards certain subjects. Business owners use customer reviews to understand customers’ perceptions about specific services or products. The ability to understand reviews’ sentiment from different domains or areas give decision makers and business owners the opportunity to make critical business decisions which can help them to increase profits of their businesses. Previous studies had focussed on classifying sentiment polarity by using traditional machine learning and deep learning methods. However, these suffered from low model generalization ability, causing the models to perform better only on single domain datasets rather than multiple domain datasets. The problem is the inability of the classification model to learn domain-restricted knowledge from multi-domain datasets. Aiming to improve the accuracy of the cross-domain classification, this paper proposes a method which uses Word2Seq Convolutional Neural Network (CNN) to classify reviews’ sentiment across multiple domain datasets (i.e. digital worker, movie, product, hotel and restaurant reviews). The evaluation showed that the proposed method had achieved the state-of-the-art performance. The high classification performance also promoted the reliability and effectiveness of implementing the Word2Seq CNN to classify reviews’ sentiment across different domains and learn domain restricted knowledge while improving the model generalization ability.

The uploaded dataset is a sampled dataset with 5000 observations for both training and testing sets.

Item Type: Research Data
Additional Information: Cite dataset: Mohamad Hazim, Ahmad Firdaus, Firdaus Afifi, Faiz Zaki, Kayode Sakariyah Adewole, & Nor Badrul Anuar. (2019). Domain-Independent Reviews' Sentiment Polarity Classification using Shallow Word2Seq Convolutional Neural Network [Data set]. Zenodo. https://doi.org/10.5281/zenodo.2537798
Uncontrolled Keywords: Sentiment Analysis, Review Sentiment Classification, Convolutional Neural Networks
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Faculty/Division: Faculty of Computing
Depositing User: Ms Affida Abu Bakar
Date Deposited: 04 Dec 2025 02:28
Last Modified: 04 Dec 2025 02:28
URI: https://umpir.ump.edu.my/id/eprint/45499
Statistic Details: View Download Statistic

Actions (login required)

View Item
View Item