UMP Institutional Repository

A new soft set-based technique for clustering attribute selection in educational data mining

Suhirman (2016) A new soft set-based technique for clustering attribute selection in educational data mining. PhD thesis, Universiti Malaysia Pahang.

Available files (PDF):
FSKKP - SUHIRMAN - CD 9878.pdf (298kB)
FSKKP - SUHIRMAN - CD 9878 - CHAP 1.pdf (115kB)
FSKKP - SUHIRMAN - CD 9878 - CHAP 3.pdf (173kB)

Abstract

Determining the best clustering attribute is an essential step in data clustering, since it offers a relatively simple and efficient approach to attribute-based data clustering. Five well-known rough set- and soft set-based techniques for selecting a clustering attribute have been proposed: TR, MMR, MDA, NSS, and MAR. Of these, MAR achieves better computation time than the other four. However, execution time remains an outstanding issue for MAR because of the iterative process it uses to determine the relative attribute. This research proposes an alternative soft set-based technique for selecting a clustering attribute, named Maximum Degree of Domination in Soft set theory (MDDS). First, the notion of multi-soft sets is described. Second, the domination of soft sets and its degree are defined. Finally, the maximum degree of domination is used to determine the best clustering attribute. The proposed technique is evaluated on eighteen UCI benchmark machine learning datasets and compared with MAR. The results show that MDDS reduces computation time, outperforming MAR by up to 43.99%. Furthermore, MDDS scales well: its execution time tends to increase linearly as the data size increases. On the eight datasets that have a class attribute, accuracy increased by 3.23%. The proposed MDDS technique was also applied to a real-world clustering problem in Educational Data Mining. The datasets were taken from a survey on a few courses at the Information Engineering and Architecture Departments of the University Technology of Yogyakarta (UTY), Indonesia, during the last 4 years. The dominant attribute of the assessment dataset was determined using MDDS because of its improved efficiency and accuracy, so that decisions can be made faster and more accurately.
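The three steps named in the abstract (build multi-soft sets, score each attribute, pick the maximum) can be illustrated with a small sketch. The abstract does not give the thesis's definition of domination or its degree, so the score below is an assumed stand-in: a rough-set-style containment degree, swapped in only to make the pipeline concrete. The table, the attribute names, and the function names are all hypothetical and are not taken from the thesis.

```python
from collections import defaultdict


def multi_soft_set(table, attributes):
    """Decompose a categorical table into one soft set per attribute.

    Each soft set maps an attribute value to the set of object ids holding it.
    """
    soft_sets = {}
    for attr in attributes:
        classes = defaultdict(set)
        for obj_id, row in table.items():
            classes[row[attr]].add(obj_id)
        soft_sets[attr] = dict(classes)
    return soft_sets


def containment_degree(soft_a, soft_b, n_objects):
    """ASSUMED stand-in score, not the thesis's domination degree.

    Returns the fraction of objects whose value class under attribute b
    lies entirely inside a single value class under attribute a.
    """
    covered = 0
    for b_class in soft_b.values():
        if any(b_class <= a_class for a_class in soft_a.values()):
            covered += len(b_class)
    return covered / n_objects


def select_attribute(table, attributes):
    """Score every attribute against the others and return the top one."""
    soft_sets = multi_soft_set(table, attributes)
    n = len(table)
    scores = {}
    for a in attributes:
        scores[a] = max(
            containment_degree(soft_sets[a], soft_sets[b], n)
            for b in attributes
            if b != a
        )
    best = max(attributes, key=lambda a: scores[a])
    return best, scores


# Hypothetical toy survey data: object id -> categorical attribute values.
table = {
    1: {"grade": "A", "attendance": "high", "major": "IE"},
    2: {"grade": "A", "attendance": "high", "major": "Arch"},
    3: {"grade": "B", "attendance": "low", "major": "IE"},
    4: {"grade": "B", "attendance": "low", "major": "Arch"},
    5: {"grade": "C", "attendance": "low", "major": "IE"},
}
attributes = ["grade", "attendance", "major"]

soft_sets = multi_soft_set(table, attributes)
best, scores = select_attribute(table, attributes)
print(best, scores)
```

Only the first step (one soft set per attribute, mapping each value to the objects that hold it) follows directly from the notion of multi-soft sets; the scoring function would need to be replaced with the definition given in the thesis to reproduce MDDS itself.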

Item Type: Thesis (PhD)
Additional Information: Thesis (Doctor of Philosophy in Computer Science) -- Universiti Malaysia Pahang – 2016
Uncontrolled Keywords: soft sets-based techniques; Educational Data Mining
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > T Technology (General)
Faculty/Division: Faculty of Computer System And Software Engineering
Depositing User: Ms. Nurezzatul Akmal Salleh
Date Deposited: 09 Nov 2016 06:46
Last Modified: 09 Nov 2016 06:46
URI: http://umpir.ump.edu.my/id/eprint/15254