Heart disease prediction using ensemble of k-nearest neighbour, random forest and logistic regression method

Mohd Syafiq Asyraf, Suhaimi and Nor Azuana, Ramli and Noryanti, Muhammad (2024) Heart disease prediction using ensemble of k-nearest neighbour, random forest and logistic regression method. AIP Conference Proceedings, 2895 (1). pp. 1-10. ISSN 0094-243X. (Published)

[img] Pdf
syafiqicoaims.pdf
Restricted to Repository staff only

Download (692kB) | Request a copy
[img]
Preview
Pdf
Heart disease prediction using ensemble of k-nearest_ABST.pdf

Download (406kB) | Preview

Abstract

Coronary heart disease has been ranked as the number one leading cause of death in Malaysia. Based on the recent data published by WHO in 2018, death caused by this disease has reached 34,766 which brought up to 24.69 of the total deaths and places the Malaysian population 64th in the world. Medical researchers all around the world believe that there are multiple circumstances for this disease which include health problems, unhealthy personal habits, genetics, and family history. It is not an easy task to predict heart disease since the study needs a broad range of expertise from many disciplines. Recently, machine learning had been applied as one of the methods to predict heart disease. To test the accuracy of different machine learning methods, this study is conducted by applying the data extracted from the machine learning repository. The proposed predictive modelling in this study was developed using the ensemble method. The ensemble technique used was stacking where logistic regression was used as the meta-level classifier while Random Forest and k-nearest neighbour method were applied as the meta-level classifiers. Results obtained from this study show that the proposed method outperforms other single methods with 82.42 accuracies. Although the accuracy and RMSE of the ensemble method are similar to Random Forest, the proposed method is still the best method since it has a 0.903 value for the area under the ROC and 0.843 value for F1 score. This proposed predictive model will be applied by using smartwatch datasets for future study.

Item Type: Article
Additional Information: Indexed by Scopus
Uncontrolled Keywords: Coronary heart disease; Chronic diseases; Machine learning
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
R Medicine > RC Internal medicine
Faculty/Division: Center for Mathematical Science
Institute of Postgraduate Studies
Depositing User: Dr. Nor Azuana Ramli
Date Deposited: 24 Apr 2024 04:20
Last Modified: 24 Apr 2024 04:20
URI: http://umpir.ump.edu.my/id/eprint/41052
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item