Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison

Machine learning and data mining-based approaches to prediction and detection of heart disease would be of great clinical utility, but are highly challenging to develop. In most countries there is a lack of cardiovascular expertise and a significant rate of incorrectly diagnosed cases which could be...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers in biology and medicine 2021-09, Vol.136, p.104672-104672, Article 104672
Hauptverfasser: Ali, Md Mamun, Paul, Bikash Kumar, Ahmed, Kawsar, Bui, Francis M., Quinn, Julian M.W., Moni, Mohammad Ali
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Machine learning and data mining-based approaches to prediction and detection of heart disease would be of great clinical utility, but are highly challenging to develop. In most countries there is a lack of cardiovascular expertise and a significant rate of incorrectly diagnosed cases which could be addressed by developing accurate and efficient early-stage heart disease prediction by analytical support of clinical decision-making with digital patient records. This study aimed to identify machine learning classifiers with the highest accuracy for such diagnostic purposes. Several supervised machine-learning algorithms were applied and compared for performance and accuracy in heart disease prediction. Feature importance scores for each feature were estimated for all applied algorithms except MLP and KNN. All the features were ranked based on the importance score to find those giving high heart disease predictions. This study found that using a heart disease dataset collected from Kaggle three-classification based on k-nearest neighbor (KNN), decision tree (DT) and random forests (RF) algorithms the RF method achieved 100% accuracy along with 100% sensitivity and specificity. Thus, we found that a relatively simple supervised machine learning algorithm can be used to make heart disease predictions with very high accuracy and excellent potential utility. •This research aims to find out some classifiers with the highest accuracy to predict heart disease.•Several supervised machine-learning algorithms were applied and compared their performance and accuracy.•Feature importance score for each feature is estimated for all the applied algorithms except MLP and KNN.•All the features are ranked based on the feature importance score to find highly predictive features.•KNN, DT, RF gained 100% accuracy along with 100% sensitivity and specificity.
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2021.104672