Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical Review

Bibliographic details
Published in: IEEE Access 2024, Vol. 12, p. 80698-80730
Main authors: Ghaniaviyanto Ramadhan, Nur; Adiwijaya; Maharani, Warih; Akbar Gozali, Alfian
Format: Article
Language: English
Description
Summary: According to the World Health Organization (WHO), early prevention is essential for chronic diseases such as diabetes mellitus, stroke, cancer, cardiovascular disease, kidney failure, and hypertension. One preventive measure is to predict chronic diseases with machine learning, based on personal medical records or general checkup results. The usual objective is to minimize the prediction error. The factors that most influence chronic disease prediction are the quality of the data and the choice of predictor, that is, the machine learning method. Five main problems lower data quality: outliers, missing values, feature selection, normalization, and imbalance. Once data quality is ensured, the next task is to choose the best machine learning method. The most influential factor in choosing a predictor is its performance evaluation (accuracy, recall, precision, F1-score). Thus, chronic disease prediction aims to improve performance and to solve problems in medical data. This paper presents a Systematic Literature Review (SLR) that offers a comprehensive discussion of research on chronic disease prediction using machine learning and the associated data preprocessing. It covers machine learning methods including supervised learning, ensemble learning, deep learning, and reinforcement learning. The preprocessing topics discussed are missing values, outliers, feature selection, normalization, and imbalance. The paper closes with open issues and potential future work on improving prediction performance for chronic diseases through data preprocessing and machine learning methods.
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2024.3406748
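
The summary above names several preprocessing steps and four evaluation metrics without showing how they are computed. The following is a minimal sketch in plain Python of two of the steps (filling missing values, normalization) and of the metrics (accuracy, precision, recall, F1-score). It is not taken from the paper: median imputation and min-max scaling are only two of many possible choices, and the function names and sample values are hypothetical.

```python
from statistics import median


def impute_median(column):
    """Replace missing entries (None) with the median of the observed values.

    Median imputation is an assumption here; the summary does not say
    which imputation strategy the reviewed studies use.
    """
    observed = [v for v in column if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in column]


def min_max_normalize(column):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall, and F1 for a binary task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Hypothetical glucose readings with one missing value.
    glucose = impute_median([90, None, 150, 110])
    print(min_max_normalize(glucose))

    # Hypothetical labels: 1 = disease present, 0 = absent.
    print(classification_metrics([1, 0, 1, 1, 0, 0], [1, 0, 0, 1, 1, 0]))
```

On imbalanced medical data, accuracy alone can look high while recall on the disease class stays low, which is one reason to report all four metrics together.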