Optimized classification of diabetes using dynamic waterwheel plant optimization algorithm

The classification of chronic diseases has been a prominent research focus in public health, extensively leveraging machine learning algorithms. One of these chronic diseases that has significant rates of occurrence all around the world is diabetes, which is a disease by itself. Many academics are w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2024-10, Vol.14 (1), p.23386-16, Article 23386
Hauptverfasser: El-kenawy, El-Sayed M., Alhussan, Amel Ali, Khafaga, Doaa Sami, Eid, Marwa M., Abdelhamid, Abdelaziz A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The classification of chronic diseases has been a prominent research focus in public health, extensively leveraging machine learning algorithms. One of these chronic diseases that has significant rates of occurrence all around the world is diabetes, which is a disease by itself. Many academics are working to construct robust machine-learning algorithms for accurate categorization, given the prevalence of this chronic disease. A revolutionary methodology that can accurately categorize diabetic disease is the focus of this study, which aims to provide new methods. The proposed technique in this work is based on developing a novel feature selection method, DWWPA, which stands for dynamic waterwheel plant algorithm. The DWWPA algorithm is utilized in the process of optimizing the K-nearest neighbors (KNN) model in order to improve the accuracy of its classification. In the feature selection process, a binary representation of this method is called binary DWWPA (bDWWPA). Several different machine learning models and optimization techniques are compared to the strategy that has been presented. When categorizing diabetes cases in the dataset, the findings demonstrate the superiority and success of the proposed method. Furthermore, several different statistical analysis techniques, such as Analyses of variance (ANOVA) and Wilcoxon signed-rank test, are carried out to investigate the statistical difference and importance of the suggested strategy in contrast to the other ways at the same level of competition. The conclusions of these tests were consistent with what was anticipated they would be. Based on the suggested feature selection and the optimization of the KNN model, the proposed method has an accuracy of 98.9% when taken as an entire. The suggested method was useful in accurately classifying diabetic disease, as evidenced by the fact that it achieved a higher level of accuracy than the contemporary approaches.
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-024-72792-3