Explore the factors related to the death of offspring under age five and appraise the hazard of child mortality using machine learning techniques in Bangladesh

Child mortality is a reliable and significant indicator of a nation's health. Although the child mortality rate in Bangladesh is declining over time, it still needs to drop even more in order to meet the Sustainable Development Goals (SDGs). Machine Learning models are one of the best tools for...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC public health 2025-01, Vol.25 (1), p.360-22, Article 360
Hauptverfasser: Rahman, Ashikur, Rahman, Md Habibur
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Child mortality is a reliable and significant indicator of a nation's health. Although the child mortality rate in Bangladesh is declining over time, it still needs to drop even more in order to meet the Sustainable Development Goals (SDGs). Machine Learning models are one of the best tools for making more accurate and efficient forecasts and gaining in-depth knowledge. A deeper understanding is crucial for significantly reducing child mortality rates. Accurate predictions using machine learning models can empower authorities to implement timely interventions and raise awareness. So, the study aimed to explore the factors related to child mortality and assess the efficacy of various machine-learning models in predicting child mortality in Bangladesh. About Forty-two thousand observations, except the missing observations, were extracted for this study from the Bangladesh Demographic and Health Survey (BDHS) data conducted in 2017-18. The survey utilized a two-stage stratified sampling method, selecting 675 enumeration areas-250 in urban settings and 425 in rural areas-resulting in effective data collection from 672 clusters and 20160 households. The Chi-square test and recursive feature elimination (RFE) are used to find the relevant risk factors of child mortality among the number of factors. Six ML-based algorithms were implemented for predicting child mortality, such as Naïve Bayes, Classification and Regression Trees, Random Forest, C5.0 Classification, Gradient Boosting Machine, and Logistic Regression. Model evaluation metrics like accuracy, specificity, sensitivity, negative predictive value, score, positive predictive value, k-fold cross-validation, and area under the curve (AUC) techniques were used to evaluate the performance of the models. The child mortality rate is 8.2%, according to the data. The bivariate analysis showed that the child mortality rate was higher among the children whose mothers were uneducated, impoverished, underweight, aged 35-49, and gave birth before age 20. Families' water sources and religious connections had no statistically significant impact on child mortality. The prediction of child mortality using machine learning models is the main objective of this study. None of the machine learning models correctly classified dead occurrences. Therefore, this study conducted over-sampling and under-sampling analysis. Approximately 76727 and 6910 observations were sampled for over-sampling and under-sampling techniques, respecti
ISSN:1471-2458
1471-2458
DOI:10.1186/s12889-025-21460-w