Determinants of childhood diarrhea in low- and middle-income countries: a comparative analysis of epidemiological and machine learning approaches

低收入和中等收入国家儿童腹泻的决定因素:流行病学和机器学习方法的比较分析

阅读:1

Abstract

BACKGROUND: Diarrheal disease remains a major cause of under-five mortality in low- and middle-income countries (LMICs). This study investigates the diarrhea determinants, employing a novel dual approach comparing classical epidemiology and machine learning (ML) to determine the most important predictors and to optimize intervention targeting in Nigeria. METHODS: A cross-sectional analysis of 33,924 children aged < 5 years using 2018 Nigeria Demographic and Health Survey (NDHS) was used. Traditional logistic regression was contrasted to ML models (Random Forest, Gradient Boosting Machines, Decision Trees) to consider non-linear associations and variable importance, while using model parameters to assess model performance. RESULTS: The prevalence of diarrhea was 11.98%, with significant disparities between regions. Child's age was the strongest predictor across all models, with a significant odd seen among children aged 6-23 months (AOR = 2.48-2.54). Increased maternal education was protective (AOR = 0.77-0.79), while exposure to the media had multicomponent associations. Urban-rural wealth index was a robust strong socioeconomic predictor. Logistic regression had the best predictive performance (AUC = 0.727), closely followed by that of Gradient Boosting (AUC = 0.718). Sensitivity analysis showed that MICE generated more accurate estimates than complete case analysis because missingness was not random. CONCLUSION: The study stresses the significance of core modifiable determinants such as maternal education and contextual wealth. Methodological triangulation illustrates the complementarity of classic regression for inference and machine learning for feature discovery. These findings justify the imposition of hyper-localized, multi-sectoral interventions on high-risk age groups and areas based on sound data analysis to optimize public health resources. This mixed approach provides a scalable model for disease burden measurement in LMICs.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。