Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?

Abstract

BACKGROUND: The increasing prevalence of type 2 diabetes mellitus (T2DM) in low- and middle-income countries calls for preventive public health interventions. Studies from Africa, including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.

METHOD: This was a retrospective study of the medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation, and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia), and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis). The data were analyzed using Stata version 16.0 and Python 3.6.1. Descriptive statistics were used to describe the data, and inferential statistics were used to build predictive models. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k-nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost), and logistic regression was evaluated using the best-fitting predictive model for T2DM mortality.

RESULTS: Of the 328 participants, 183 (55.79%) were female, and the mortality rate was 11.28%. Mortality was 100% among T2DM patients with sepsis (p-value = 0.012). T2DM in-patients with nephropathy had 3.83 times the odds of dying compared with T2DM in-patients without nephropathy [adjusted odds ratio (AOR) = 3.83; 95% CI: 1.53-9.61; p-value = 0.004]. The full model, which included sociodemographic characteristics, family history, lifestyle variables, and complications of T2DM, gave the best prediction of the T2DM mortality outcome (ROC = 72.97%). Test and train accuracies were, respectively, 90% and 90% for logistic regression, 100% and 100% for the decision tree classifier, 90% and 90% for the kNN classifier, 90% and 88% for SVM, and 88% and 90% for XGBoost.

CONCLUSION: All in-patients with sepsis died. Nephropathy was the significant predictor of T2DM mortality identified. The decision tree classifier showed the best classification performance.
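One way to read the reported accuracies is against a majority-class baseline: with a mortality rate of 11.28%, a classifier that always predicts "alive" is already right for most patients. The sketch below is illustrative only and is not an analysis performed by the study authors. The cohort size, mortality rate, and accuracies are taken from the abstract; the function names `majority_class_baseline` and `margin_over_baseline` are hypothetical, and the baseline assumes the test and train splits share the class balance of the full cohort, which the abstract does not report.

```python
# Sketch: compare the reported accuracies with a majority-class baseline.
# Input numbers come from the abstract; the baseline comparison itself is
# an illustrative assumption, not part of the study.

N_PATIENTS = 328         # participants reported in the abstract
MORTALITY_RATE = 0.1128  # 11.28% mortality reported in the abstract

# Reported (test, train) accuracy in percent for each classifier.
REPORTED_ACCURACY = {
    "logistic regression": (90, 90),
    "decision tree": (100, 100),
    "kNN": (90, 90),
    "SVM": (90, 88),
    "XGBoost": (88, 90),
}


def majority_class_baseline(n_patients, event_rate):
    """Return (deaths, survivors, accuracy in %) for a classifier that
    always predicts the more common outcome."""
    deaths = round(n_patients * event_rate)
    survivors = n_patients - deaths
    accuracy = 100.0 * max(deaths, survivors) / n_patients
    return deaths, survivors, accuracy


def margin_over_baseline(reported, baseline):
    """Return {model: (test margin, train margin)} in percentage points."""
    return {
        name: (test - baseline, train - baseline)
        for name, (test, train) in reported.items()
    }


if __name__ == "__main__":
    deaths, survivors, baseline = majority_class_baseline(
        N_PATIENTS, MORTALITY_RATE
    )
    print(f"deaths={deaths}, survivors={survivors}, baseline={baseline:.2f}%")
    margins = margin_over_baseline(REPORTED_ACCURACY, baseline)
    for name, (test_margin, train_margin) in margins.items():
        print(f"{name}: test {test_margin:+.2f}, train {train_margin:+.2f}")
```

Under these assumptions the baseline is about 88.72%, so accuracies of 88% to 90% sit within roughly one and a half percentage points of it, and only the decision tree classifier stands clearly above. Because the class balance of the actual splits is unknown, the comparison is approximate.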
