Improving Hepatitis B outcome prediction with ensemble machine learning: A study on predictive models and interpretability

利用集成机器学习改进乙型肝炎预后预测:预测模型及其可解释性研究

阅读:1

Abstract

OBJECTIVE: Hepatitis B virus (HBV) is a significant global health threat, responsible for severe liver diseases such as liver failure, cirrhosis, and hepatocellular carcinoma. The burden is especially high in low-income regions, where early diagnosis and treatment are critical for mitigating its impact. This study investigates the effectiveness of various machine learning (ML) techniques in predicting patient outcomes in HBV infection. METHODS: The Chi-squared test was used for feature selection to find the most important factors, which were later applied to train and evaluate various ML models. To address the class imbalance in the dataset, the Synthetic Minority Over-sampling Technique (SMOTE) was used to balance the data. SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) were used to improve the models' interpretability. RESULTS: Among individual models, Support Vector Machine (SVM) and Logistic Regression (LR) each achieved an accuracy of 92.5%. By implementing a Voting Classifier that combined SVM and LR, the overall accuracy was improved to 95%. The results showed that higher levels of some risk factors, especially in older patients, greatly raise the risk of death. CONCLUSION: These insights provide healthcare professionals and policymakers with valuable information to develop predicting better patient outcomes in HBV infection and patient care strategies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。