Validation and interpretation of machine-learning models for rapid identification of active tuberculosis infection using routine laboratory indicators

利用常规实验室指标快速识别活动性结核病感染的机器学习模型的验证与解释

阅读:1

Abstract

INTRODUCTION: Diagnosis of active Mycobacterium tuberculosis (Mtb) infection relies on clinical symptoms, imaging, and molecular testing, but these methods are often costly and slow. Consequently, there is an urgent need for a rapid and accessible diagnostic approach that can support early detection and reduce ongoing tuberculosis transmission. METHODS: A discovery cohort of 3,829 individuals and an external validation cohort of 405 individuals were included. Six supervised machine learning models were trained using routine laboratory data, and model interpretability was assessed with SHapley Additive exPlanations (SHAP). RESULTS: Among the six models, XGBoost demonstrated the best diagnostic performance in the internal cohort (accuracy 97.49%; sensitivity 97.56%; specificity 97.42%) and maintained strong performance in the external cohort (accuracy 93.67%; sensitivity 91.56%; specificity 91.13%). SHAP analysis indicated that key predictors reflected characteristic host-response patterns, including inflammation-related hypoalbuminemia, lipid metabolism suppression (HDL-C and LDL-C), altered platelet activity (MPV), and lymphocyte reduction (LYM). CONCLUSION: The study presents a high-performing and interpretable machine learning model capable of accurately identifying active Mtb infection using routine blood tests. This low-cost and non-invasive approach has strong potential for application in resource-limited and high-burden settings.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。