Prediction of Tuberculosis Risk in the Elderly Population of Eastern China: Development and Validation of Multiple Machine Learning Models

中国东部老年人群结核病风险预测:多种机器学习模型的开发与验证

阅读:1

Abstract

BACKGROUND: Tuberculosis (TB) remains a significant public health burden among older adults, yet predictive tools for this population are limited. This study aimed to develop and validate machine learning models to predict TB risk among older adults in Eastern China. METHODS: A prospective cohort of 33,935 participants aged ≥60 years was followed for over 8 years. TB diagnosis was confirmed through linkage with the national TB surveillance system. LassoCox regression was used to identify key predictors of TB risk. Four machine learning models-CoxBoost, Generalized Boosted Models (GBM), LassoCox, and Random Survival Forests (RSF)-were developed and compared. Model performance was evaluated using time-dependent area under the receiver operating characteristic curve (AUC), Brier score, and concordance index. RESULTS: During follow-up, 387 participants developed TB, yielding an incidence rate of 134.5 per 100,000 person-years. The LassoCox model identified 14 predictors, including sex, alcohol consumption, dietary quality, body mass index, and C-reactive protein levels. Among the four models, the LassoCox model demonstrated the best discriminatory ability with an AUC of 0.717 (95% CI: 0.692-0.742), followed by GBM (AUC: 0.712, 95% CI: 0.687-0.737), CoxBoost (AUC: 0.708, 95% CI: 0.683-0.733), and RSF (AUC: 0.637, 95% CI: 0.611-0.663). The LassoCox model also demonstrated satisfactory calibration, with a Brier score of 0.338. Decision curve analysis confirmed clinical utility at threshold probabilities below 20%. Kaplan-Meier survival analysis showed significant differences between risk groups (log-rank P < 0.001), though survival curves revealed limited separation between low- and high-risk groups. CONCLUSION: The LassoCox model demonstrated acceptable predictive performance for TB risk in older Chinese adults. These findings suggest that machine learning-based risk prediction tools could facilitate targeted TB screening by identifying high-risk individuals in aging populations, thereby enabling more efficient allocation of screening resources and earlier intervention. However, further model refinement and external validation in diverse populations are warranted before clinical implementation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。