Establishment and verification of the model in diagnosis of thalassemia trait based on red blood cell parameters: A two-center retrospective study

Abstract

OBJECTIVES: Thalassemia trait (TT) screening in resource-limited settings is hampered by reliance on expensive and complex tests. This study aimed to develop and validate a highly accessible machine learning-based tool that uses only routine blood parameters to differentiate TT from non-TT, and to distinguish between the major TT subtypes.

METHODS: This retrospective study included 987 individuals (221 α-TT, 211 β-TT, and 555 non-TT) from two medical centers. Seven machine learning methods (Logistic Regression, Gaussian Naive Bayes, Decision Tree, Random Forest, Multilayer Perceptron, XGBoost, and CatBoost) were used to develop diagnostic models, which were evaluated using accuracy, sensitivity, specificity, AUC, PPV, NPV, and F1 score.

RESULTS: The CatBoost model performed best at differentiating TT from non-TT, achieving an AUC of 0.976, an accuracy of 0.940, and a specificity of 0.981. It also outperformed the other models in distinguishing α-TT from β-TT (AUC = 0.842). This model was deployed as a user-friendly WeChat mini-program, AI Lab, for real-world clinical application.

CONCLUSION: The deployed ML-based AI Lab is a robust, interpretable, and scalable tool that may improve the efficiency and accessibility of TT screening, particularly in underserved healthcare environments.
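For readers less familiar with the evaluation metrics named in the abstract, the following is a minimal sketch of how accuracy, sensitivity, specificity, PPV, NPV, and F1 score follow from a binary confusion matrix (TT as the positive class). The function name `binary_metrics` and the counts in the usage example are hypothetical illustrations, not data or code from the study. AUC is omitted because it requires per-sample predicted scores rather than counts.

```python
from dataclasses import dataclass


@dataclass
class BinaryMetrics:
    accuracy: float
    sensitivity: float
    specificity: float
    ppv: float
    npv: float
    f1: float


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> BinaryMetrics:
    """Compute standard screening metrics from confusion-matrix counts.

    tp/fn: TT cases classified as TT / as non-TT.
    tn/fp: non-TT cases classified as non-TT / as TT.
    """

    def ratio(num: int, den: int) -> float:
        # Return NaN rather than raising when a denominator is empty.
        return num / den if den else float("nan")

    return BinaryMetrics(
        accuracy=ratio(tp + tn, tp + fp + tn + fn),
        sensitivity=ratio(tp, tp + fn),   # true positive rate (recall)
        specificity=ratio(tn, tn + fp),   # true negative rate
        ppv=ratio(tp, tp + fp),           # positive predictive value (precision)
        npv=ratio(tn, tn + fn),           # negative predictive value
        f1=ratio(2 * tp, 2 * tp + fp + fn),
    )


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the arithmetic.
    m = binary_metrics(tp=40, fp=5, tn=95, fn=10)
    print(m)
```

Unlike sensitivity and specificity, PPV and NPV depend on the proportion of TT cases in the evaluated sample, so values measured on one cohort need not carry over to a population with a different prevalence.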
