Systematic evaluation of machine learning models for clinical risk prediction on real-world hospital datasets

基于真实医院数据集的临床风险预测机器学习模型的系统性评估

阅读:1

Abstract

The application of machine learning in clinical medicine requires systematic evaluation across diverse modeling paradigms. We benchmarked 10 models, including classic machine learning, tabular deep learning, and automated machine learning (AutoML), across eight real-world clinical risk prediction datasets. Using a 10-time repeated 5-fold cross-validation protocol, we assessed discrimination, calibration, and clinical utility. Gradient boosting decision trees, particularly CatBoost, and the tabular foundation model TabPFN consistently demonstrated superior robustness, forming the top tier for performance. AutoGluon also exhibited strong competitiveness. In contrast, most other tabular deep learning models displayed significant instability. These findings indicate that advanced gradient boosting models and TabPFN represent premier strategies for building high-performance clinical risk prediction models, while AutoML offers a reliable alternative. This study provides crucial empirical guidance for clinicians and data scientists in selecting appropriate modeling strategies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。