Traditional Cox regression outperforms large language models in predicting long-term progression of intermediate to advanced hepatocellular carcinoma

在预测中晚期肝细胞癌的长期进展方面,传统的Cox回归优于大型语言模型。

阅读:3

Abstract

OBJECTIVE: This study aimed to evaluate and compare the performance of large language models (LLMs) and traditional Cox regression models in predicting the long-term progression risk in patients with intermediate to advanced hepatocellular carcinoma (HCC). METHODS: A total of 576 patients with intermediate to advanced HCC were included, comprising a training cohort (n = 403) and a validation cohort (n = 173) for model development and validation. We evaluated the predictive performance of LLMs (DeepSeek R1, DeepSeek V3, and Qwen/QWQ-32B) and the traditional Cox regression model for estimating the progression risk of HCC at 12, 24, and 36 months. Time-dependent area under the curve (AUC), decision curve analysis, calibration curve, net reclassification improvement, and integrated discrimination improvement were used to comprehensively assess model performance. RESULTS: Based on transarterial chemoembolization combined with targeted therapy, the addition of immune checkpoint inhibitors (ICIs) and/or ablation prolonged the progression-free survival (PFS): all four treatments combined showed optimal outcome (median PFS = 12.3 months, 95%CI = 9.9-14.1). Univariate and multivariate Cox analyses identified independent prognostic factors, which were utilized to develop a progression risk nomogram. The model had good discrimination, with training cohort AUCs (at 12, 24, and 36 months) of 0.72 (95%CI = 0.67-0.78), 0.77 (95%CI = 0.69-0.86), and 0.96 (95%CI = 0.93-0.99), respectively, and validation cohort AUCs of 0.75 (95%CI = 0.67-0.83), 0.81 (95%CI = 0.71-0.91), and 0.97 (95%CI = 0.94-1.0), respectively. Three LLMs were evaluated on the same dataset. Except for DeepSeek R1 at 12 and 24 months (training cohort), all LLMs underperformed the Cox model across time points, indicating current limitations in predicting long-term progression risk. CONCLUSION: The combination of ablation and/or ICIs with standard treatment could prolong PFS. In predicting the long-term HCC progression risk, the traditional Cox model exceeded the LLMs. Their combination may merge structured modeling stability with the multi-source data processing capacity of LLMs, potentially improving prediction accuracy.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。