Comparative evaluation of regression and machine-learning models for hepatocellular carcinoma risk stratification across diverse aetiologies

针对不同病因的肝细胞癌风险分层，对回归模型和机器学习模型进行比较评估

阅读：1

作者：Nahon,Pierre,Layese,Richard,Natella,Pierre-André,Parlati,Lucia,Saidi,Tounes,Ganne-Carrié,Nathalie,N'Kontchou,Gisèle,Chaffaut,Cendrine,Nault,Jean-Charles,Bamba-Funck,Jessica,Sutton,Angela,Nzinga,Clovis Lusivika,Carrat,Fabrice,Audureau,Etienne

期刊：	Jhep Reports	影响因子：	7.500
时间：	2026	起止号：	2026 Apr;8(4):101740
doi：	10.1016/j.jhepr.2026.101740

Abstract

BACKGROUND & AIMS: We aimed to develop machine learning (ML) models for hepatocellular carcinoma (HCC) risk stratification in patients with cirrhosis and to test their ability to identify those with an annual HCC incidence >3%, for whom more intensive surveillance may be justified. METHODS: Data from three prospective cohorts (ANRS CO12 CirVir, CO22 Hepather, APHP CIRRAL) were analyzed. All patients underwent semiannual ultrasound surveillance and were randomly split into training and validation sets. HCC incidence was evaluated using a competing risk framework. A single tree (ST) model was developed using conditional decision trees, while random forest (RF) models were built by aggregating 1,000 trees. A deep neural network (DNN)-based survival model was also applied. ML model performance was compared with established regression-based scores: aMAP (age-male-ALBI-platelets) and FASTRAK (FAST-MRI for HCC suRveillance in pAtients with high risK of liver cancer). RESULTS: Among 4,867 patients with non-viral cirrhosis or resolved/controlled viral cirrhosis, 294 (9.2%) developed HCC over a median follow-up of 61.5 months (annual incidence: 1.99%). The ST model identified four key predictors, generating five distinct risk groups. These included patients with mildly impaired liver function or those with elevated GGT and low platelet counts. The RF and DNN approaches confirmed ST findings and delineated complex interactions among predictors. Performance metrics (C-index, Brier score, decision curve analysis) showed no significant advantage of ML models over aMAP and FASTRAK. Calibration was consistent across models. ML models identified higher proportions of patients with an annual HCC incidence >3% (ST 44%; DNN 37%; RF 30%) compared with aMAP (36%) and FASTRAK (29%). CONCLUSIONS: ML-based algorithms did not outperform traditional risk scores but provided novel insights into variable interactions and helped identify clinically relevant patient subgroups with differing HCC risk profiles. IMPACT AND IMPLICATIONS: Accurate stratification of hepatocellular carcinoma risk in cirrhosis is essential to optimize surveillance strategies, and this study provides a scientific rationale for exploring machine learning approaches to capture complex, non-linear interactions among clinical variables beyond traditional regression models. Although machine learning did not improve predictive performance over established scores, it revealed clinically meaningful risk subgroups defined by liver function, platelet count, and GGT, underscoring its value as an interpretative and hypothesis-generating tool. These results are particularly relevant for hepatologists and clinical researchers seeking to refine risk-adapted surveillance and to inform the design of future models or trials.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。

肿瘤免疫

炎症

T细胞

凋亡

线粒体

转录调控

巨噬细胞

自噬

传染病

氧化应激

肠道菌群

血管生成

磷酸化

囊泡

单细胞

3D/类器官

中性粒细胞

外泌体

药物研究

DNA甲基化

细胞衰老

miRNA

铁死亡

缺氧低氧

乙酰化

泛素化

组蛋白修饰

炎性小体

树突状细胞

代谢重编程

肿瘤微环境

焦亡

lncRNA

m6A/m5C/m7G

空间多组学

细胞基因治疗

内质网应激

相分离

治疗耐药

Treg

免疫代谢

上皮间质转化

染色质重塑

脂质过氧化

蛋白质稳态

铁代谢

脂代谢

cGAS-STING

肠脑轴

细胞极性

乳酸化

氨基酸代谢

碱基编辑

蛋白降解

circRNA

翻译调控

肿瘤异质性

piRNA

低氧缺氧

NK 细胞

MDSC

氧化脂质

溶酶体功能

NETosis

RNA 编辑

细胞干性

CAR-NK

琥珀酰化

冷应激

Tfh

器官芯片

巴豆酰化

表观遗传记忆

空间代谢组

铜死亡

器官纤维化

线粒体未折叠蛋白反应

程序性坏死

自噬流

肠肝轴

MAIT 细胞

丙酰化