Development and Validation of an Interpretable Machine Learning Model for Predicting Distant Metastasis in Tongue Squamous Cell Carcinoma: A Multicentre Study

开发和验证用于预测舌鳞状细胞癌远处转移的可解释机器学习模型:一项多中心研究

阅读:1

Abstract

INTRODUCTION AND AIMS: Distant metastasis (DM) in tongue squamous cell carcinoma is associated with poor prognosis. However, reliable tools for immediate postoperative risk prediction remain lacking. This study aimed to develop and validate an interpretable machine learning (ML) model for early risk stratification of DM. METHODS: This study included 752 patients from Sun Yat-sen Memorial Hospital as the model development cohort, while 234 patients from Nanfang Hospital and 105 patients from the HANCOCK database constituted external validation cohorts 1 and 2. Variables were selected using 3 feature selection methods and 6 ML models were developed. Model performance was evaluated using metrics including the area under the receiver operating characteristic curve and interpretability analysis was conducted using Shapley Additive Explanations. RESULTS: A total of 1091 patients were included and 7 key predictors were ultimately identified, including histological grade, lymphovascular invasion, number of regional lymph node metastases, maximum tumour diameter, depth of invasion, neutrophil-to-lymphocyte ratio and monocyte-to-lymphocyte ratio. The Elastic Net model demonstrated the best performance, with area under the curves of 0.935 (95% CI 0.882-0.988) in the internal validation cohort, 0.889 (95% CI 0.800-0.978) in external validation cohort 1 and 0.905 (95% CI 0.808-1.000) in external validation cohort 2. The median time to DM was 11.9 months. The model enabled immediate postoperative risk stratification. Shapley Additive Explanations analysis enhanced model interpretability and an online prediction platform was developed for clinical application (https://nfyy-stomatology-dept.shinyapps.io/Predict-DM-of-TSCC/). CONCLUSIONS: This study developed and validated an interpretable ML model using multicentre real-world data for immediate postoperative DM risk stratification. The model provides a reliable and clinically applicable tool to support individualised patient management. CLINICAL RELEVANCE: The model enables immediate postoperative identification of high-risk patients, providing approximately 12 months of earlier risk recognition compared with routine imaging.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。