Machine Learning Models Using General and Tissue-Specific Feature Extractors for Accurate Subtyping of Biopsy Samples: Advancing Lung Cancer Diagnosis in Latin America

利用通用和组织特异性特征提取器的机器学习模型对活检样本进行精确亚型分类:推进拉丁美洲肺癌诊断

阅读:3

Abstract

INTRODUCTION: Lung cancer is the leading cause of cancer-related deaths worldwide, with accurate histologic subtype classification critical for diagnosis and treatment planning. Diagnostic variability and resource disparities, particularly in underrepresented regions such as Latin America, pose substantial challenges. This study developed and evaluated novel artificial intelligence models trained on both global and Latin American pathology samples for subtype classification of hematoxylin and eosin (HE)-stained whole-slide images (WSIs). METHODS: Two DinoV2-based feature extractors, LungDino and OncoDino, trained on large data sets for task-specific and general pathology applications, were developed. The training data set consisted of 1308 HE-stained WSIs, including 412 adenocarcinomas, 323 squamous cell carcinomas, 41 small cell carcinomas, and 532 benign tissue samples, sourced from The Cancer Genome Atlas and an in-house Latin American data set. A ResNet model trained on ImageNet served as the baseline. Models were evaluated on 79 Latin American WSIs using receiver operating characteristic curves, and heatmaps were generated for tumor localization. RESULTS: The DinoV2-based models outperformed the ResNet baseline. LungDino achieved the highest overall performance, with area under the curves of 0.97 for adenocarcinoma and 0.96 for squamous cell carcinoma. OncoDino excelled in underrepresented categories, achieving an area under the curve of 0.99 for small cell carcinoma, demonstrating its generalizability. Both models generated interpretable heatmaps, with LungDino demonstrating precise tumor localization. In the subset of samples classified as poorly differentiated or undifferentiated in HE pathology reports, the DinoV2 models also maintained high classification performance. CONCLUSION: These findings underscore the effectiveness of task-specific and general feature extractors in delivering accurate, explainable results and address a gap in artificial intelligence-driven histopathology advancements, paving the way for future clinical applications.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。