Multicenter Histology Image Integration and Multiscale Deep Learning Support Machine Learning-Enabled Pediatric Sarcoma Classification

多中心组织学图像融合和多尺度深度学习支持机器学习赋能的儿童肉瘤分类

阅读:1

Abstract

Pediatric sarcomas present diagnostic challenges due to their rarity and diverse subtypes, often requiring specialized pathology expertise and costly genetic tests. To overcome these barriers, we developed a computational pipeline leveraging deep learning methods to accurately classify pediatric sarcoma subtypes from digitized histology slides. To ensure classifier generalizability and minimize center-specific artifacts, a dataset comprising 867 whole-slide images (WSI) from three medical centers and the Children's Oncology Group was collected and harmonized. Multiple convolutional neural network and vision transformer (ViT) architectures were systematically evaluated as feature extractors for SAMPLER-based WSI representations, and input parameters, such as tile size combinations and resolutions, were tested and optimized. The analysis showed that advanced ViT foundation models (UNI and CONCH) significantly outperformed earlier approaches, and incorporating multiscale features enhanced classification accuracy. The optimized models achieved high performance, distinguishing rhabdomyosarcoma (RMS) from non-RMS soft-tissue sarcomas (NRSTS) with an AUC of 0.969 and differentiating RMS subtypes (alveolar vs. embryonal) with an AUC of 0.961. Additionally, a two-stage pipeline effectively identified scarce Ewing sarcoma images from other NRSTS (AUC = 0.929). Compared with conventional transformer-encoder architectures used for WSI representations, these SAMPLER-based classifiers were three orders of magnitude faster to train, despite operating entirely without a graphical processing unit. This study highlights that digital histopathology paired with rigorous image harmonization provides a powerful solution for pediatric sarcoma classification. SIGNIFICANCE: An approach pairing a multi-institutional dataset with a published pipeline extends imaging-based diagnostic capabilities and creates the potential for accurate, rapid cancer diagnoses across resource-limited and remote settings. This article is part of a special series: Driving Cancer Discoveries with Computational Research, Data Science, and Machine Learning/AI .

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。