Cancer detection via one-shot learning: integrating gene expression and genomic mutation analysis

基于单样本学习的癌症检测:整合基因表达和基因组突变分析

阅读:1

Abstract

BACKGROUND: Cancer is a complex disease influenced by numerous concurrent genetic factors that result in diverse tumor microenvironments (TMEs) across different cancer types. Large-scale genomic projects, such as The Cancer Genome Atlas, have underscored the need for molecular classification of cancer to enable more precise therapeutic strategies. Yet, traditional machine learning (ML) approaches currently face several limitations. First, while effective, they predominantly rely on gene expression data and often overlook critical genomic alterations such as copy number alterations, single nucleotide polymorphisms, and other mutational profiles, limiting the scope of biomarker discovery. Most importantly, they are usually limited by the need of large sample sizes. RESULTS: Building on the hypothesis that type-agnostic representations integrating gene expression with genomic mutations can comprehensively characterize TMEs and capture the similarity or dissimilarity between samples of the same or different types, we propose a novel ML-based method for cancer detection using a one-shot learning framework implemented through Siamese Neural Networks. Our method redefines cancer detection as a similarity-based classification task, allowing the model to generalize to unseen cancer types, a critical advantage in genomics where data scarcity and frequent updates pose significant challenges. To enhance interpretability, we introduce a robust explainability technique founded on SHapley Additive exPlanations (SHAP) values, to provide clear insights into the contributions of gene expression and mutational data, enabling a deeper understanding of the key factors driving cancer detection decisions. CONCLUSIONS: Our experimental results show that integrating mutational profiles with gene expression data allows for more accurate cancer type detection and reveals significant mutation patterns. These findings indicate that the proposed method has the potential to significantly enhance cancer type detection by leveraging a more comprehensive understanding of TMEs. Beyond merely classifying cancer types, the proposed SHAP-based explainability technique enables the identification and the analysis of key biomarkers relevant for immunotherapy success, thereby addressing limitations of existing approaches.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。