MOGEDN: small-sample cancer subtype classification with encoder-decoder networks for missing-omics recovery and biomarker discovery

MOGEDN:基于编码器-解码器网络的小样本癌症亚型分类,用于缺失组学数据的恢复和生物标志物的发现

阅读:1

Abstract

Effective cancer subtype classification from multi-omics data remains challenging due to incomplete omics data and limited sample sizes. While graph convolutional networks (GCNs) have been used to incorporate inter-sample relationships for enhancing small-sample classification, their performance deteriorates when a certain omics modality is entirely missing. Here, we propose MOGEDN, a novel framework for cancer subtype classification using multi-omics encoder-decoder networks designed to reconstruct the latent features of missing omics data. The reconstructed features are integrated with available omics features to enable robust prediction under small-sample and missing-omics settings. We develop a step-wise algorithm to pretrain our model with diverse cancer types then to finetune for a specific cancer type while incorporating inter-sample and cross-omics dependencies. Evaluated on TCGA cancer datasets including subtypes with fewer than 50 samples, MOGEDEN consistently outperforms state-of-the-art baselines in accuracy and F1 scores. Moreover, MOGEDN's feature analysis provides two complementary biomarker sets: biomarkers shared across diverse cancer types in the pretraining phase; and biomarkers for a specific cancer type in the finetuning phase, facilitating model interpretability, and biological findings. These results highlight decoder-based imputation as a powerful approach to enhance multi-omics learning, delivering accurate classification, robust few-shot performance, and multi-scale biomarker discovery in incomplete multi-omics cohorts.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。