Deconvolving bulk proteomics data via deep adaptation network

利用深度适应网络对大量蛋白质组学数据进行反卷积

阅读:1

Abstract

Bulk proteomic assays measure average protein abundances across heterogeneous tissues, while deconvolution algorithms attempt to estimate the proportions of distinct cell types from such data. However, existing proteomics deconvolution methods often underperform when the same cell type has different proportion distributions in training and target data. To bridge this gap, we propose Deconv-DAN, a novel machine learning framework for bulk proteomic deconvolution. Deconv-DAN begins by generating a diverse set of pseudo-bulk training samples where it draws cell type proportions from a heavy-tailed distribution, guided by a k-nearest neighbors classifier which estimates the most abundant cell type in each target sample. Next, a Deep Adaptation Network [1] combining prediction loss and domain adaptation loss is trained. We evaluate Deconv-DAN on both simulated data (specifically designed to mimic mismatched proportion distributions) and real bulk proteomics datasets. Across both settings, Deconv-DAN yields superior deconvolution accuracy compared to existing methods. We further demonstrate its adaptability by applying it to DNA methylation deconvolution, where it attains competitive performance. Deconv-DAN therefore offers a solution for bulk deconvolution in proteomics and beyond, particularly in scenarios with mismatched training and target cell composition distributions. We expect Deconv-DAN to be useful in revealing heterogeneity of different tissues and tumor microenvironment. REFERENCES: [1] Long, M, et al. ‘Learning transferable features with deep adaptation networks.’ In: Proceedings of the 32nd International Conference on Machine Learning. 2015, 97–105.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。