In the analysis of complex diseases, high-dimensional profiling data is important for assessing risks and detecting biomarkers. With the increasing accessibility of cancer genomic data, the sample sizes remain limited in most studies. Hence, borrowing information from additional data sources is thus desirable to improve estimation and prediction. Transfer learning has been demonstrated to be flexible and effective in boosting modeling performance with a record in biomedical applications. In practice, outliers and even data contamination often occur. However, existing transfer learning methods often lack robustness to outliers and data contamination, issues commonly observed in real-world biomedical data. In this study, we propose a robust transfer learning approach based on the minimum γ -divergence under a generalized linear model (GLM) framework for high-dimensional data. Our method incorporates a data-driven source detection scheme that automatically identifies informative sources while mitigating the risk of negative transfer. We establish rigorous theoretical results, including consistency and high-dimensional estimation error bounds, ensuring robustness and reliable performance. A computationally efficient algorithm is developed based on proximal gradient descent to facilitate both the transfer and debiasing steps. Simulation demonstrates the superior and competitive performance of the proposed approach in selection and prediction/classification. We further validate its practical utility by analyzing data on breast cancer and glioblastoma, showcasing the method's effectiveness in real-world high-dimensional settings.
Robust Transfer Learning for High-Dimensional GLM Using γ -Divergence With Applications to Cancer Genomics.
基于Ύ³散度的高维GLM鲁棒迁移学习及其在癌症基因组学中的应用
阅读:7
作者:Xu Fuzhi, Ma Shuangge, Zhang Qingzhao, Xu Yaqing
| 期刊: | Statistics in Medicine | 影响因子: | 1.800 |
| 时间: | 2025 | 起止号: | 2025 Jul;44(15-17):e70170 |
| doi: | 10.1002/sim.70170 | 研究方向: | 肿瘤 |
特别声明
1、本文转载旨在传播信息,不代表本网站观点,亦不对其内容的真实性承担责任。
2、其他媒体、网站或个人若从本网站转载使用,必须保留本网站注明的“来源”,并自行承担包括版权在内的相关法律责任。
3、如作者不希望本文被转载,或需洽谈转载稿费等事宜,请及时与本网站联系。
4、此外,如需投稿,也可通过邮箱info@biocloudy.com与我们取得联系。
