Data tells the truth: A Knowledge distillation method for genomic survival analysis by handling censoring

数据揭示真相:一种通过处理删失数据进行基因组生存分析的知识蒸馏方法

阅读:3

Abstract

Survival analysis is a critical tool for cancer research, yet handling censored data remains challenging due to supervision bias and inaccurate hazard estimates. To address these issues, we propose a simple but effective method termed KD, which employs knowledge distillation using uncensored data to rectify the supervision bias in censored data. This approach leverages the combined power of both rectified censored data and uncensored data to improve survival prediction accuracy. Remarkably, our KD method not only effectively harnesses censored data but also better reflects clinical reality, demonstrating its immense value in survival analysis. We applied our KD method to 19 target cancer sites using The Cancer Genome Atlas (TCGA) dataset. Our results consistently outperform traditional machine learning and deep learning-based methods across both target cancer sites and independent cancer cohorts. More importantly, our data-driven approach enables the model to extract hidden information from censored data, leading to conclusions that align more closely with clinical knowledge and scenarios. This validation of our KD method's effectiveness highlights the substantial value of rational censored data usage, providing valuable insights for cancer research and clinical decisions. All data and codes are freely available at: https://datatellstruth.github.io/.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。