Enhanced prognostic signature for lung adenocarcinoma through integration of adjacent normal and tumor gene expressions

Abstract

BACKGROUND: Cancer prognosis-related signatures have traditionally been constructed based on gene expression profiles derived from tumor or normal tissues. However, the potential benefits of incorporating gene expression profiles from both tumor and normal tissues to improve signature performance have not been explored.

METHODS: In this study, we developed three prognostic models for lung adenocarcinoma (LUAD) using gene expression profiles from tumor tissues, normal tissues, and a combination (COM) of both, sourced from The Cancer Genome Atlas (TCGA). To ensure comparability, the same workflow was followed for all three models.

RESULTS: When applied to the TCGA LUAD dataset, the tumor-derived model exhibited the best overall performance, except in calibration analysis, where the normal-derived model performed better. The COM-derived model demonstrated intermediate performance. Validation on three independent test datasets revealed that the COM-derived model showed the best performance, while the normal-derived model showed the worst. In overall survival (OS) analysis, the low-risk group defined by the COM-derived model consistently exhibited longer mean survival times. The tumor-derived model did not consistently show this trend, and the normal-derived model produced opposite results. In discrimination analysis, no significant differences were observed. The COM-derived model demonstrated good discrimination ability for short periods, while the tumor-derived model performed better for longer periods. In calibration analysis, both the COM and tumor-derived models had similar absolute prediction errors, which were better than those of the normal-derived model. However, the tumor-derived model tended to underestimate survival rates. The clinical feature analysis and validation in GSE229705 indicate that the risk score (RS) from the COM model is the most clinically significant. These results demonstrate that the COM model's RS aligns more closely with clinical data, maintaining stable performance and the strongest generalizability.

CONCLUSIONS: Overall, the COM-derived model demonstrated the best generalization ability. The superior performance of the tumor-derived model in the TCGA LUAD dataset might be due to overfitting. Our results suggest that appropriate combinations of gene expression data from tumor and normal tissues can enhance the predictive power of prognostic signatures.
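The abstract does not say how the RS is computed or how the low- and high-risk groups are defined. As a rough illustration only, the sketch below assumes a common convention for such signatures: a linear risk score (sum of coefficient times expression over the signature genes) and a median cutoff. The gene names, coefficients, expression values, the `T_`/`N_` prefixes, and all function names are hypothetical and are not taken from the study.

```python
from statistics import median


def combine_features(tumor, normal):
    """Merge tumor and adjacent-normal expression for one patient.

    Gene names are prefixed by tissue (hypothetical "T_" / "N_" scheme) so
    that the two measurements of the same gene remain separate features.
    """
    combined = {f"T_{gene}": value for gene, value in tumor.items()}
    combined.update({f"N_{gene}": value for gene, value in normal.items()})
    return combined


def risk_score(expression, coefficients):
    """Assumed linear risk score: sum of coefficient * expression."""
    return sum(coefficients[g] * expression[g] for g in coefficients)


def split_by_median(scores):
    """Label patients 'high' if their score is above the cohort median."""
    cutoff = median(scores.values())
    return {pid: ("high" if s > cutoff else "low") for pid, s in scores.items()}


# Hypothetical signature using one tumor feature and one normal feature.
coefficients = {"T_GENE_A": 0.5, "N_GENE_B": -0.25}

# Hypothetical paired expression values for three patients.
patients = {
    "P1": combine_features({"GENE_A": 2.0, "GENE_B": 1.0},
                           {"GENE_A": 1.0, "GENE_B": 3.0}),
    "P2": combine_features({"GENE_A": 4.0, "GENE_B": 1.0},
                           {"GENE_A": 1.0, "GENE_B": 2.0}),
    "P3": combine_features({"GENE_A": 1.0, "GENE_B": 1.0},
                           {"GENE_A": 1.0, "GENE_B": 4.0}),
}

scores = {pid: risk_score(expr, coefficients) for pid, expr in patients.items()}
groups = split_by_median(scores)
print(scores)
print(groups)
```

In practice the coefficients would come from a fitted survival model, and the cutoff would be fixed on the training cohort before being applied to independent test datasets.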
