Development and Validation of a Machine Learning Prognostic Model of m5C Related immune Genes in Lung Adenocarcinoma

肺腺癌中m5C相关免疫基因的机器学习预后模型的开发与验证

阅读:1

Abstract

BACKGROUND: The aim of this retrospective research was to develop an immune-related genes significantly associated with m5C methylation methylation (m5C-IRGs)-related signature associated with lung adenocarainoma (LUAD). METHODS: We introduced transcriptome data to screen out m5C-IRGs in The Cancer Genome Atlas (TCGA)-LUAD dataset. Subsequently, the m5C-IRGs associated with survival were certificated by Kaplan Meier (K-M) analysis. The univariate Cox, least absolute shrinkage and selection operator (LASSO) regression, and xgboost.surv tool were adopted to build a LUAD prognostic signature. We further conducted gene functional enrichment, immune microenvironment and immunotherapy analysis between 2 risk subgroups. Finally, we verified m5C-IRGs-related prognostic gene expression in transcription level. RESULTS: A total of 76 m5C-IRGs were identified in TCGA-LUAD dataset. Furthermore, 27 m5C-IRGs associated with survival were retained. Then, a m5C-IRGs prognostic signature was build based on the 3 prognostic genes (HLA-DMB, PPIA, and GPI). Independent prognostic analysis suggested that stage and RiskScore could be used as independent prognostic factors. We found that 4104 differentially expressed genes (DEGs) between the 2 risk subgroups were mainly concerned in immune receptor pathways. We found certain distinction in LUAD immune microenvironment between the 2 risk subgroups. Then, immunotherapy analysis and chemotherapeutic drug sensitivity results indicated that the m5C-IRGs-related gene signature might be applied as a therapy predictor. Finally, we found significant higher expression of PPIA and GPI in LUAD group compared to the normal group. CONCLUSIONS: The prognostic signature comprised of HLA-DMB, PPIA, and GPI based on m5C-IRGs was established, which might provide theoretical basis and reference value for the research of LUAD. PUBLIC DATASETS ANALYZED IN THE STUDY: TCGA-LUAD dataset was collected from the TCGA (https://portal.gdc.cancer.gov/) database, GSE31210 (validation set) was retrieved from GEO (https://www.ncbi.nlm.nih.gov/geo/) database.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。