A computational framework integrating multi-omics and machine learning for identifying glycolytic gene markers in breast cancer

一种整合多组学和机器学习的计算框架,用于识别乳腺癌中的糖酵解基因标志物

阅读:1

Abstract

Breast cancer remains a leading cause of cancer-related mortality in women worldwide. Metabolic reprogramming, especially enhanced glycolysis, is a hallmark of breast cancer progression and offers promising targets for therapeutic intervention. However, systematic identification of glycolysis-related gene markers at single-cell resolution remains limited.We integrated single-cell RNA sequencing, spatial transcriptomics, and multiple machine learning algorithms to analyze glycolytic heterogeneity in breast cancer. A novel interquartile range (IQR)-based scoring method was developed to quantify glycolytic activity at the single-cell level. Glycolysis-related genes (GRGs) were systematically screened using five machine learning models (RF, Boruta, Lasso, ABESS, and GBM), and their diagnostic and prognostic values were evaluated.We identified significant glycolytic heterogeneity among breast cancer cell subsets, with malignant cells exhibiting the highest glycolytic activity. Seven hub genes, namely PKM, MIF, SOD1, PGK1, APP, LDHB and NUPR1, were consistently identified and validated. These genes showed strong diagnostic potential with high AUC values in ROC analysis, and their elevated expression was confirmed in breast cancer tissues via immunohistochemistry. Spatial and cell communication analyses further revealed distinct metabolic niches and intercellular signaling networks associated with high-glycolytic cells.This study establishes a robust IQR-based glycolysis assessment strategy that overcomes limitations of previous scoring methods. We identified seven core glycolytic regulatory genes that are closely linked to breast cancer progression and poor prognosis. These findings provide novel insights into metabolic reprogramming in breast cancer and offer potential biomarkers for targeted metabolic therapy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12672-026-04826-3.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。