Abstract
Purpose: Hepatocellular carcinoma (HCC) is recognized as one of the most aggressive cancers worldwide. This study aims to discover novel biomarkers that can effectively stratify patients at high risk for HCC and facilitate personalized management. Methods: Differentially expressed genes analysis was conducted to identify marker genes associated with HCC based on transcriptomic profiles from the TCGA-LIHC database. And then, we utilized the weighted correlation network analysis (WGCNA) to explore the correlation between various gene expression clustering modules based on the the identified differentially expressed genes above and clinical phenotypes. Subsequently, we integrated 10 machine learning algorithms into 117 combinations to establish the optimal model for predicting survival in HCC patients. Then, immune landscape was evaluated. Additionally, single-cell RNA sequencing analysis revealed the expression patterns of genes involved in model at single-cell resolution. Furthermore, the role of DTYMK was examined in vitro in two liver cancer cell lines. Results: Forty-eight G2/M checkpoint-related genes were identified as marker genes for HCC. Among these, STMN1, DTYMK, EZH2, PBK, and UCK2 were incorporated into a prognostic model (G2/MR), which demonstrated robust and reliable performance in predicting the survival rates of HCC patients. Individuals with high G2/MR scores exhibited shorter survival and higher regulatory T cells infiltration compared to those with low G2/MR scores. Silencing DTYMK arrested cell cycle and inhibited proliferation, migration, invasion, and colony formation in Huh-7 and PLC/PRF/5 cells. Conclusion: Our study developed a five-gene risk score model that may serve as a valuable tool for prognostic assessment and personalized HCC management. Supplementary Information: The online version contains supplementary material available at 10.1007/s12672-025-04054-1.
