Abstract
BACKGROUND: Epithelial-mesenchymal transition (EMT) plays a critical role in tumor progression; however, the underlying molecular mechanisms of EMT in papillary thyroid carcinoma (PTC) remain incompletely understood. This study aimed to investigate EMT-related mechanisms in PTC using an integrative approach combining single-cell RNA sequencing and machine learning. METHODS: Differentially expressed genes (DEGs) between PTC and normal thyroid tissues were identified, and EMT-related candidate genes were obtained by intersecting DEGs with EMT-related genes (EMT-RGs). Prognostic genes were screened using univariate Cox regression, and a risk model was constructed based on 101 machine learning algorithm combinations. Patients were stratified into high- and low-risk groups (HRG and LRG) according to risk scores, and the model was validated in an internal cohort. Additional analyses included nomogram construction, immune infiltration profiling, tumor mutational burden (TMB) assessment, drug sensitivity prediction, and molecular regulatory network analysis. Prognostic gene expression was further validated in vitro. RESULTS: Eight EMT-related prognostic genes (TYRO3, E2F1, TNFSF15, TGFBR3, PTX3, FHL2, SNAI1, and WT1) were identified. Patients in the HRG exhibited significantly poorer overall survival than those in the LRG. The nomogram showed good predictive accuracy for survival estimation. Immune infiltration analysis revealed significant differences between risk groups across six immune-related features. Splice site-related mutations were predominantly observed in the LRG but were absent in the HRG. Drug sensitivity analysis indicated higher sensitivity to BIRB.0796 in the LRG, whereas ABT-263, AG-014699, BX-795, and DMOG were more effective in the HRG. Single-cell analysis identified fibroblasts as key cell populations, with FHL2, PTX3, and TGFBR3 showing increased activity during critical differentiation stages. In vitro experiments confirmed expression patterns consistent with bioinformatics findings. CONCLUSION: This study identifies eight EMT-related prognostic genes in PTC and highlights their potential value as biomarkers for prognostic evaluation and therapeutic stratification.