Abstract
OBJECTIVE: This research aims to detect genes associated with the extracellular matrix (ECM) in idiopathic pulmonary fibrosis (IPF) using bioinformatics techniques and investigate their relationships with immune infiltration, with the goal of identifying new diagnostic and therapeutic targets for IPF. METHODS: The study employed a combination of differential expression analysis, weighted gene co-expression network analysis (WGCNA), and various machine learning algorithms to screen for characteristic genes. Gene set enrichment analysis (GSEA), gene ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) were utilized to evaluate relevant biological functions and pathways. Additionally, the analysis of immune cell infiltration was conducted to assess the disease's immune status and the correlations between genes and immunity. RESULTS: IPF is strongly linked to pathways such as ECM organization and immune response, with differentially expressed genes primarily involving signal pathways related to collagen deposition in the extracellular matrix. A total of 1,193 ECM-related genes associated with IPF were identified, and 94 differentially expressed ECM-related genes were further screened compared to the normal control group. Through machine learning approaches, three key genes-BAAT, COMP, and CXCL13-were pinpointed. These genes are closely tied to the onset, progression, and immune processes of IPF, and clustering analysis based on them can reveal distinct disease states and changes in immune cell infiltration patterns. CONCLUSION: BAAT, COMP, and CXCL13 may serve as potential therapeutic targets for slowing the progression and preventing the exacerbation of IPF. Moreover, monocytes demonstrate consistent infiltration patterns across the disease group, control group, and various subgroups, indicating their potential significance in the development of IPF.