Abstract
The role of chromatin regulators (CRs) in mediating epigenetic changes during tuberculosis (TB) infection remains poorly understood. This study aimed to determine the efficacy of CRs in diagnosing TB and characterizing its heterogeneity. GSE83456 dataset was analyzed to identify differentially expressed CRs (DE-CRs) and immune cell infiltration in patients with TB. Consensus clustering was used to classify patients with TB based on DE-CR expression patterns. The optimal machine learning model was selected from four algorithms (Random Forest (RF), Support Vector Machine (SVM), Generalized Linear Model (GLM), and eXtreme Gradient Boosting (XGB)) to differentiate between the molecular clusters. Validation was performed using an external dataset (GSE152532). Blood samples were collected from healthy individuals and patients with pulmonary TB (PTB) or tuberculous meningitis (TBM). Analysis identified 15 DE-CRs, which were used to stratify patients with TB into two distinct molecular clusters exhibiting divergent immune microenvironment characteristics. The XGB model exhibited superior performance in distinguishing these clusters (area under the receiver operating characteristic curve = 0.965). From this model, a five-gene signature (DHRS9, HIST1H2BK, C16orf74, SLC30A1, and GBP1) was identified. This signature effectively predicted TB subtypes and was significantly associated with active TB (ATB) in an external validation set. Clinically, IFIT3 expression was validated as being significantly elevated in the blood of patients with TB (including PTB and TBM) compared to healthy controls, thereby confirming its potential role as a pan-TB biomarker. Our study revealed that CRs are closely associated with immune infiltration and heterogeneity in TB. We developed a robust XGBoost model based on a five-gene signature for accurate TB subtyping and disease-status assessment. Elevated IFIT3 expression underscores the value of CRs as novel biomarkers for TB diagnosis.
