A machine learning model and molecular clusters of epigenetic chromatin regulators in tuberculosis based on bioinformatics and clinical samples

基于生物信息学和临床样本的结核病表观遗传染色质调控因子机器学习模型及分子簇分析

阅读:2
作者:Huawei He,Liuying Wei,Lanwei Nong,Beibei Gong,Chaoyan Xu,Qingdong Zhu

Abstract

The role of chromatin regulators (CRs) in mediating epigenetic changes during tuberculosis (TB) infection remains poorly understood. This study aimed to determine the efficacy of CRs in diagnosing TB and characterizing its heterogeneity. GSE83456 dataset was analyzed to identify differentially expressed CRs (DE-CRs) and immune cell infiltration in patients with TB. Consensus clustering was used to classify patients with TB based on DE-CR expression patterns. The optimal machine learning model was selected from four algorithms (Random Forest (RF), Support Vector Machine (SVM), Generalized Linear Model (GLM), and eXtreme Gradient Boosting (XGB)) to differentiate between the molecular clusters. Validation was performed using an external dataset (GSE152532). Blood samples were collected from healthy individuals and patients with pulmonary TB (PTB) or tuberculous meningitis (TBM). Analysis identified 15 DE-CRs, which were used to stratify patients with TB into two distinct molecular clusters exhibiting divergent immune microenvironment characteristics. The XGB model exhibited superior performance in distinguishing these clusters (area under the receiver operating characteristic curve = 0.965). From this model, a five-gene signature (DHRS9, HIST1H2BK, C16orf74, SLC30A1, and GBP1) was identified. This signature effectively predicted TB subtypes and was significantly associated with active TB (ATB) in an external validation set. Clinically, IFIT3 expression was validated as being significantly elevated in the blood of patients with TB (including PTB and TBM) compared to healthy controls, thereby confirming its potential role as a pan-TB biomarker. Our study revealed that CRs are closely associated with immune infiltration and heterogeneity in TB. We developed a robust XGBoost model based on a five-gene signature for accurate TB subtyping and disease-status assessment. Elevated IFIT3 expression underscores the value of CRs as novel biomarkers for TB diagnosis.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。