Abstract
Asthma is a heterogeneous disease with variable presentation and characteristics. There is a critical need to identify underlying molecular endotypes of asthma. We performed the largest transcriptomic analysis of 808 bronchial epithelial cell samples across 11 independent cohorts, including 3 cohorts from the Severe Asthma Research Program. Using seven datasets (218 patients with asthma, 148 healthy control subjects) as discovery cohorts, we identified 505 differentially expressed genes, which we validated in the remaining four datasets. Unsupervised clustering using the 505 differentially expressed genes identified four reproducible clusters of patients with asthma across all datasets, corresponding to healthy control subjects, patients with mild/moderate asthma, and patients with severe asthma with significant differences in several clinical markers of severity, including pulmonary function, Type 2 inflammation, fractional exhaled nitric oxide, and maximum bronchodilator reversibility. Importantly, we found the same clusters in pediatric patients using nasal lavage fluid cells, demonstrating the gene signature and clusters are not confounded by age and are conserved in both lower and upper airways. The four asthma clusters may represent a unifying framework for understanding the molecular heterogeneity of asthma. Further study could potentially enable a precision medicine approach of matching therapies with patients with asthma most likely to benefit.