Abstract
Chronic Lymphocytic Leukemia (CLL) is clinically divided into IGHV mutated (M-CLL) and IGHV unmutated (U-CLL) subtypes, which are thought to arise from distinct cells of origin along the B-cell differentiation pathway. We measured genome-scale DNA methylation in purified CLL samples ( n = 89) and utilized reference-based cell deconvolution techniques to develop a continuous metric of epigenetic similarity across a B-naive-like to B-memory-like scale (B-Index). B-Index accurately classifies CLL into clinical subtypes (98.8%), has a stronger epigenetic signal than IGHV gene percent identity, and demonstrates additional epigenetic signal within the M-CLL subgroup. We demonstrate that U-CLL is epigenetically more similar to B-memory than B-naive cells and reconcile previous reports of a B-naive-like epigenetic signal. The B-memory-like program of U-CLL is enriched for binding sites of transcription factors related to the germinal center activation pathway. Our findings provide epigenetic evidence for discerning CLL mechanisms of initiation and cell of origin. We also identified an epigenetic signal associated with tumor burden, which may have some relation to viral infections such as Epstein-Barr-Virus. Our cell-type deconvolution-based approach to developing a continuous metric for CLL epigenetic differentiation state can be applied to other tumors with multiple subtypes across differentiation stages.