Abstract
MOTIVATION: The spatial organization of chromatin is fundamental to gene regulation and essential for proper cellular function. The Hi-C technique remains the leading method for unraveling 3D genome structures, but the limited availability of high-resolution (HR) Hi-C data poses significant challenges for comprehensive analysis. Deep learning models have been developed to predict HR Hi-C data from low-resolution counterparts. Early Convolutional Neural Network (CNN)-based models improved resolution but struggled with issues like blurring and capturing fine details. In contrast, Generative Adversarial Network (GAN)-based methods encountered difficulties in maintaining diversity and generalization. Additionally, most existing algorithms perform poorly in cross-cell line generalization, where a model trained on one cell type is used to enhance HR data in another cell type. RESULTS: In this work, we propose Dilated Cascading Residual Network (DiCARN) to overcome these challenges and improve Hi-C data resolution. DiCARN leverages dilated convolutions and cascading residuals to capture a broader context while preserving fine-grained genomic interactions. Additionally, we incorporate DNase-seq data into our model, providing a robust framework that demonstrates superior generalizability across cell lines in HR Hi-C data reconstruction. AVAILABILITY AND IMPLEMENTATION: DiCARN is publicly available at https://github.com/OluwadareLab/DiCARN.