Abstract
BACKGROUND: Super-enhancers are critical cis-regulatory elements that play a central role in modulating gene expression and driving cellular identity. Their dysregulation is closely associated with the development of numerous major human diseases, particularly cancer. In this study, we propose DcSE, a novel deep learning framework designed for efficient and precise super-enhancer prediction. The core architecture is based on an enhanced DenseNet featuring an improved Convolutional Block Attention Module. Unlike standard serial processing, our module employs a dynamic fusion mechanism that adjusts the contributions of channel and spatial attention through learnable parameters. To further enhance robustness, DcSE adopts an ensemble learning framework utilizing cross-validation and multiple initializations. RESULTS: DcSE demonstrates exceptional performance on benchmark datasets for both human and mouse, surpassing existing state-of-the-art models across all evaluation metrics. It achieves 80.81% ACC and 87.86% AUC on the human dataset, along with 80.16% ACC and 87.04% AUC on the mouse dataset. Visual analysis through t-SNE confirms that the model learns highly separable, high-order feature representations from raw sequences. Furthermore, cross-species validation experiments prove the robust generalization capability of the framework. Motif analysis utilizing mask-based attribution methods successfully identifies species-specific key transcription factors, such as ZKSCAN3 in humans and STAT1 in mouse, providing clear biological interpretability. CONCLUSIONS: DcSE is a high-performance, robust, and interpretable computational tool. By accurately capturing key sequence features and providing biological insights into transcription factor regulation, it offers a reliable framework for super-enhancer identification and the study of genomic regulatory mechanisms.