Abstract
BACKGROUND: The CRISPR-Cas9 system has emerged as one of the most promising gene-editing technologies in biology. However, off-target effects remain a significant challenge. While recent advances in deep learning have led to the development of models for off-target prediction, these models often fail to fully leverage sequence pair information. Furthermore, as the models' parameter sizes increase, so do their complexities, limiting their practical applicability. METHODS: In this study, we introduce a novel multi-feature independent encoding method, which encodes the gRNA-DNA sequence pair into three distinct feature matrices to minimize information loss. Additionally, we propose a lightweight hybrid deep learning framework, CRISPR-MFH, that integrates multi-scale separable convolutions and hybrid attention mechanisms for efficient and accurate off-target prediction. RESULTS: Extensive experiments across multiple benchmark datasets demonstrate that the proposed encoding method effectively captures critical features and that CRISPR-MFH outperforms or matches state-of-the-art models with significantly fewer parameters across multiple evaluation metrics. CONCLUSIONS: This study offers a novel perspective for advancing deep learning technology in the realm of CRISPR-Cas9 off-target detection.