Abstract
Understanding the relationship between protein sequences and structures is essential for accurate protein property prediction. We propose BridgeNet, a pre-trained deep learning framework that integrates sequence and structural information through a novel latent environment matrix, enabling seamless alignment of these two modalities. The model's modular architecture-comprising sequence encoding, structural encoding, and a bridge module-effectively captures complementary features without requiring explicit structural inputs during inference. Extensive evaluations on tasks such as enzyme classification, Gene Ontology annotation, coenzyme specificity prediction, and peptide toxicity prediction demonstrate its superior performance over state-of-the-art models. BridgeNet provides a scalable and robust solution, advancing protein representation learning and enabling applications in computational biology and structural bioinformatics.