Abstract
Vibrio cholerae causes cholera, an acute diarrheal disease responsible for significant morbidity and mortality worldwide. Biofilms enhance the survival of vibrios in natural ecosystems and facilitate transmission during cholera outbreaks. Critical components of the biofilm matrix include the Vibrio polysaccharides produced by the vps-1 and vps-2 gene clusters and the biofilm matrix proteins encoded in the rbm gene cluster; together, these clusters comprise the biofilm matrix cluster. However, the biofilm matrix clusters of other Vibrio species, and their evolutionary patterns, remain underexplored. In this study, we systematically investigated the distribution, diversity, and evolution of biofilm matrix clusters and proteins across the Vibrio genus. Our findings reveal that these gene clusters are sporadically distributed throughout the genus, appearing even in species phylogenetically distant from V. cholerae. Evolutionary analysis of the major biofilm matrix proteins RbmC and Bap1 shows that they are related in both structure and sequence and have undergone structural domain and modular alterations. Additionally, we identified a novel loop-less Bap1 variant, found predominantly in two phylogenetically distant V. cholerae subspecies clades that share specific gene groups associated with the presence or absence of the protein. Furthermore, our analysis revealed that rbmB, a gene involved in biofilm dispersal, shares a recent common ancestor with vibriophage tail proteins, suggesting that phages may mimic host functions to evade biofilm-associated defenses. Our study offers a foundational understanding of the diversity and evolution of biofilm matrix clusters in vibrios, laying the groundwork for future biofilm engineering through genetic modification.