Understanding the regulatory grammar of sepsis-causing Staphylococcus aureus bacteria using contexualised DNA language models

利用情境化DNA语言模型理解引起败血症的金黄色葡萄球菌的调控语法

阅读:1

Abstract

To efficiently and accurately understand, detect, and treat sepsis, we require a deep understanding of how its underlying biological systems work. Conventional methods struggle to provide a detailed view of both the functional and regulatory elements orchestrating a system-wide response to pathogenic infection. Our approach was to use a streamlined process: First, we used multi-omics integration to create a functional-omics signature at the single molecule level, employing sparse Partial Least Squares Discriminant Analysis (sPLSDA). Then, we turned to genome Natural Language Processing (genomeNLP) to identify correlated features and discover regulatory motifs in bacterial promoters. This novel method utilised specialised language models trained on DNA sequences. Our approach is distinct from traditional methods as we did not rely on predefined biomarkers and motifs or focus solely on short-range context. As a result, we (a) pinpointed multi-omics signatures with single molecule-level precision and (b) uncovered regulatory patterns for both novel and established regulatory motifs governing the omics signature. Ultimately, we presented a comprehensive systems-level view of five sepsis-causing Staphylococcus aureus bacterial strains, revealing hierarchical gene regulation in genome structure, ncRNA control, metabolism, and antibiotic resistance. We additionally note that our approach is agnostic to organism type and can be applied outside of a sepsis system.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。