Sampling informational properties of codon usage through the tree of life

通过生命之树对密码子使用情况的信息特性进行采样

阅读:1

Abstract

The genetic code, a unifying principle in biology, ensures that all organisms, stemming from a Last Universal Common Ancestor (LUCA), share fundamental rules for translating DNA into proteins. However, codon usage varies across the tree of life, influenced not only by GC-content and proteome composition but also by complex, often less understood rules dependent on each species' evolutionary trajectory. To better understand these rules, we segregated codons into their functional parts and applied Shannon's information-theoretic measures to 1,434 species from eight diverse taxonomic groups. We provide robust evidence that the first codon base plays a central role in amino acid determination, while the third base serves an accessory function. Using conditional entropy measures, we rigorously quantified this relationship, universally confirming the greater informational variability of the third base across all sampled species for the first time at this scale. Our analysis revealed significant heterogeneity in coding strategies across different taxonomic groups. Notably, the unique variability observed in Archaea, in contrast to the more constrained patterns in Eukaryotes and Bacteria, underscores the profound influence of evolutionary pressures and distinct life histories on genetic information processing. The identification of outlier species, exhibiting distinct informational profiles, highlights specific instances where unusual lifestyles or ecological niches may have driven unique adaptations in codon usage and underlying informational dependencies. These informational patterns offer a complementary perspective to traditional phylogenetic analyses, further revealing a hierarchical organization of informational dependencies among codon components that sheds light on the intricate grammar of genetic information. We also rigorously investigated the relationship between GC-content and our informational measures, concluding that these entropy measures provide valuable insights that cannot be obtained from GC-content alone. This work not only offers a novel framework for quantifying informational properties of codon usage but also reveals previously unappreciated aspects of how genetic information is encoded and processed across life's domains.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。