A comprehensive dataset of homologous and non-homologous isofunctional enzymes across the tree of life

涵盖生命之树中同源和非同源同工酶的综合数据集

阅读:1

Abstract

BACKGROUND: Convergent evolution, the independent emergence of similar traits, is increasingly recognized as a pervasive force shaping molecular and metabolic diversity. A striking manifestation of convergence at the molecular level is represented by non-homologous isofunctional enzymes (NISE), distinct proteins with no detectable common ancestry that catalyze identical biochemical reactions. Despite their conceptual and practical relevance, NISE are often treated as exceptional cases, and no large-scale, systematically curated resource has been available to explore their distribution and properties across all domains of life. DATA DESCRIPTION: Here we present a curated dataset of homologous and non-homologous isofunctional enzymes (HISE and NISE) derived from UniProtKB release 2025_01, encompassing both reviewed (Swiss-Prot) and unreviewed (TrEMBL) entries. Using Enzyme Commission (EC) numbers to define catalytic equivalence and SUPERFAMILY (SCOP structure superfamily) annotations to infer evolutionary relationships, we implemented a transparent and reproducible pipeline to classify enzymes into homologous and non-homologous functional groups. The dataset comprises over 200,000 Swiss-Prot and 27 million TrEMBL enzymes with complete EC and SUPERFAMILY annotations, organized by domain of life, enzyme class, and structural domain composition. Multiple output files, including presence/absence matrices, clustered enzyme groups, phyloprofiles, and full annotation tables, are provided to facilitate downstream evolutionary, functional, and comparative analyses. This resource offers a global view of molecular convergence and divergence in enzymatic functions, highlighting the widespread nature of NISE across taxa and enzyme classes. It provides a foundation for studying metabolic evolution, functional redundancy, drug target discovery, and the evolutionary constraints shaping biochemical solutions.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。