The Computer-Assisted Sequence Annotation (CASA) workflow for enzyme discovery

Abstract

PREMISE: With the advent of inexpensive nucleic acid sequencing and automated annotation at the level of basic functionality, the central problem of enzyme discovery is no longer finding active sequences; it is determining which ones are suitable for further study. This requires annotation that goes beyond sequence similarity to known enzymes and provides information at both the sequence and structural levels.

METHODS: Here we introduce a workflow for generating highly informative, richly annotated sequence alignments from protein sequence data. Computer-Assisted Sequence Annotation (CASA) is a freely available Python-based workflow designed to automate portions of novel protein characterization while producing a human-interpretable final output.

RESULTS: We demonstrate CASA using one enzyme from the Drosera capensis genome. The workflow generates detailed annotations that provide comparisons to known reference sequences. In addition to sequence similarity and predicted function, user-specified features such as active site residues, disulfide bonds, and substrate-binding residues can be displayed, and these can then be combined with downstream analyses to gain new insights into enzyme structure and function.

DISCUSSION: This work demonstrates the utility of detailed annotations and protein structure prediction for choosing protein targets for biochemistry or structural biology from nucleic acid sequence data. The toolchain is freely available, along with instructions and representative examples.
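To make the idea of displaying user-specified features against a reference concrete, the following is a minimal sketch of how residue positions annotated on a reference sequence could be carried over to a new sequence through a gapped pairwise alignment. It is not taken from CASA: the function name `map_features`, the feature dictionary layout, and the toy sequences are all assumptions made for illustration.

```python
def map_features(aligned_ref, aligned_query, features):
    """Map 1-based reference positions onto a query via a gapped alignment.

    Hypothetical helper for illustration only; not part of CASA.
    Returns {feature: [(ref_pos, query_pos, query_residue), ...]}, with
    query_pos None and residue "-" where the query has a gap.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")

    ref_to_col = {}    # ungapped reference position -> alignment column
    col_to_query = {}  # alignment column -> (ungapped query position, residue)
    ref_pos = query_pos = 0
    for col, (r, q) in enumerate(zip(aligned_ref, aligned_query)):
        if r != "-":
            ref_pos += 1
            ref_to_col[ref_pos] = col
        if q != "-":
            query_pos += 1
            col_to_query[col] = (query_pos, q)

    mapped = {}
    for name, positions in features.items():
        hits = []
        for p in positions:
            if p not in ref_to_col:
                raise ValueError(f"reference position {p} is out of range")
            q_pos, q_res = col_to_query.get(ref_to_col[p], (None, "-"))
            hits.append((p, q_pos, q_res))
        mapped[name] = hits
    return mapped


# Toy alignment and feature positions (made up, not real enzyme data).
aligned_ref = "MKC-DHSA"
aligned_query = "M-CGDHTA"
features = {"active_site": [4, 5], "disulfide": [3], "substrate_binding": [2]}

for name, hits in map_features(aligned_ref, aligned_query, features).items():
    print(name, hits)
```

In this toy case the two active-site positions and the cysteine map to residues in the query, while the substrate-binding position falls on a gap. That is the kind of difference from a reference that an annotated alignment is meant to make visible when choosing targets.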
