A knowledge-driven protocol for prediction of proteins of interest with an emphasis on biosynthetic pathways

Authors: Adwait G Joshi, K Harini, Iyer Meenakshi, K Mohamed Shafi, Shaik Naseer Pasha, Jarjapu Mahita, Radha Sivarajan Sajeevan, Snehal D Karpe, Pritha Ghosh, Sathyanarayanan Nitish, A Gandhimathi, Oommen K Mathew, Subramanian Hari Prasanna, Manoharan Malini, Eshita Mutt, Mahantesha Naika, Nithin Ravooru, R

Abstract

This protocol describes a stepwise process for identifying proteins of interest in a query proteome derived from next-generation sequencing (NGS) data. We applied it to the Moringa oleifera transcriptome to identify proteins involved in secondary metabolite biosynthesis, vitamin biosynthesis, and ion transport. The protocol is knowledge-driven: it identifies proteins through an integrated approach that combines sensitive sequence searches with evolutionary relationships. We use functionally important residues (FIRs) specific to the query protein family, identified from its homologous sequences and from the literature. Protein hits are screened by whether they cluster with true homologues in a reconstructed phylogenetic tree, complemented by FIR mapping. The protein hits were validated through qRT-PCR and transcriptome quantification. The protocol showed higher specificity than other methods, particularly in distinguishing cross-family hits, and was effective in the transcriptome analysis of M. oleifera described in Pasha et al.

• Knowledge-driven protocol to identify secondary-metabolite-synthesizing proteins in a highly specific manner.
• Use of functionally important residues to screen for true hits.
• Beneficial for metabolite pathway reconstruction from any NGS data (single species or metagenomic).
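
The FIR-mapping screen mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`map_fir`, `screen_hits`), the toy sequences, the FIR positions, and the `min_fraction` threshold are all hypothetical. It assumes each candidate hit has already been aligned to a reference family member, and it does not cover the phylogenetic clustering step.

```python
from typing import Dict, List


def map_fir(aligned_ref: str, aligned_hit: str, fir: Dict[int, str]) -> Dict[int, bool]:
    """Report, for each FIR position, whether the hit carries an allowed residue.

    `fir` maps 1-based positions in the ungapped reference sequence to a
    string of residues accepted at that position (hypothetical input format).
    """
    if len(aligned_ref) != len(aligned_hit):
        raise ValueError("aligned sequences must have equal length")
    status: Dict[int, bool] = {}
    ref_pos = 0
    for ref_res, hit_res in zip(aligned_ref, aligned_hit):
        if ref_res == "-":
            continue  # insertion in the hit; no reference position to advance
        ref_pos += 1
        if ref_pos in fir:
            # A gap in the hit at an FIR column counts as not conserved.
            status[ref_pos] = hit_res in fir[ref_pos]
    return status


def screen_hits(
    aligned_ref: str,
    aligned_hits: Dict[str, str],
    fir: Dict[int, str],
    min_fraction: float = 1.0,
) -> List[str]:
    """Keep hits whose fraction of conserved FIRs reaches `min_fraction`."""
    kept: List[str] = []
    for name, seq in aligned_hits.items():
        conserved = sum(map_fir(aligned_ref, seq, fir).values())
        if fir and conserved / len(fir) >= min_fraction:
            kept.append(name)
    return kept


# Toy data (invented for illustration only).
reference = "MK-DHGSA"            # ungapped: M1 K2 D3 H4 G5 S6 A7
fir_sites = {3: "DE", 4: "H", 6: "ST"}
hits = {
    "hit_true": "MRADHGTA",       # all three FIRs conserved
    "hit_cross": "MK-NHG-A",      # only the H at position 4 conserved
}

print(screen_hits(reference, hits, fir_sites))
```

With the default `min_fraction=1.0`, only hits that retain every listed FIR pass; a looser threshold would admit partially conserved hits, which is a design choice the abstract does not specify.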
