A proteogenomic analysis of Anopheles gambiae using high-resolution Fourier transform mass spectrometry

利用高分辨率傅里叶变换质谱法对冈比亚按蚊进行蛋白质组学分析

阅读:5
作者:Raghothama Chaerkady, Dhanashree S Kelkar, Babylakshmi Muthusamy, Kumaran Kandasamy, Sutopa B Dwivedi, Nandini A Sahasrabuddhe, Min-Sik Kim, Santosh Renuse, Sneha M Pinto, Rakesh Sharma, Harsh Pawar, Nirujogi Raja Sekhar, Ajeet Kumar Mohanty, Derese Getnet, Yi Yang, Jun Zhong, Aditya P Dash, Robert

Abstract

Anopheles gambiae is a major mosquito vector responsible for malaria transmission, whose genome sequence was reported in 2002. Genome annotation is a continuing effort, and many of the approximately 13,000 genes listed in VectorBase for Anopheles gambiae are predictions that have still not been validated by any other method. To identify protein-coding genes of An. gambiae based on its genomic sequence, we carried out a deep proteomic analysis using high-resolution Fourier transform mass spectrometry for both precursor and fragment ions. Based on peptide evidence, we were able to support or correct more than 6000 gene annotations including 80 novel gene structures and about 500 translational start sites. An additional validation by RT-PCR and cDNA sequencing was successfully performed for 105 selected genes. Our proteogenomic analysis led to the identification of 2682 genome search-specific peptides. Numerous cases of encoded proteins were documented in regions annotated as intergenic, introns, or untranslated regions. Using a database created to contain potential splice sites, we also identified 35 novel splice junctions. This is a first report to annotate the An. gambiae genome using high-accuracy mass spectrometry data as a complementary technology for genome annotation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。