New gene annotations for three pea aphid genome assemblies allow comparative analyses of genes and gene family evolution

对三种豌豆蚜基因组组装体进行新的基因注释,可以对基因和基因家族的进化进行比较分析。

阅读:1

Abstract

Reliable genome annotation is crucial for analyses of gene function, conservation, and evolution. Factors such as the sequencing technology used to create the assembly and the amount of duplicated sequence within the genome of interest can have a large impact on the quality of gene annotations. In particular, short read-based assemblies tend to mis-assemble duplicated genes as single loci, a problem that requires additional long read sequencing to resolve. Pea aphids exhibit a high level of gene duplication, resulting in mis-assembly and mis-annotation of genes in the short read reference genome. Here, we re-annotate the pea aphid reference genome, along with two long read pea aphid genomes, to facilitate future analyses of gene duplication and function in pea aphids. We use an integrated approach, consolidating both ab initio and RNAseq-based annotations into unified gene models. The new annotations contain genes that were missing, mis-annotated, or mis-assembled in the reference, and are generally consistent across assemblies, showing very good agreement between the long read assemblies. Our annotation method is sensitive enough to refine existing gene models, uncovering alternatively used promoters and isoforms, and aids in finding gene duplications. These data provide a useful supplement to the existing reference annotations and a new comparative framework for discovery and analysis of gene function and duplication in this important emerging model insect.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。