K-mer-based approach for serodiagnostic antigen discovery in Chagas disease using unassembled sequencing reads

基于K-mer的未组装测序读段在恰加斯病血清诊断抗原发现中的应用

阅读:2

Abstract

Trypanosoma cruzi, the causative agent of Chagas disease (CD), exhibits remarkable genetic diversity, classified into six discrete typing units (DTUs), and one additional DTU, TcBat, primarily associated with bats. These DTUs are distributed differentially across CD-endemic regions, posing significant challenges for molecular and serological diagnosis, as test performance often varies geographically. Identifying conserved genomic regions shared among parasites circulating in distinct endemic areas is therefore essential. However, complete or semi-complete genome assemblies are available for only a limited number of strains, insufficiently capturing intra- and inter-DTU variability, particularly within repetitive multigene families. A wealth of raw T. cruzi genomic reads is publicly available, offering an opportunity to investigate highly repetitive, high-copy number sequences that are difficult to assemble but potentially valuable for improving diagnostic sensitivity. In this study, we applied a read-based bioinformatics pipeline to analyze data from six DTUs (TcI-TcVI), generating 80-mer fragments and clustering them to identify conserved sequences. Consensus sequences from conserved clusters were used to design synthetic peptides, which were evaluated serologically with samples from chronically infected individuals from Brazil, Bolivia, and Peru. Four peptides from the conserved C-terminal region of mucin family proteins demonstrated robust diagnostic performance (AUC: 0.8783-0.9353), with particularly high values obtained with sera from Brazilian and Bolivian patients. Overall, our results demonstrate that k-mer-based, assembly-free approaches can successfully identify conserved antigens across genetically diverse T. cruzi populations, underscoring their value as discovery tools for potential serological markers. While the peptides identified here represent promising candidates, validation in larger and more geographically diverse cohorts will be essential to establish their broader diagnostic applicability. Importantly, similar genome-informed strategies may also be leveraged to guide the discovery of diagnostic targets for other infectious diseases.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。