Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples

基于少量SNP的靶向NGS数据基因分型能够正确匹配患者样本

阅读:1

Abstract

OBJECTIVE: Mislabelling and swapping of laboratory samples are handling errors that can lead to erroneous interpretation of data and/or patient harm. Sequenced samples can be traced back to the respective donors by matching of single nucleotide polymorphisms (SNPs). Frameworks and software to do this have been developed for use with whole genome/exome sequencing data but not for targeted next-generation sequencing (tNGS), possibly due to the limited genomic coverage with tNGS and the need for individualization of the set of interrogated SNPs. We decided to adapt a popular tool for use with tNGS data, to demonstrate the possibility of selecting informative SNPs from a typical tNGS panel and to create an automated workflow for detection of sample handling errors. RESULTS: We compiled a custom list of 28 SNPs and with its help we demonstrated the practicability of using only tNGS data to cost-effectively detect mislabelled samples. In two cohorts of totally 1441 patients with sequential samples, we could identify 3 sample swaps, 7 mislabelled samples (3 externally and 4 internally) and 1 mistake of unknown origin. We provide an R function for automated detection of sample swaps and mislabelling to the community as a free and open-source tool.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。