Construction of SNP feature library for the identification of chicken breeds

构建用于鸡品种鉴定的SNP特征库

阅读:1

Abstract

Breed identification is an important prerequisite for the protection, development and utilization of animal genetic resources. This study developed an accurate identification strategy for chicken breeds using whole-genome sequencing data from 492 individuals belonging to 14 chicken breeds. These breeds include eight local Chinese breeds (Tibetan chicken, Chahua chicken, Daweishan chicken, Liyang chicken, Lindian chicken, Silky chicken, Dongxiang blue-shell egg chicken, and WenChang chicken), three standard chicken breeds (Rhode Island Red, Leghorn, and Light Sussex chicken), two commercial breeds (Cobb broiler and Yellow Plumage Dwarf chicken) and the Red Jungle fowl. We compared three ancestry informative marker (AIM) detection methods (Fst, I(n), and PCA-correlated SNPs) and four machine learning classifiers (K-NearestNeighbor, Support Vector Machine, Random Forest, and XGBoost) to identify the best breed identification model. A total of 30,831 high-information SNPs (Single nucleotide polymorphism) were detected and selected from these breeds using the three AIM detection methods. We found that several AIM methods performed well, but I(n) was the best. Machine learning classifiers were implemented to fit the important SNP loci, and ROC (receiver operating characteristic curve) curves were generated to evaluate the performance of these machine learning classifiers. The ROC curves and 5-fold cross-validation results indicated that XGBoost was the best machine learning classifier, with the largest AUC (Area Under Curve) (macro-AUC=0.9996). In addition, XGBoost achieved 100% accuracy using only 238 SNPs. In this study, it was observed that utilizing only 238 SNPs was effective for breed identification. We found that the combination of XGBoost and I(n) was the optimal strategy for breed identification. This study provides a new method for breed identification, which is highly important for the breeding and preservation of animal genetic resources.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。