Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa

应用机器学习探索自四倍体紫花苜蓿秋季休眠的基因组预测准确性

阅读:2

Abstract

Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes correlated with FD; however, these genes cannot predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole-genome SNP markers based on machine learning-related methods, including support vector machine (SVM) regression, and regularization-related methods, such as Lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3000 genome-wide association study (GWAS)-associated markers achieved the highest prediction accuracy for FD of 64.1%. For plant regrowth height, the prediction accuracy was 59.0% using the 3000 GWAS-associated markers and the SVM linear model. This was better than the results using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as Lasso and ElasticNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。