WheatGP, a genomic prediction method based on CNN and LSTM

WheatGP 是一种基于 CNN 和 LSTM 的基因组预测方法。

阅读:1

Abstract

Wheat plays a crucial role in ensuring food security. However, its complex genetic structure and trait variation pose significant challenges for breeding superior varieties. In this study, a genomic prediction method for wheat (WheatGP) is proposed. WheatGP is designed to improve the phenotype prediction accuracy by modeling both additive genetic effects and epistatic genetic effects. It is primarily composed of a convolutional neural network (CNN) module and a long short-term memory (LSTM) module. The multilayer CNNs within the CNN module focus on capturing short-range dependencies within the genomic sequence. Meanwhile, the LSTM module, with its unique gating mechanism, is designed to retain long-distance dependency relationships between gene loci in the features. Therefore, WheatGP could comprehensively extract multilevel features from genomic inputs. Compared to ridge regression best linear unbiased prediction (rrBLUP), extreme gradient boosting (XGBoost), support vector regression (SVR), and deep neural network genomic prediction (DNNGP), WheatGP demonstrates a clear advantage in terms of prediction accuracy. The prediction accuracy for wheat yield reaches 0.73, while the prediction accuracies for various agronomic traits range between 0.62 and 0.78. It also exhibits robust performance across other crop types and multi-omics datasets. In addition, SHapley Additive exPlanations (SHAP) is employed to evaluate the contributions of inputs to the predictive model. As a high-performance tool for genomic prediction in wheat, WheatGP opens up new possibilities for achieving efficient and optimized wheat breeding.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。