Normalized cumulative gain as an alternative evaluation measure for genomic selection models

标准化累积增益作为基因组选择模型的替代评估指标

阅读:2

Abstract

BACKGROUND: Genomic selection relies on a variety of statistical and machine learning methods to predict phenotypes from genomic data. Since no single method consistently outperforms others across datasets, evaluating and comparing model performance is essential. However, standard evaluation metrics such as Pearson's correlation coefficient and mean squared error treat genomic prediction as a regression problem, assessing overall fit rather than the effectiveness of selecting top-performing individuals for breeding. This disconnect can lead to suboptimal model selection in practice. RESULTS: To address this, we present the normalized cumulative gain (NCG) as an alternative evaluation measure that directly measures the phenotypic gain achieved from the individuals selected by the model. We applied this measure on four animal and plant datasets to compare nine commonly used methods for genomic prediction. CONCLUSIONS: NCG offers an intuitive and interpretable measure of selection efficiency, focusing solely on the individuals that would actually be chosen. We further demonstrate that calculating the performance under all possible selection thresholds provides more information than a single or few arbitrary thresholds. This more granular analysis shows that the performance of the methods may differ under varying selection intensities and can provide guidance for the choice of selection intensity. Our approach is implemented in R and is available at https://github.com/FelixHeinrich/GS_Comparison_with_NCG .

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。