The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz.
Tailored machine learning models for functional RNA detection in genome-wide screens.
阅读:4
作者:Klapproth Christopher, Zötzsche Siegfried, Kühnl Felix, Fallmann Jörg, Stadler Peter F, Findeià Sven
| 期刊: | NAR Genomics and Bioinformatics | 影响因子: | 2.800 |
| 时间: | 2023 | 起止号: | 2023 Aug 21; 5(3):lqad072 |
| doi: | 10.1093/nargab/lqad072 | ||
特别声明
1、本文转载旨在传播信息,不代表本网站观点,亦不对其内容的真实性承担责任。
2、其他媒体、网站或个人若从本网站转载使用,必须保留本网站注明的“来源”,并自行承担包括版权在内的相关法律责任。
3、如作者不希望本文被转载,或需洽谈转载稿费等事宜,请及时与本网站联系。
4、此外,如需投稿,也可通过邮箱info@biocloudy.com与我们取得联系。
