A genetic algorithm-based ensemble model for efficiently identifying interleukin 6 inducing peptides

基于遗传算法的集成模型可高效识别白细胞介素 6 诱导肽

阅读:1

Abstract

Interleukin-6 (IL-6) is a cytokine with diverse biological activities that contribute to a variety of physiologic and immune responses. IL-6-inducing peptides are the short protein fragments that are critical for playing a contributing role in biological processes. Extensive research has advanced the development of IL-6-inducing peptides, but identifying these peptides experimentally remains time-consuming, labor-intensive, and costly. Therefore, computational prediction has gained attention as an alternative method. Meanwhile, some computational methods have already been developed, but they suffer from insufficient accuracy and inadequate feature engineering. In this study, we developed PredIL6, an advanced ensemble learning model that precisely identifies IL-6-inducing peptides by combining probability scores from 148 baseline machine learning and deep learning models, using a genetic algorithm-based meta-classifier. A forward feature selection method was used to construct the ensemble model, which consists of 20 baseline or single-feature models, including AAINDEX, BLOSUM62, and language models (ESM-2 and word2vec). PredIL6 outperformed existing state-of-the-art methods, achieving accuracy values of 0.934 and 0.899 on the training and test sets, respectively. Thus, PredIL6 is a powerful tool for expediting the identification of IL-6-inducing peptides. A freely available web application and a standalone PredIL6 program are provided.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。