Constructing inflammatory bowel disease diagnostic models based on k-mer and machine learning

基于k-mer和机器学习构建炎症性肠病诊断模型

阅读:1

Abstract

BACKGROUND: Inflammatory bowel disease (IBD), encompassing Crohn's disease (CD) and ulcerative colitis (UC), is linked to significant alterations in gut microbiota. Conventional diagnostic approaches frequently rely on invasive procedures, contributing to patient discomfort; hence, non-invasive diagnostic models present a valuable clinical alternative. METHODS: Metagenomic and amplicon sequencing data were collected from fecal samples of patients with IBD and healthy individuals across diverse geographic regions. Diagnostic models were developed using Logistic Regression (LR), Support Vector Machine (SVM), Naïve Bayes (NB), and Feedforward Neural Network (FFNN), complemented by an ensemble model via a voting mechanism. Five-fold cross-validation facilitated the differentiation between normal controls (NC) and IBD, as well as between CD and UC. RESULTS: K-mer-based methods leveraging metagenomic sequencing data demonstrated robust diagnostic performance, yielding ROC AUCs of 0.966 for IBD vs. NC and 0.955 for CD vs. UC. Similarly, models based on amplicon sequencing achieved ROC AUCs of 0.831 for IBD vs. NC and 0.903 for CD vs. UC. In comparison, k-mer-based approaches outperformed traditional microbiota-based models, which produced lower ROC AUCs of 0.868 for IBD vs. NC and 0.810 for CD vs. UC. Across all machine learning frameworks, the FFNN consistently attained the highest ROC AUC, underscoring its superior diagnostic performance. CONCLUSION: The integration of k-mer-based feature extraction with machine learning offers a non-invasive, highly accurate approach for IBD diagnosis, surpassing traditional microbiota-based models. This method holds considerable potential for clinical use, offering an effective alternative to invasive diagnostics and enhancing patient comfort.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。