Multiomics Analysis and Machine Learning-based Identification of Molecular Signatures for Diagnostic Classification in Liver Disease Types Along the Microbiota-gut-liver Axis

基于多组学分析和机器学习的分子特征识别在沿肠道菌群-肝脏轴的肝病类型诊断分类中的应用

阅读:1

Abstract

BACKGROUND: Liver disease, responsible for around two million deaths annually, remains a pressing global health challenge. Microbial interactions within the microbiota-gut-liver axis play a substantial role in the pathogenesis of various liver conditions, including early chronic liver disease (eCLD), chronic liver disease (CLD), acute liver failure (ALF), acute-on-chronic liver failure (ACLF), non-alcoholic fatty liver disease (NAFLD), steatohepatitis, and cirrhosis. This study aimed to identify key molecular signatures involved in liver disease progression by analyzing transcriptomic and gut microbiome data, and to evaluate their diagnostic utility using machine learning models. METHODS: Transcriptomic analysis identified differentially expressed genes (DEGs) that, when integrated with regulatory elements microRNAs, transcription factors, receptors, and the gut microbiome highlight disease-specific molecular interactions. To assess the diagnostic potential of these molecular signatures, a two-step analysis involving principal component analysis (PCA) and Random Forest classification was conducted, achieving accuracies of 75% for ALF and 89% for NAFLD. Additionally, machine learning algorithms, including K-neighbors, multi-layer perceptron (MLP), decision tree, Random Forest, logistic regression, gradient boosting, CatBoost, Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM), were applied to gene expression data for ALF and NAFLD. RESULTS: Key genes including CLDN14, EGFR, GSK3B, MYC, and TJP2, alongside regulatory miRNAs let-7a-5p, miR-124-3p, and miR-195-5p and transcription factors NFKB1 and SP1 may be suggested as critical to liver disease progression. Additionally, gut microbiota members, Dictyostelium discoideum and Eikenella might be novel candidates associated with liver disease, highlighting the importance of the gut-liver axis. The Random Forest model reached 75% accuracy and 83% area under the curve for ALF, while NAFLD classification achieved 100% accuracy, precision, recall, and area under the curve underscoring robust diagnostic potential. CONCLUSION: This study establishes a solid foundation for further research and therapeutic advancement by identifying key biomolecules and pathways critical to liver disease. Additional experimental validation is needed to confirm clinical applicability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。