Transparent Machine Learning Reveals Diagnostic Glycan Biomarkers in Subarachnoid Hemorrhage and Vasospasm

透明机器学习揭示蛛网膜下腔出血和血管痉挛中的诊断性聚糖生物标志物

阅读:1

Abstract

Subarachnoid hemorrhage (SAH) and its major complication, cerebral vasospasm (CVS), present significant challenges for early diagnosis and risk stratification. In this study, we developed interpretable decision tree models to differentiate between healthy controls, SAH patients, and SAH patients with vasospasm using serum N-glycomic data. Building on previously published glycomic profiles, we introduced a refined modeling approach combining systematic preprocessing, feature selection, and interpretable machine learning. Our methodology included outlier removal, standard scaling, and a novel correlation-based feature reduction guided by feature importance scores derived from preliminary decision trees. Binary classification tasks (Control vs. SAH and Control vs. CVS, and SAH vs. CVS) were evaluated through stratified repeated cross-validation and hyperparameter optimization. Models achieved high accuracy (up to 0.91) and stable F1-scores across configurations. Key glycans such as FA2(6)G1 (bi-antennary, fucosylated, monogalactosylated), A4G4S3(2) (tetra-antennary, tetra-galactosylated, tri-sialylated), and A3G3S3(5) (tri-antennary, tri-galactosylated, tri-sialylated) emerged as the most discriminative. Visualizations that combine joint feature distributions and decision boundaries provided intuitive insight into the classifier's logic. These findings support the integration of interpretable glycomics-based models into clinical workflows.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。