MAF-Net: Multimodal cross-attention-based fusion network for cardiovascular disease classification

MAF-Net:一种基于多模态交叉注意力机制的融合网络,用于心血管疾病分类

阅读:1

Abstract

Cardiovascular disease ranks among the leading causes of death globally, posing a severe threat to human health. Consequently, rapid and accurate identification of cardiovascular disease has become a critical research endeavor. Electrocardiograms (ECGs), as a non-invasive detection tool, are widely used in cardiovascular disease detection due to their convenience and effectiveness. However, existing methods are often limited to single-modality analysis, neglecting the interaction between clinical data (such as age, gender, weight, etc.) and ECG features in classification tasks, resulting in limited recognition accuracy. Integrating multimodal data is key to improving CVD diagnostic accuracy. To address this, we propose MAF-Net (Multimodal Cross-Attention-based Fusion Network), a multi-class classification model that fuses clinical data features with ECG signal features for cardiovascular disease classification. The model comprises three components: (1) X Branch (Clinical Data Processing): Generates high-order interaction features via a second-order polynomial feature cross-layer and employs channel attention-weighted selection to identify key clinical factors;(2) Y Branch (Multi-scale ECG Feature Extraction): Parallel multi-scale convolutional modules (64@7, 128@3, 256@3) capture local morphological features, while Bi-LSTM models long-range temporal dependencies, supplemented by multi-head attention to focus on pathological segments;(3) Bidirectional Modality Fusion Module: Employing a bidirectional cross-attention mechanism, it uses clinical features as Query and ECG features as Key/Value to deeply fuse clinical and ECG data features. On the dataset, experiments targeting five super-categories of arrhythmias- NORM (Normal), MI (Myocardial Infarction), STTC (ST-T Segment Changes), CD (Conduction Disturbance), HYP (Hypertrophy). showed an accuracy rate of 90.75% ± 0.32%, precision of 84.58% ± 0.41%, and recall of 87.12% ± 0.38%, with an F1 score of 0.8069 ± 0.005 and a ROC-AUC value of 0.9407 ± 0.002. Results indicate that this model outperforms existing methods across key metrics, demonstrating its potential for application in clinical decision support.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。