Enhancing Glaucoma Diagnosis Through Multi-Layer Transformer and Multi-Modal Feature Fusion

通过多层Transformer和多模态特征融合增强青光眼诊断

阅读:2

Abstract

PURPOSE: To develop a more accurate glaucoma grading framework by combining multiple examination modalities, aiming to overcome the limitations of single-modality diagnostic systems for comprehensive glaucoma diagnosis. METHODS: This paper proposes a novel multi-modal-based glaucoma grading framework to classify healthy, mild glaucoma, and moderate-to-severe glaucoma patients. The method simulates the clinical diagnosis process by leveraging multiple examination modalities and integrating prior knowledge of ocular structure to enhance feature learning. A multi-modal feature fusion framework (M2F3) is developed, utilizing a multi-layer transformer (MLT) for efficient combination of modalities. A contrastive learning strategy is also employed to improve feature learning further. RESULTS: Experimental results demonstrated that the proposed M2F3 glaucoma grading method shows a substantial 0.0465 increase in Cohen's kappa (κ) coefficient compared to state-of-the-art (SOTA) methods on the Glaucoma grAding from Multi-Modality imAges (GAMMA) dataset. CONCLUSIONS: The proposed multi-modal-based glaucoma grading framework offers a more accurate diagnostic tool by integrating multiple examination modalities and prior knowledge, representing a substantial improvement over existing single-modality-based systems.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。