Classifying retinal diseases via pyramid vision graph convolutional network for optical coherence tomography images

利用金字塔视觉图卷积网络对光学相干断层扫描图像进行视网膜疾病分类

阅读:1

Abstract

Recent advancements have seen a significant focus on using deep neural networks for classifying retinal diseases in optical coherence tomography (OCT) images. However, traditional deep neural networks treat images as grid or sequential structures, limiting their flexibility in capturing irregular and complex objects, resulting in suboptimal performance in practical applications. To address this issue, we propose a novel visual neural network model with a pyramid structure, called pyramid vision graph convolutional networks (PVGCN). This model enhances the correlations between structures by segmenting images into multiple nodes and connecting the nearest nodes. Specifically, it consists of two core components: 1) vision graph block and 2) pyramid structure. The vision graph block, composed of a grapher block and a feed-forward network (FFN), uses graph convolution methods to divide the image into multiple regions, treating them as nodes and representing the image as graph data. The graph constructed based on nodes can capture relationships between nodes without positional restrictions, better representing the irregular structure of retinal tissue. The FFN module improves the over-smoothing phenomenon in the grapher stage, enabling more accurate classification. The pyramid structure decomposes OCT images into a series of sub-images at different scales, integrating features at different scales to obtain a comprehensive feature representation of retinal hierarchical structure information. This structure can replace the extraction of higher-dimensional features in a large model by integrating features at different scales, significantly reducing the number of parameters. We conducted extensive experiments on two different datasets. The experimental results show that the proposed PVGCN achieved accuracies of 0.9954 and 0.9787 on the two datasets, respectively, surpassing existing methods. Additionally, the model demonstrated recognition capabilities comparable to those of human experts in the experiments, effectively identifying retinal diseases in OCT images.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。