Abstract
PURPOSE: To assess the performance of an artificial intelligence deep learning (DL) model compared with neuro-ophthalmologists for the classification of subjects with elevated intracranial pressure (ICP) and papilledema versus control subjects, based on retinal images with and without masking the optic nerve heads (ONHs). METHODS: Widefield retinal images were obtained in 32 subjects (70 images) with elevated ICP and papilledema and 31 control subjects (62 images). ONH-unmasked images were generated by cropping the image to a 30° circular region. ONH-masked images were generated by masking the ONH and peripapillary region. Classification was performed using a convolutional neural network model and by two neuro-ophthalmologists (graders). RESULTS: For the ONH-unmasked images, the classification accuracies of the model, grader 1, and grader 2 were 83% (area under the receiver operating characteristic curve [AUC] = 0.94), 93%, and 86%, respectively. For the ONH-masked images, the classification accuracies of the model, grader 1, and grader 2 were 79% (AUC = 0.90), 66%, and 76%, respectively. The classification performance of the DL model and graders did not significantly differ for both datasets (P ≥ 0.38). There was no significant effect of ONH masking on the performance of the DL model (P = 1.0) and grader 2 (P = 0.38), whereas the performance of grader 1 was significantly reduced (P = 0.02). CONCLUSIONS: Both expert graders and the DL models demonstrated excellent performance for classifying retinal images of subjects with elevated and normal ICP using images with or without masking the ONH. TRANSLATIONAL RELEVANCE: Methods for assessment of non-optic nerve retinal image features have the potential to improve diagnosis and monitoring the progression and response to treatment of elevated ICP.