Multicenter Clinical Validation of an Artificial Intelligence Diagnostic Classification Model for Laryngoscopy Images

人工智能诊断分类模型在喉镜图像分类中的多中心临床验证

阅读:1

Abstract

OBJECTIVE: To develop and externally validate a computer-aided diagnosis (CADx) model using artificial intelligence (AI) for classifying laryngeal lesions from laryngoscopy images into high-risk (HR), low-risk (LR). STUDY DESIGN: Retrospective multicenter development of a CADx model and external validation on independent cohorts. SETTING: Multicenter tertiary referral hospitals (Italy, India, China, Greece, and Spain). METHODS: Over 20,000 images derived from laryngoscopic examinations were retrieved. Images were annotated based on histopathology or expert consensus. A deep learning model was trained using an internal dataset and evaluated on 2 external datasets to assess generalizability. The CADx model classifies only images containing visible lesions, discriminating between LR and HR categories. Diagnostic performance was measured using standard metrics, including accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (AUC). Model performance was compared with physicians of varying expertise and ChatGPT-4o. RESULTS: The computer-aided diagnosis model achieved a similar performance across internal and external datasets in distinguishing HR from LR lesions, with accuracy/AUC of 0.90/0.89 internally, 0.85/0.85 on the Greek dataset, and 0.88/0.88 on the Spanish dataset. The model's accuracy was statistically noninferior to that of otolaryngologists and expert laryngologists, and superior to general practitioners and ChatGPT-4o. CONCLUSION: This is a large multicenter clinical validation of a CADx model for laryngeal endoscopy, demonstrating generalizability and performance comparable to clinicians in discriminating between LR and HR lesions. The model's success supports its potential role in augmenting diagnostic capabilities, especially in resource-limited settings. A prospective multicenter clinical trial is underway to assess real-world clinical implementation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。