Optimizing Thyroid Nodule Evaluation: AI Integration Into the Thyroid Imaging Reporting and Data System Through AI-Based Ultrasound Image Analysis

优化甲状腺结节评估:通过基于人工智能的超声图像分析将人工智能集成到甲状腺影像报告和数据系统中

阅读:1

Abstract

Background Thyroid nodules are among the most common endocrine abnormalities, with ultrasound serving as the first-line tool for risk stratification. The American College of Radiology Thyroid Imaging Reporting and Data System (ACR-TIRADS) standardizes evaluation but is limited by interobserver variability and the time required for detailed interpretation. Artificial intelligence (AI) offers the opportunity to address these limitations and to automate diagnostic processes and enhance diagnostic accuracy. Objective To develop and evaluate a vision-language AI model for ACR-TIRADS-based risk stratification of thyroid nodules on ultrasound. Methodology This retrospective study analyzed 1,000 thyroid ultrasound images collected between March 2024 and January 2025, of which 139 met the inclusion criteria. Images were annotated according to ACR-TIRADS features (composition, echogenicity, shape, margins, echogenic foci). A vision-language AI model (LLaVA-Med, Microsoft Research, Redmond, WA) was trained using a two-stage strategy that included domain-specific pretraining and fine-tuning on a curated dataset. Diagnostic performance was assessed as a binary classification: suspicious (TR3-TR5) vs. non-suspicious (TR1-TR2). Results The model achieved an accuracy of 67%, sensitivity of 71%, specificity of 53%, and precision of 84.6%. The F1 score is an average of an AI algorithm's precision and recall, used to evaluate the algorithm's predictive performance. Our model achieved an F1 score of 77%, and its performance favored sensitivity, reducing the likelihood of missed malignant nodules, though specificity remained moderate. Conclusion The vision-language AI model trained on ACR-TIRADS features demonstrated promising performance in thyroid nodule risk stratification. Its higher sensitivity and explainable outputs reflect its potential as a supportive screening tool in clinical practice, particularly in settings with limited radiological expertise. Further refinement and multi-institutional validation are warranted.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。