Transformer-powered precision: A DETR-based approach for robust detection in medical ultrasound with cholelithiasis as a case study

变压器驱动的高精度:基于DETR的稳健检测方法在医学超声中用于胆结石检测

阅读:2

Abstract

BACKGROUND AND OBJECTIVE: Transformers have demonstrated strong capabilities in capturing long-range dependencies in visual data, but their application to noisy, low-contrast medical imaging such as ultrasound remains limited. Cholelithiasis detection, often hindered by the inherent limitations of ultrasound imaging, requires more precise and robust computational approaches. This study introduces a detection architecture that integrates convolutional feature extraction with a customized Detection Transformer (DETR) to improve localization and detection in challenging ultrasound conditions. METHODS: The proposed method combines convolutional inductive biases with transformer-based self-attention to capture both local and global spatial relationships. It was applied to the task of detecting gallstones and gallbladder regions in ultrasound images of cholelithiasis. The approach was benchmarked against state-of-the-art object detection models, including RT-DETR, YOLOv8, and YOLO-NAS. Radiologists validated the bounding boxes generated by the model to assess clinical reliability. RESULTS: The custom DETR achieved confidence score up to 99 % in detecting gallstones and gallbladder regions, outperforming RT-DETR, YOLOv8, and YOLO-NAS. The method demonstrated mean average precision improvements of 13 % and 14 % compared to YOLOv8 and YOLO-NAS, respectively. Radiologist validation confirmed the clinical accuracy and robustness of the proposed detection framework. CONCLUSIONS: By effectively addressing the challenges of low-quality ultrasound imaging, the proposed DETR-based framework provides a reliable and generalizable approach for automated cholelithiasis detection. The findings highlight its potential for integration into real-world diagnostic workflows and its applicability to broader intelligent diagnostic systems at the intersection of computational science, medical informatics, and vision transformers.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。