A MultiRater MultiOrgan Abdominal CT Dataset for Calibration Analysis and Uncertainty Modeling in Segmentation

Abstract

In medical imaging, deep learning (DL) models often struggle to delineate ambiguous structures such as tumors or organ boundaries, leading to uncertainty in defining precise contours. This challenge is amplified by inter-rater variability, where experts may disagree on boundary delineations, resulting in inconsistent segmentation outcomes. Addressing these issues requires robust algorithms capable of quantifying uncertainty, standardizing annotation practices, and improving calibration to ensure reliable predictions, particularly in multi-class and multi-rater scenarios. When models are miscalibrated and overconfident, their outputs can mislead clinical decision-making, potentially influencing radiologists to over- or under-estimate malignancy risks. The CURVAS challenge (Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation) was established to address these challenges by jointly assessing uncertainty, calibration, and segmentation quality, as well as promoting clinical relevance by evaluating organ volumes while accounting for annotation variability. To support this, a dataset of 90 contrast-enhanced CT scans from University Hospital Erlangen was curated, containing pancreas, liver, and kidney segmentations annotated by three experts. This resource provides a foundation for developing and benchmarking algorithms that balance segmentation accuracy, calibration, and reliability. A quantitative analysis of the annotations shows that kidney and liver segmentations exhibit strong consistency, whereas the pancreas remains challenging, emphasizing the need for refined labeling protocols and improved training strategies.
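One common way to quantify the inter-rater consistency mentioned above is the mean pairwise Dice overlap between the raters' masks, computed per organ. The sketch below is illustrative only and is not confirmed to be the analysis used for this dataset: the label coding (0 = background, 1 = pancreas, 2 = kidney, 3 = liver), the function names, and the toy masks are all assumptions, and real CT volumes would be flattened 3D arrays rather than short lists.

```python
from itertools import combinations


def dice(mask_a, mask_b, label):
    """Dice overlap for one label between two flat label sequences."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must contain the same number of voxels")
    in_a = sum(1 for x in mask_a if x == label)
    in_b = sum(1 for y in mask_b if y == label)
    in_both = sum(1 for x, y in zip(mask_a, mask_b) if x == label and y == label)
    if in_a + in_b == 0:
        # Assumption: if neither rater marked the label, treat it as full agreement.
        return 1.0
    return 2.0 * in_both / (in_a + in_b)


def mean_pairwise_dice(rater_masks, label):
    """Average the Dice score over every pair of raters for one label."""
    pairs = list(combinations(rater_masks, 2))
    return sum(dice(a, b, label) for a, b in pairs) / len(pairs)


# Hypothetical toy masks from three raters (flattened voxels).
# Assumed coding: 0 = background, 1 = pancreas, 2 = kidney, 3 = liver.
rater_1 = [0, 1, 1, 2, 2, 3, 3, 3]
rater_2 = [0, 1, 0, 2, 2, 3, 3, 3]
rater_3 = [1, 1, 0, 2, 2, 3, 3, 3]
raters = [rater_1, rater_2, rater_3]

for name, label in [("pancreas", 1), ("kidney", 2), ("liver", 3)]:
    print(f"{name}: {mean_pairwise_dice(raters, label):.3f}")
```

In this toy setup the raters agree fully on the kidney and liver voxels and only partly on the pancreas, which mirrors the qualitative pattern the abstract reports. The numbers themselves carry no meaning beyond the illustration.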
