Identification of areas of grading difficulties in prostate cancer and comparison with artificial intelligence assisted grading

识别前列腺癌分级中的难点并与人工智能辅助分级进行比较

阅读：1

作者：Egevad,Lars,Swanberg,Daniela,Delahunt,Brett,Ström,Peter,Kartasalo,Kimmo,Olsson,Henrik,Berney,Dan M,Bostwick,David G,Evans,Andrew J,Humphrey,Peter A,Iczkowski,Kenneth A,Kench,James G,Kristiansen,Glen,Leite,Katia R M,McKenney,Jesse K,Oxley,Jon,Pan,Chin-Chen,Samaratunga,Hemamali,Srigley,John R,Takahashi,Hiroyuki,Tsuzuki,Toyonori,van der Kwast,Theo,Varma,Murali,Zhou,Ming,Clements,Mark,Eklund,Martin

期刊：	Virchows Archiv	影响因子：	3.100
时间：	2020	起止号：	2020 Dec;477(6):777-786
doi：	10.1007/s00428-020-02858-w	研究方向：	肿瘤
疾病类型：	前列腺癌

Abstract

The International Society of Urological Pathology (ISUP) hosts a reference image database supervised by experts with the purpose of establishing an international standard in prostate cancer grading. Here, we aimed to identify areas of grading difficulties and compare the results with those obtained from an artificial intelligence system trained in grading. In a series of 87 needle biopsies of cancers selected to include problematic cases, experts failed to reach a 2/3 consensus in 41.4% (36/87). Among consensus and non-consensus cases, the weighted kappa was 0.77 (range 0.68-0.84) and 0.50 (range 0.40-0.57), respectively. Among the non-consensus cases, four main causes of disagreement were identified: the distinction between Gleason score 3 + 3 with tangential cutting artifacts vs. Gleason score 3 + 4 with poorly formed or fused glands (13 cases), Gleason score 3 + 4 vs. 4 + 3 (7 cases), Gleason score 4 + 3 vs. 4 + 4 (8 cases) and the identification of a small component of Gleason pattern 5 (6 cases). The AI system obtained a weighted kappa value of 0.53 among the non-consensus cases, placing it as the observer with the sixth best reproducibility out of a total of 24. AI may serve as a decision support and decrease inter-observer variability by its ability to make consistent decisions. The grading of these cancer patterns that best predicts outcome and guides treatment warrants further clinical and genetic studies. Results of such investigations should be used to improve calibration of AI systems.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。