Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP

利用CLAMP开发可定制的病理报告癌症信息提取模块

阅读：1

作者：Soysal,Ergin,Warner,Jeremy L,Wang,Jingqi,Jiang,Min,Harvey,Krysten,Jain,Sandeep Kumar,Dong,Xiao,Song,Hsing-Yi,Siddhanamatha,Harish,Wang,Liwei,Dai,Qi,Chen,Qingxia,Du,Xianglin,Tao,Cui,Yang,Ping,Denny,Joshua Charles,Liu,Hongfang,Xu,Hua

期刊：		影响因子：
时间：	2019	起止号：	2019 Aug 21;264：1041-1045
doi：	10.3233/SHTI190383	研究方向：	肿瘤

Abstract

Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial effort. To facilitate the adoption of NLP in cancer research, we have developed a set of customizable modules for extracting comprehensive types of cancer-related information in pathology reports (e.g., tumor size, tumor stage, and biomarkers), by leveraging the existing CLAMP system, which provides user-friendly interfaces for building customized NLP solutions for individual needs. Evaluation using annotated data at Vanderbilt University Medical Center showed that CLAMP-Cancer could extract diverse types of cancer information with good F-measures (0.80-0.98). We then applied CLAMP-Cancer to an information extraction task at Mayo Clinic and showed that we can quickly build a customized NLP system with comparable performance with an existing system at Mayo Clinic. CLAMP-Cancer is freely available for academic use.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。