Flat-Lattice-CNN: A model for Chinese medical-named-entity recognition

Flat-Lattice-CNN:一种用于中文医学命名实体识别的模型

阅读:1

Abstract

BACKGROUND: In the field of internet-based healthcare, the complexity of pathology features across various disciplines, coupled with the lack of medical training among most patients, results in medical named entities in doctor patient dialogue texts exhibiting long and multiword syntactic patterns, posing new challenges to named-entity recognition algorithms. METHODS: To address the issue mentioned above, in this study we integrate Convolutional Neural Networks (CNNs) of different dilation rates on top of Flat-Lattice architecture to construct the Flat-Lattice-CNN model. This model not only considers the semantic information of characters and words, as well as their absolute and relative positional information, but also extracts multiple-token co-occurrence relationship features among characters/words spanning different distances to improve the recognition accuracy of long medical-named entities. RESULTS: Experimental results show an improved performance in the task of recognizing medical-named entities on all evaluation datasets, especially on CTDD with a 2.3% increase in F1 score. The proposed Flat-Lattice-CNN model effectively addresses the challenges posed by long and multiword syntactic patterns in medical-named entities, offering improved recognition accuracy and demonstrating the potential for enhancing medical-named-entity recognition in internet-based healthcare dialogues.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。