Psychomedical named entity recognition method based on multi-level feature extraction and multi-granularity embedding fusion

基于多层次特征提取和多粒度嵌入融合的心理医学命名实体识别方法

阅读:2

Abstract

Named Entity Recognition (NER) in psychomedicine is one of the key tasks in natural language processing in psychomedicine. It aims to identify and classify specialized terms in psychomedical texts and provide powerful support for downstream tasks. Psychological medicine texts are characterized by long paragraphs, complex sentences, and scattered knowledge. The current character-based psychomedicine NER model has single embedded information. It lacks structural and phonetic characterization information. Migrating NER models from the general purpose domain to the psychomedical domain are not effective in improving entity recognition accuracy. To solve this problem, we propose a NER method based on multi-level feature extraction and multi-granularity embedding fusion (MFME-NER), which aims to provide an innovative solution. First, three different granularities of embedding information, character embedding, radical embedding and pinyin embedding, are introduced to enrich the semantic representation of the input text. Second, the BERT model is improved. Merging the features of all Encoder layers inside the output. So that the BERT model has multi-layer feature extraction capability (MFE-BERT). The character embedding is pre-trained by MFE-BERT. And the BiLSTM model is utilized for the extraction of features at the character granularity. The features of radical embedding and pinyin embedding are extracted separately by the CNN model, and then feature fusion is performed. Finally, feature vectors at three granularities are integrated using a gated feed-forward neural network attention mechanism (GA-FNNAtention). The experimental results show that MFME-NER achieved 94.26% and 89.63% F1 Score in the self-constructed psychomedical dataset PsyDatase and CBLUE dataset, respectively. The proposed method surpasses the currently used evaluation metrics, thus substantiating its rationality and efficacy.This study can better contribute to the analysis of psychomedical data.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。