Abstract
Named Entity Recognition (NER) stands as a fundamental task in Chinese information processing. However, it encounters unique difficulties due to the lack of explicit word boundaries in the Chinese language. This study proposes framing Chinese NER as a joint task that combines boundary detection and entity identification within an encoder-decoder architecture. The presented method utilizes hybrid embeddings to enhance word-level representations and naturally incorporates head and tail boundary information to improve NER performance. It combines two types of tasks: sequence labeling for NER and binary classification for boundary prediction. In the primary NER task, a convolutional attention network serves as the encoder to extract contextual information about the target word from the input. For the auxiliary boundary prediction task, two Bi-GRU networks are employed to model long range semantic associations and predict the start and end of entities. A feature fusion layer is then introduced to adjust the contribution of the main and auxiliary tasks to the hidden states of the global representation. The final input representation, obtained through the joint training framework where the learned boundary information supports the NER task, is passed to the CRF decoding layer. Experimental results on the Weibo and Ontonotes5.0 datasets show that the multi - task learning framework significantly enhances Chinese NER performance compared to existing models.