融合位置和实体类别信息的中文命名实体识别

doi:10.19678/j.issn.1000-3428.0068741

摘要/Abstract

摘要：

词语作为一种上下文信息在中文命名实体识别(NER)任务中发挥着重要作用。以往基于字符的中文NER方法虽然在一定程度上取得了成功, 但仍存在词语信息嵌入方式简单、特征捕捉方式单一, 且忽视了潜在词的影响、未能充分利用词语信息的问题。针对上述问题, 提出一种改进的中文NER方法。首先, 通过基于全遮蔽技术的预训练模型RoBERTa-wwm将文本表示为字符级嵌入向量; 其次, 使用门控空洞卷积神经网络(DGCNN)模型进一步捕捉文本的特征信息; 然后, 在键值记忆网络(KV-MemNN)模型中采用位置实体类别组合机制(PECM)更有效地融合词语信息并缓解潜在词冲突所带来的影响; 最后, 利用条件随机场(CRF)模型对预测结果进行约束得到最佳标签序列。实验结果表明, 该方法在Weibo、MSRA和Resume数据集上的F1值分别达到71.82%、95.00%和96.14%, 相比于融合词语信息的FLAT模型分别提升了11.50、0.88、0.69百分点, 同时在不同实体上的识别表现整体优于RoBERTa-wwm+CRF和Lattice LSTM模型。此外, 通过预训练模型的对比和消融实验进一步证明了KV-MemNN和RoBERTa-wwm模型的有效性。

关键词: 命名实体识别, 键值记忆网络, 词语信息, 位置信息, 实体类别信息

Abstract:

Words play a crucial role as contextual information in Chinese Named Entity Recognition (NER) tasks. Although character-based methods have achieved some success, existing methods simplistically embed word information and use a limited feature capture approach, overlooking the influence of potential words and failing to fully utilize word information. To address these issues, this study proposes an improved Chinese NER method. First, the pretrained model uses a full-word masking technique, known as RoBERTa-wwm, to represent the text as character-level embedding vectors. Second, the Dilated Gated Convolution Neural Network (DGCNN) model captures additional textual features. Third, the Key-Value Memory Network (KV-MemNN) model introduces a Position Entity-category Combination Mechanism (PECM) to integrate word information and further mitigate the impact of potential word conflicts. Finally, the Conditional Random Field (CRF) model constrains the predicted results and obtains the optimal label sequences. The experimental results obtained F1 values of 71.82%, 95.00%, and 96.14% for the Weibo, MSRA, and Resume datasets, respectively. This represents an improvement of 11.50, 0.88, and 0.69 percentage points compared to the FLAT model integrating word information. Moreover, this method outperformed RoBERTa-wwm+CRF and Lattice Long Short-Term Memory (LSTM) in entity recognition across various types. Pretrained model comparisons and ablation studies confirmed the effectiveness of the KV-MemNN and RoBERTa-wwm models.

Key words: Named Entity Recognition (NER), Key-Value Memory Network (KV-MemNN), word information, positional information, entity category information

杨竣辉, 李苏晋. 融合位置和实体类别信息的中文命名实体识别[J]. 计算机工程, 2025, 51(3): 113-121.

YANG Junhui, LI Sujin. Chinese Named Entity Recognition Integrating Positional and Entity Category Information[J]. Computer Engineering, 2025, 51(3): 113-121.

https://www.ecice06.com/CN/Y2025/V51/I3/113

图/表 10

图1 RDKVC模型整体架构

Fig.1 Overall architecture of RDKVC model

图2 DGCNN模型

Fig.2 DGCNN model

图3 KV-MemNN模型

Fig.3 KV-MemNN model

图4 实体标签表的生成规则

Fig.4 Rules for generating the entity label table

图5 不同实体上的性能表现

Fig.5 Performance on different entities

参考文献 27

1	ZHANG N X , LI F , XU G L , et al. Chinese NER using dynamic meta-embeddings. IEEE Access, 2019, 7, 64450- 64459. doi: 10.1109/ACCESS.2019.2916816
2	罗辉, 卢玲. 面向中文命名实体识别的中文字符表示方法. 小型微型计算机系统, 2023, 44 (7): 1434- 1440. doi: 10.20009/j.cnki.21-1106/TP.2021-0862
	LUO H , LU L . Character embedding method for Chinese named entity recognition. Journal of Chinese Computer System, 2023, 44 (7): 1434- 1440. doi: 10.20009/j.cnki.21-1106/TP.2021-0862
3	ZHANG Y, YANG J. Chinese NER using Lattice LSTM[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg, USA: Association for Computational Linguistics, 2018: 1554-1564.
4	LIU W, XU T G, XU Q H, et al. An encoding strategy based word-character[C]//Proceedings of the 2019 Conference of the North. Stroudsburg, USA: Association for Computational Linguistics, 2019: 2379-2389.
5	LUO Y, XIAO F S, ZHAO H. Hierarchical contextualized representation for named entity recognition[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Palo Alto, AAAI Press, 2020: 8441-8448.
6	MILLER A, FISCH A, DODGE J, et al. Key-value memory networks for directly reading documents[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2016: 1400-1409.
7	LIU T Y, YAO J G, LIN C Y. Towards improving neural named entity recognition with gazetteers[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2019: 5301-5307.
8	GUI T, MA R T, ZHANG Q, et al. CNN-based Chinese NER with lexicon rethinking[EB/OL]. [2023-09-11]. https://www.ijcai.org/Proceedings/2019/692.
9	SUI D B, CHEN Y B, LIU K, et al. Leverage lexical knowledge for Chinese named entity recognition via collaborative graph network[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Stroudsburg, USA: Association for Computational Linguistics, 2019: 3821-3831.
10	TANG Z , WAN B Y , YANG L . Word-character graph convolution network for Chinese named entity recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, 28, 1520- 1532. doi: 10.1109/TASLP.2020.2994436
11	GUI T, YE J C, ZHANG Q, et al. Leveraging document-level label consistency for named entity recognition[C]//Proceedings of the 29th International Joint Conference on Artificial Intelligence. Yokohama, Japan: International Joint Conferences on Artificial Intelligence Organization, 2020: 3976-3982.
12	TIAN Y H , SHEN W , SONG Y , et al. Improving biomedical named entity recognition with syntactic information. BMC Bioinformatics, 2020, 21 (1): 539. doi: 10.1186/s12859-020-03834-6
13	张云秋, 汪洋, 李博诚. 基于RoBERTa-wwm动态融合模型的中文电子病历命名实体识别. 数据分析与知识发现, 2022, 6 (2/3): 242- 250.
	ZHANG Y Q , WANG Y , LI B C . Identifying named entities of chinese electronic medical records based on RoBERTa-wwm dynamic fusion mode. Data Analysis and Knowledge Discovery, 2022, 6 (2/3): 242- 250.
14	DEVLIN J, CHANG M W, LEE K, et al. BERT: pre-training of deep bidirectional Transformers for language understanding[EB/OL]. [2023-09-11]. https://arxiv.org/abs/1810.04805.
15	杨飘, 董文永. 基于BERT嵌入的中文命名实体识别方法. 计算机工程, 2020, 46 (4): 40-45, 52. doi: 10.19678/j.issn.1000-3428.0054272
	YANG P , DONG W Y . Chinese named entity recognition method based on BERT embedding. Computer Engineering, 2020, 46 (4): 40-45, 52. doi: 10.19678/j.issn.1000-3428.0054272
16	王笑月, 李茹, 段菲. 一种基于门控空洞卷积的高效中文命名实体识别方法. 中文信息学报, 2021, 35 (1): 72- 80. doi: 10.3969/j.issn.1003-0077.2021.01.010
	WANG X Y , LI R , DUAN F . An efficient Chinese named entity recognition method based on gated-dilated convolution. Journal of Chinese Information Processing, 2021, 35 (1): 72- 80. doi: 10.3969/j.issn.1003-0077.2021.01.010
17	杨长沛, 廖列法. 基于门控空洞卷积特征融合的中文命名实体识别. 计算机工程, 2023, 49 (8): 85- 95. doi: 10.19678/j.issn.1000-3428.0065455
	YANG C P , LIAO L F . Chinese named entity recognition based on dilated gated convolution feature fusion. Computer Engineering, 2023, 49 (8): 85- 95. doi: 10.19678/j.issn.1000-3428.0065455
18	LAFFERTY J, MCCALLUM A, PEREIRA F. Conditional random fields: probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the 18th International Conference on Machine Learning. Berlin, Germany: Springer, 2001: 282-289.
19	MA R T, PENG M L, ZHANG Q, et al. Simplify the usage of lexicon in Chinese NER[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 5951-5960.
20	ZHU Y, WANG G, KARLSSON B F. CAN-NER: convolutional attention network for Chinese named entity recognition[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, USA: Association for Computational Linguistics, 2019: 3384-3393.
21	CHANG N , ZHONG J , LI Q , et al. A mixed semantic features model for Chinese NER with characters and words. Advances in Information Retrieval, 2020, 12035, 356- 368.
22	LI J T, MENG K. MFE-NER: multi-feature fusion embedding for Chinese named entity recognition[EB/OL]. [2023-09-11]. http://arxiv.org/abs/2109.07877.
23	WU S, SONG X N, FENG Z H. MECT: multi-metadata embedding based cross-Transformer for Chinese named entity recognition[EB/OL]. [2023-09-11]. https://www.ijcai.org/Proceedings/2019/692.
24	GUI T, ZOU Y C, ZHANG Q, et al. A lexicon-based graph neural network for Chinese NER[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Stroudsburg, USA: Association for Computational Linguistics, 2019: 1039-1049.
25	HU D, WEI L W. SLK-NER: exploiting second-order lexicon knowledge for Chinese NER[EB/OL]. [2023-09-11]. https://arxiv.org/abs/2007.08416.
26	LI X N, YAN H, QIU X P, et al. FLAT: Chinese NER using flat-lattice Transformer[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 6836-6842.
27	李军怀, 陈苗苗, 王怀军, 等. 基于ALBERT-BGRU-CRF的中文命名实体识别方法. 计算机工程, 2022, 48 (6): 89-94, 106. doi: 10.19678/j.issn.1000-3428.0061630
	LI J H , CHEN M M , WANG H J , et al. Chinese named entity recognition method based on ALBERT-BGRU-CRF. Computer Engineering, 2022, 48 (6): 89-94, 106. doi: 10.19678/j.issn.1000-3428.0061630

[1]	党小超, 刘涧, 董晓辉, 祝忠彦, 李芬芳. 面向不平衡数据的机械设备故障命名实体识别[J]. 计算机工程, 2024, 50(9): 104-112.
[2]	张华青, 夏张涛, 陆晓庆, 童基均. 基于字形特征的血管外科命名实体识别[J]. 计算机工程, 2024, 50(8): 13-21.
[3]	李华昱, 张智康, 闫阳, 岳阳. 基于知识图谱增强的领域多模态实体识别[J]. 计算机工程, 2024, 50(8): 31-39.
[4]	隗昊, 刁宏悦, 孔亮宸, 邓耀臣. 东北亚舆情文本细粒度命名实体识别方法研究[J]. 计算机工程, 2024, 50(5): 354-362.
[5]	隗昊, 刁宏悦, 孔亮宸, 邓耀臣. 东北亚舆情文本细粒度命名实体识别方法研究[J]. 计算机工程, 2024, 50(5): 354-362.
[6]	王安政, 党建武, 岳彪, 杨景玉. 基于位置信息和注意力机制的路面裂缝检测[J]. 计算机工程, 2024, 50(4): 303-312.
[7]	刘威, 马磊, 李凯, 李蓉. 基于多粒度字形增强的中文医学命名实体识别[J]. 计算机工程, 2024, 50(2): 337-344.
[8]	高锐涛, 林达伟, 郭亮, 金鸿, 王红. 基于知识图谱的水稻种植智能问答系统设计与实现[J]. 计算机工程, 2024, 50(12): 133-141.
[9]	倪渊, 廖世豪, 张健. 基于Wobert与对抗学习的中文命名实体识别[J]. 计算机工程, 2024, 50(11): 119-129.
[10]	任义, 苏博, 袁帅. 教育领域下多维度特征命名实体识别方法[J]. 计算机工程, 2024, 50(10): 110-118.
[11]	唐卓然, 柳毅. 基于词汇融合和依存关系的中文命名实体识别[J]. 计算机工程, 2024, 50(10): 145-153.
[12]	杨长沛, 廖列法. 基于门控空洞卷积特征融合的中文命名实体识别[J]. 计算机工程, 2023, 49(8): 85-95.
[13]	张家熔, 苑津莎, 许珈宁, 罗志宏. 基于多元信息嵌入与协同神经网络的力学实体识别算法[J]. 计算机工程, 2023, 49(7): 125-134.
[14]	陈明, 刘蓉, 张晔. 基于多重注意力机制的中文医疗实体识别[J]. 计算机工程, 2023, 49(6): 314-320.
[15]	朱红, 牛浩然, 朱彤. 基于字词融合与对抗训练的行业人物实体识别[J]. 计算机工程, 2023, 49(5): 56-62.

选择文件类型/文献管理软件名称

选择包含的内容