Chinese Named Entity Recognition Method Based on Multi-Feature Hierarchical Fusion

doi:10.19678/j.issn.1000-3428.0260183

Abstract

Named Entity Recognition (NER) aims to identify entities that have predefined semantic categories and clear boundaries in text. In Chinese NER, the absence of explicit word boundaries, the complexity of semantic expression, and the prevalence of polyphonic and visually similar characters often cause semantic ambiguity. Existing methods rely mainly on character- or word-level information and make little use of key linguistic features such as pinyin and radicals. They also typically fuse multi-source heterogeneous features by simple concatenation or weighting, which fails to capture deep semantic correlations among features and limits further performance gains. To address these issues, this paper proposes a Chinese NER method based on Multi-Feature Hierarchical Fusion (MFHF) that jointly models and deeply integrates multi-dimensional linguistic features. In the feature representation stage, four types of embeddings are constructed: character, pinyin, radical, and lexical. Character embeddings are derived from a pre-trained language model to capture contextual semantics and long-range dependencies. Pinyin embeddings encode phonetic sequences to model pronunciation differences and alleviate polyphonic ambiguity. Radical embeddings use a convolutional neural network to model character structure and extract fine-grained semantic features at the glyph level. Lexical embeddings introduce word-level information through a lexicon matching mechanism to improve the detection of multi-character entity boundaries. Together, these embeddings enrich character representations from phonetic, glyph, and lexical semantic perspectives.
To address insufficient interaction and coarse granularity in multi-source feature fusion, a hierarchical cross-attention mechanism is designed. At the local level, two groups of cross-attention, pinyin-radical and character-lexical, use bidirectional attention to model the intrinsic relationship between phonetic and glyph information and the structural dependencies between character-level and word-level semantics, enabling fine-grained alignment and complementarity among heterogeneous features. At the global level, the locally enhanced features are concatenated and passed through a multi-head self-attention mechanism to capture long-range dependencies across features, producing deeply integrated, semantically enriched representations. On this basis, a joint optimization strategy combining multi-task learning and adversarial training is introduced: auxiliary pinyin-prediction and radical-prediction tasks strengthen feature learning, and gradient-based adversarial perturbations are applied in the embedding space to improve robustness and generalization under complex conditions. Finally, the fused representations are fed into a Bidirectional Long Short-Term Memory (BiLSTM) network for sequence modeling, and a Conditional Random Field (CRF) layer performs global decoding to produce the entity recognition results. Experiments on three public Chinese NER datasets, MSRA, Resume, and Weibo, compare both overall performance and performance by entity type. MFHF achieves F1 scores of 96.78%, 96.14%, and 71.80% on MSRA, Resume, and Weibo, respectively, outperforming several representative baseline models. It improves on CPL-NER by 1.09 percentage points on MSRA, on GS-Lexicon by 1.55 percentage points on Weibo, and on Lattice-LSTM by 1.68 percentage points on Resume.
In summary, through multi-feature hierarchical fusion and joint optimization, the proposed approach effectively improves semantic modeling capability and model robustness for Chinese NER.
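
The two-level fusion described in the abstract can be illustrated with a small standard-library Python sketch. It is not the authors' implementation and rests on several assumptions: all four feature sequences share one length and one embedding dimension, attention uses no learned projection matrices, a single attention head stands in for the multi-head self-attention at the global level, and residual connections are added for illustration. The function names (`attend`, `cross_fuse`, `hierarchical_fuse`) are hypothetical.

```python
import math
import random
from typing import List, Tuple

Matrix = List[List[float]]  # one row per character position


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of a (n x k) and b (k x m)."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def softmax(row: List[float]) -> List[float]:
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries: Matrix, context: Matrix) -> Matrix:
    """Scaled dot-product attention; context serves as both keys and values.

    Assumption: no learned Q/K/V projections, so both inputs must share
    the same embedding dimension.
    """
    scale = math.sqrt(len(queries[0]))
    keys_t = [list(col) for col in zip(*context)]
    scores = matmul(queries, keys_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, context)


def add(a: Matrix, b: Matrix) -> Matrix:
    """Element-wise sum, used here for residual connections."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def cross_fuse(x: Matrix, y: Matrix) -> Tuple[Matrix, Matrix]:
    """Bidirectional cross-attention: x attends to y, and y attends to x."""
    return add(x, attend(x, y)), add(y, attend(y, x))


def hierarchical_fuse(char: Matrix, pinyin: Matrix,
                      radical: Matrix, lexical: Matrix) -> Matrix:
    # Local level: two cross-attention groups (pinyin-radical, character-lexical).
    pinyin_l, radical_l = cross_fuse(pinyin, radical)
    char_l, lexical_l = cross_fuse(char, lexical)
    # Global level: concatenate the enhanced features at each position, then
    # apply self-attention (single head here, a simplification of multi-head).
    merged = [c + w + p + r
              for c, w, p, r in zip(char_l, lexical_l, pinyin_l, radical_l)]
    return add(merged, attend(merged, merged))


# Toy usage with random features standing in for real embeddings.
random.seed(0)
seq_len, dim = 5, 4


def rand_feats() -> Matrix:
    return [[random.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(seq_len)]


char, pinyin, radical, lexical = (rand_feats() for _ in range(4))
fused = hierarchical_fuse(char, pinyin, radical, lexical)
print(len(fused), len(fused[0]))  # 5 positions, 4 * dim = 16 features each
```

In the described model, the fused sequence would then pass to the BiLSTM and CRF layers. A practical version would use learned projections and multiple heads from a deep learning framework.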

摘要： 命名实体识别旨在从文本中准确识别具有特定语义类别及明确边界的实体。针对中文文本中缺乏显式词边界、语义表达复杂以及多音字、形近字广泛存在所带来的语义歧义问题，现有方法多依赖字符或词汇信息进行建模，对拼音、部首等关键语言学特征的利用仍显不足，同时在多源异构特征融合过程中多采用简单拼接或加权方式，难以充分挖掘不同特征之间的深层语义关联，进而限制了模型整体性能的进一步提升。本文提出一种基于多特征层次化融合（Multi-Feature Hierarchical Fusion，MFHF）的中文命名实体识别方法，以实现多维语言学特征的协同建模与深度语义融合。首先，在特征表示阶段构建字符、拼音、部首和词汇四类嵌入表示，其中字符嵌入基于预训练语言模型获取上下文语义信息并捕获长距离依赖关系，拼音嵌入通过编码拼音序列刻画语音差异，有助于缓解多音字歧义问题，部首嵌入利用卷积神经网络对汉字结构进行建模，从字形层面提取细粒度语义特征，词汇嵌入基于词典匹配机制引入词级信息，以增强模型对多字实体边界的感知能力，从而从语音、字形和词汇语义多个维度提升字符表示能力。其次，为解决多源特征融合过程中信息交互不足及融合粒度单一问题，设计层次化交叉注意力机制，在局部层面构建拼音—部首与字符—词汇两组交叉注意力，通过双向注意力交互分别建模音形之间的内在关联以及字级与词级语义之间的结构关系，使不同模态特征能够在细粒度层面实现信息对齐与互补，在全局层面，将经过局部增强的多源特征进行拼接，并引入多头自注意力机制对其进行统一建模，实现跨特征的深层语义整合，从而获得兼具多维信息的语义增强表示。在此基础上，引入多任务学习与对抗训练的联合优化策略，通过拼音预测和部首预测辅助任务强化特征学习，并在嵌入空间加入基于梯度的对抗扰动，以提升模型在复杂环境下的鲁棒性与泛化能力。最后，将融合后的特征表示输入BiLSTM进行序列建模，并通过条件随机场进行全局解码，从而获得实体识别结果。为验证所提方法的有效性，在MSRA、Weibo和Resume三个公开中文命名实体识别数据集上进行实验评估，并从整体性能与不同实体类型两个层面进行对比分析。实验结果表明，MFHF模型在MSRA、Resume和Weibo数据集上分别取得96.78%、96.14%和71.80%的F1值，整体性能优于多种代表性基线模型。其中，在MSRA数据集上相较于CPL-NER模型取得1.09个百分点的性能提升，在Weibo数据集上较GS-Lexicon模型提升1.55个百分点，在Resume数据集上相较于Lattice-LSTM模型取得1.68个百分点的提升。综上，本文方法通过多特征层次化融合与联合优化策略，有效提升了中文命名实体识别的语义建模能力与模型鲁棒性。

Wu Yongqing, Zhang Han. Chinese Named Entity Recognition Method Based on Multi-Feature Hierarchical Fusion[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260183.

吴永庆, 张涵. 基于多特征层次化融合的中文命名实体识别方法[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0260183.


URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260183

