
Computer Engineering ›› 2021, Vol. 47 ›› Issue (8): 78-83, 92. doi: 10.19678/j.issn.1000-3428.0058838

• Artificial Intelligence and Pattern Recognition •

BERT-Based Chinese Named Entity Recognition Method in Motor Field

GU Yiran1, HUO Jianlin1, YANG Haigen2, LU Yifei1, GUO Yuwen1

  1. College of Automation & College of Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing 210023, China;
    2. Engineering Research Center of Wideband Wireless Communication Technology, Ministry of Education, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • Received: 2020-07-06  Revised: 2020-08-11  Published: 2020-08-27
  • About the authors: GU Yiran (b. 1972), female, professor, Ph.D.; her main research interests are complex networks and embedded systems. HUO Jianlin is a master's student; YANG Haigen is an associate professor with a Ph.D.; LU Yifei and GUO Yuwen are master's students.
  • Funding: National ministry-level fund.

Abstract: To address the relatively low accuracy of entity recognition in the motor field, a Chinese Named Entity Recognition (NER) method incorporating the BERT pre-training language model is proposed. The BERT model is used to enhance the semantic representation of characters and to dynamically generate character vectors according to contextual features. The character-vector sequence is then fed into a Bidirectional Long Short-Term Memory (BiLSTM) neural network for bidirectional encoding, and the entity recognition results are labeled by a Conditional Random Field (CRF) layer. A self-built data set is annotated according to the characteristics of motor-related texts, and the entities are divided into four categories: physical objects, characteristic descriptions, problems/faults, and methods/technologies. Experimental results show that, compared with entity recognition methods based on BiLSTM-CRF, BiLSTM-CNN and BiGRU, the proposed method achieves higher accuracy, recall and F1 value, and it effectively solves the problems of insufficient annotated data and fuzzy entity boundaries in NER tasks for the motor field.
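To make the pipeline described in the abstract concrete, the following is a minimal sketch of a BERT-BiLSTM-CRF tagger, not the authors' implementation: it assumes PyTorch, the Hugging Face transformers package with the public bert-base-chinese checkpoint, and the third-party pytorch-crf package; the BIO tag set, hidden size, and example sentence are illustrative assumptions only.

import torch.nn as nn
from transformers import BertModel, BertTokenizerFast
from torchcrf import CRF  # third-party package: pip install pytorch-crf

# Illustrative BIO tag set for the four entity categories named in the abstract
TAGS = ["O",
        "B-OBJ", "I-OBJ",          # physical objects
        "B-CHAR", "I-CHAR",        # characteristic descriptions
        "B-FAULT", "I-FAULT",      # problems/faults
        "B-METHOD", "I-METHOD"]    # methods/technologies

class BertBiLstmCrf(nn.Module):
    def __init__(self, bert_name="bert-base-chinese", lstm_hidden=256, num_tags=len(TAGS)):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)   # context-dependent character vectors
        self.lstm = nn.LSTM(self.bert.config.hidden_size, lstm_hidden,
                            batch_first=True, bidirectional=True)
        self.emit = nn.Linear(2 * lstm_hidden, num_tags)    # per-character tag scores
        self.crf = CRF(num_tags, batch_first=True)          # sequence-level label decisions

    def _emissions(self, input_ids, attention_mask):
        # BERT generates a character vector for each position according to its context
        hidden = self.bert(input_ids, attention_mask=attention_mask).last_hidden_state
        # BiLSTM encodes the character-vector sequence in both directions
        encoded, _ = self.lstm(hidden)
        return self.emit(encoded)

    def forward(self, input_ids, attention_mask, tags):
        # Training objective: negative log-likelihood of the gold tag sequence under the CRF
        emissions = self._emissions(input_ids, attention_mask)
        return -self.crf(emissions, tags, mask=attention_mask.bool())

    def decode(self, input_ids, attention_mask):
        # Viterbi decoding returns the most likely tag sequence for each sentence
        emissions = self._emissions(input_ids, attention_mask)
        return self.crf.decode(emissions, mask=attention_mask.bool())

# Interface demo on a hypothetical motor-domain sentence (the model here is untrained,
# so the predicted tags are meaningless until it is fine-tuned on labeled data).
tokenizer = BertTokenizerFast.from_pretrained("bert-base-chinese")
model = BertBiLstmCrf()
batch = tokenizer(["电机轴承过热导致转速下降"], return_tensors="pt")
predicted = model.decode(batch["input_ids"], batch["attention_mask"])
print([TAGS[i] for i in predicted[0]])

The CRF layer models dependencies between adjacent tags, which is what allows valid BIO transitions to be enforced on top of the BiLSTM's per-character scores.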

Key words: Named Entity Recognition (NER), BERT pre-training language model, motor field, deep learning, transfer learning

CLC Number: