Unlabeled Data Enhanced Position-Aware Network for Aspect-based Sentiment Classification

doi:10.19678/j.issn.1000-3428.0066017

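Abstract

Summary:

Existing research on aspect-based sentiment classification mostly trains models entirely on labeled data, or introduces document-level labeled data as a supplement, and rarely considers how unlabeled data can improve model performance. This paper proposes an Unlabeled Data Enhanced Position-Aware Network (UDE-PAN). A semi-supervised training algorithm, Cross-View Training (CVT), is introduced so that the model can also exploit unlabeled data. CVT trains the model alternately on labeled and unlabeled data to improve its representation learning, allowing the model to acquire additional knowledge from unlabeled data. In addition, a Semantic-Relative Distance (SRD) embedding layer and a Context features Dynamic Weighting (CDW) layer capture the relationship between aspect words and their context: the SRD embedding layer explicitly models position information as feature vectors, so that the context features carry aspect-specific information; the CDW layer dynamically sets weight coefficients to focus on the parts of the context most closely related to the aspect words. Experiments on two public SemEval14 datasets show that UDE-PAN reaches accuracies of 76.23% and 82.47% and Macro-F1 scores of 72.13% and 73.97%, respectively, outperforming the mainstream models it is compared with, which verifies the effectiveness of the model. With the CVT algorithm, training on unlabeled data improves accuracy by 1% on average and Macro-F1 by 2% on average, which verifies that unlabeled data can effectively enhance model performance.

Keywords: aspect-based sentiment classification, unlabeled data, position-aware, cross-view training, attention mechanism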
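The alternating scheme described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes the usual Cross-View Training setup in which a primary prediction (full view of the input) is trained with cross-entropy on labeled batches, and auxiliary predictions (restricted views) are trained on unlabeled batches to match the primary prediction, which is treated as a fixed soft target. All function names and the strict one-to-one alternation are hypothetical choices made for this sketch.

```python
import math
from itertools import cycle


def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def supervised_loss(primary_logits, gold_label):
    """Cross-entropy of the primary (full-view) prediction against the gold label."""
    return -math.log(softmax(primary_logits)[gold_label])


def consistency_loss(primary_logits, auxiliary_logits_list):
    """Sum of KL(primary || auxiliary) over all auxiliary (restricted-view) predictions.

    The primary distribution is used as a fixed soft target, so no gold label is needed.
    """
    target = softmax(primary_logits)
    loss = 0.0
    for aux_logits in auxiliary_logits_list:
        aux = softmax(aux_logits)
        loss += sum(p * math.log(p / q) for p, q in zip(target, aux) if p > 0)
    return loss


def alternating_schedule(labeled_batches, unlabeled_batches, steps):
    """Yield (kind, batch) pairs, switching between labeled and unlabeled data each step.

    Assumption: strict one-to-one alternation; the actual ratio is not stated in the abstract.
    """
    labeled, unlabeled = cycle(labeled_batches), cycle(unlabeled_batches)
    for step in range(steps):
        if step % 2 == 0:
            yield "labeled", next(labeled)
        else:
            yield "unlabeled", next(unlabeled)
```

In a training loop, a "labeled" step would minimize `supervised_loss` and an "unlabeled" step would minimize `consistency_loss`, so the shared encoder receives gradient signal from both kinds of data.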

Abstract:

In recent Aspect-based Sentiment Classification (ASC) studies, models have been trained either entirely on labeled data or with document-level labeled data as a supplement; little attention has been given to improving model performance with unlabeled data. Accordingly, the Unlabeled Data Enhanced Position-Aware Network (UDE-PAN) is proposed to improve ASC performance. Specifically, a semi-supervised training algorithm, Cross-View Training (CVT), is introduced to enable the proposed model to utilize unlabeled data. The CVT algorithm trains the model alternately on labeled and unlabeled data, improving the model's ability to learn sentence representations. Moreover, a Semantic-Relative Distance (SRD) embedding layer and a Context features Dynamic Weighting (CDW) layer are adopted to learn the relationship between aspect words and context. The SRD embedding layer explicitly models relative position information, so that the context features contain aspect-specific information. The CDW layer dynamically weights the context features to capture the parts of the context most closely related to the aspect. Experiments are conducted on two public benchmark datasets from SemEval14. UDE-PAN achieves accuracies of 76.23% and 82.47% and Macro-F1 scores of 72.13% and 73.97%, respectively, outperforming the comparison models and demonstrating the effectiveness of the proposed model. With the CVT algorithm, training on unlabeled data improves accuracy by 1% on average and Macro-F1 by 2% on average, verifying that unlabeled data can effectively enhance model performance.

Key words: Aspect-based Sentiment Classification (ASC), unlabeled data, position-aware, Cross-View Training (CVT), attention mechanism
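To make the position-aware idea concrete, the following sketch shows one way a relative-distance measure and a dynamic weighting of context features could be computed. The abstract does not give the paper's formulas, so everything here is an assumption: the distance is simplified to zero inside the aspect span and the token distance to the nearest aspect token elsewhere, and the weighting follows the general pattern of keeping full weight within a threshold and decaying linearly beyond it, loosely in the spirit of the local context focus mechanism of Zeng et al. (2019). The function names and the `threshold` parameter are hypothetical.

```python
def relative_distances(seq_len, aspect_start, aspect_len):
    """Token distance of every position to the aspect span (0 inside the span).

    Assumption: a simplified stand-in for SRD, not the paper's exact definition.
    """
    aspect_end = aspect_start + aspect_len - 1
    distances = []
    for i in range(seq_len):
        if i < aspect_start:
            distances.append(aspect_start - i)
        elif i > aspect_end:
            distances.append(i - aspect_end)
        else:
            distances.append(0)
    return distances


def dynamic_weights(distances, threshold):
    """Weight 1.0 within the threshold, linearly decaying with distance beyond it."""
    n = len(distances)
    return [1.0 if d <= threshold else 1.0 - (d - threshold) / n for d in distances]


def apply_weights(features, weights):
    """Scale each token's feature vector by that token's weight."""
    return [[w * x for x in vec] for vec, w in zip(features, weights)]


# Hypothetical example: "the food was great but the service was slow",
# with the aspect word "service" at index 6.
distances = relative_distances(seq_len=9, aspect_start=6, aspect_len=1)
weights = dynamic_weights(distances, threshold=2)
```

Here the tokens within two positions of "service" keep their full features, while earlier tokens such as "food" and "great" are progressively down-weighted. A distance list like `distances` could also index a learned embedding table, which is one plausible reading of how position information becomes a feature vector.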

Huikai JIANG, Xiaoge LI, Xiaochun AN, Tiantian WANG, Heng RUAN. Unlabeled Data Enhanced Position-Aware Network for Aspect-based Sentiment Classification[J]. Computer Engineering, 2023, 49(11): 106-114.

http://www.ecice06.com/CN/Y2023/V49/I11/106

Figures/Tables: 8

Fig.1 An illustrative example of SRD

Fig.2 Structure of the proposed model

Fig.3 Model performance with different amounts of labeled data

Fig.4 Visualization of examples

