
Computer Engineering ›› 2024, Vol. 50 ›› Issue (4): 121-131. doi: 10.19678/j.issn.1000-3428.0068307

• Artificial Intelligence and Pattern Recognition •

Sequence Recommendation Method Based on RoBERTa and Graph-Enhanced Transformer

Minghu WANG, Zhikui SHI*, Jia SU, Xinsheng ZHANG

  1. College of Management, Xi'an University of Architecture and Technology, Xi'an 710055, Shaanxi, China
  • Received: 2023-08-29 Online: 2024-04-15 Published: 2024-01-25
  • Contact: Zhikui SHI
  • Funding: Key Industrial Innovation Chain (Cluster) Project in the Industrial Field, Shaanxi Provincial Department of Science and Technology (2022ZDLGY16-04); Key Project in Philosophy and Social Sciences, Shaanxi Provincial Department of Education (21JZ035); Natural Science Special Project of Xi'an University of Architecture and Technology (1609720032)

Abstract:

Since the emergence of recommendation systems, limited data has constrained the further development of recommendation algorithms. To reduce the impact of data sparsity and make better use of non-rating data, text-based recommendation models built on neural networks have been proposed in succession. However, the mainstream convolutional and recurrent neural networks have clear disadvantages in understanding text semantics and capturing long-distance dependencies. To better mine the deep latent features between users and items and further improve recommendation quality, a sequence recommendation method based on RoBERTa and a Graph-enhanced Transformer (RGT) is proposed. The model incorporates review text data: it first uses a pre-trained RoBERTa model to capture the semantic features of words in the review text, thereby building an initial model of the user's personalized interests. Next, based on the historical interactions between users and items, it constructs a graph attention network over an item association graph with temporal characteristics. Using the graph-enhanced Transformer method, the item feature representations learned by the graph model are fed as a sequence into a Transformer encoding layer. Finally, the resulting output vectors, together with the previously captured semantic representation and the computed whole-graph representation of the item association graph, are fed into a fully connected layer to capture the user's global interest preferences and predict the user's rating of an item. Experimental results on three real-world Amazon public datasets demonstrate that, compared with several classical text recommendation models such as DeepFM and ConvMF, the proposed model achieves significant improvements in Root Mean Square Error (RMSE) and Mean Absolute Error (MAE), outperforming the best baseline by up to 4.7% and 5.3%, respectively.
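To make the pipeline concrete, the following is a minimal PyTorch sketch of the three stages described above: RoBERTa encoding of review text, graph attention over the item association graph, and a Transformer encoder over the resulting item sequence, fused for rating prediction. All module names, dimensions, the single-head graph attention, the mean-pooled whole-graph readout, and the single-user batch are illustrative assumptions, not the authors' released implementation.

```python
# A minimal sketch of the RGT pipeline, under the assumptions stated above.
import torch
import torch.nn as nn
from transformers import RobertaModel


class SimpleGraphAttention(nn.Module):
    """Single-head graph attention over the item association graph (a
    simplified stand-in for the paper's graph attention network)."""

    def __init__(self, dim):
        super().__init__()
        self.W = nn.Linear(dim, dim, bias=False)
        self.a = nn.Linear(2 * dim, 1, bias=False)

    def forward(self, h, adj):
        # h: (num_items, dim); adj: (num_items, num_items) 0/1 matrix,
        # assumed to include self-loops so every row has a neighbor.
        Wh = self.W(h)
        n = Wh.size(0)
        pair = torch.cat(
            [Wh.unsqueeze(1).expand(n, n, -1),
             Wh.unsqueeze(0).expand(n, n, -1)], dim=-1)
        e = torch.relu(self.a(pair)).squeeze(-1)    # attention logits
        e = e.masked_fill(adj == 0, float("-inf"))  # restrict to graph edges
        alpha = torch.softmax(e, dim=-1)
        return alpha @ Wh                           # updated item features


class RGT(nn.Module):
    def __init__(self, dim=768):
        super().__init__()
        # 1) Pre-trained RoBERTa for review-text semantics.
        self.roberta = RobertaModel.from_pretrained("roberta-base")
        # 2) Graph attention over the item association graph.
        self.gat = SimpleGraphAttention(dim)
        # 3) Transformer encoder over the graph-enhanced item sequence.
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=8,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        # 4) Fully connected head over review + sequence + graph readouts.
        self.head = nn.Linear(3 * dim, 1)

    def forward(self, review_ids, review_mask, item_feats, adj):
        # Semantic representation of the user's review text (CLS vector);
        # a single-user batch is assumed for simplicity.
        text = self.roberta(input_ids=review_ids,
                            attention_mask=review_mask
                            ).last_hidden_state[:, 0].squeeze(0)
        items = self.gat(item_feats, adj)
        seq = self.encoder(items.unsqueeze(0)).squeeze(0)
        seq_last = seq[-1]                 # latest item in the sequence
        graph_global = items.mean(dim=0)   # whole-graph representation
        fused = torch.cat([text, seq_last, graph_global], dim=-1)
        return self.head(fused)            # predicted rating
```

Constructing item_feats and the temporally ordered, self-loop-augmented adjacency matrix adj from the user-item interaction history is left to the caller, since the abstract does not specify those details.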

Key words: recommendation algorithm, review text, RoBERTa model, graph attention mechanism, Transformer mechanism
