Self-Supervised Sequence Recommendation Algorithm Based on Personalized Data Augmentation

doi:10.19678/j.issn.1000-3428.0069636

Abstract:

Sequential recommendation algorithms dynamically model a user's historical behavior to predict content the user may be interested in. This study focuses on contrastive self-supervised learning (SSL) for sequential recommendation and designs effective self-supervised signals to strengthen the model's representation ability when data are sparse. First, because random data augmentation tends to introduce noise, a personalized data augmentation method that incorporates user preferences is proposed: user ratings guide the augmentation process, and different combinations of augmentation methods are applied to long and short sequences, so that the augmented sequences remain consistent with user preferences. Second, to alleviate imbalanced feature learning during training, a mixed-augmentation training method is designed. In the early stage of training, augmented sequences are generated with randomly selected augmentation methods to improve model performance and generalization; in the later stage, augmented sequences with high similarity to the original sequence are selected so that the model comprehensively learns users' actual preferences and behavior patterns. Finally, the traditional sequence prediction objective is combined with the SSL objective to infer user representations. Experiments on the Beauty, Toys, and Sports datasets show that, compared with the best baseline results, the proposed method improves HR@5 by 6.61%, 3.11%, and 3.76% and NDCG@5 by 11.40%, 3.50%, and 2.16%, respectively, which supports the rationality and effectiveness of the method.

Keywords: sequential recommendation, self-supervised learning (SSL), data augmentation, recommender system, data features
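The two mechanisms summarized in the abstract, rating-guided augmentation with different operator sets for long and short sequences, and a training schedule that switches from random selection to similarity-based selection, can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the three operators (mask, crop, reorder), the way ratings steer them, the operator sets per sequence length, the ratios, the mask id `MASK`, the use of `difflib.SequenceMatcher` as the similarity measure, and the parameter names `k` (long/short threshold) and `switch_epoch` (stage switch). Sequences are assumed to be non-empty lists of item ids with one rating per item.

```python
import random
from difflib import SequenceMatcher

MASK = 0  # hypothetical id reserved for masked items


def mask_low_rated(seq, ratings, rng, ratio=0.3):
    """Mask the lowest-rated items (rng is unused; kept for a uniform signature)."""
    n_mask = int(len(seq) * ratio)
    # Indices sorted by ascending rating; ties are broken by position.
    order = sorted(range(len(seq)), key=lambda i: (ratings[i], i))
    drop = set(order[:n_mask])
    return [MASK if i in drop else item for i, item in enumerate(seq)]


def crop_high_rated(seq, ratings, rng, ratio=0.6):
    """Keep the contiguous window with the highest rating sum (rng is unused)."""
    win = max(1, int(len(seq) * ratio))
    start = max(range(len(seq) - win + 1),
                key=lambda s: sum(ratings[s:s + win]))
    return seq[start:start + win]


def reorder_span(seq, ratings, rng, ratio=0.3):
    """Shuffle a random contiguous span; ratings are not used by this operator."""
    win = max(1, int(len(seq) * ratio))
    start = rng.randrange(len(seq) - win + 1)
    span = seq[start:start + win]
    rng.shuffle(span)
    return seq[:start] + span + seq[start + win:]


def candidate_views(seq, ratings, rng, k):
    """Assumed rule: sequences longer than k use all operators, shorter ones skip cropping."""
    if len(seq) > k:
        ops = [mask_low_rated, crop_high_rated, reorder_span]
    else:
        ops = [mask_low_rated, reorder_span]
    return [op(seq, ratings, rng) for op in ops]


def similarity(a, b):
    """Order-aware similarity in [0, 1]; a stand-in for the paper's measure."""
    return SequenceMatcher(None, a, b).ratio()


def select_view(seq, views, epoch, switch_epoch, rng):
    """Early epochs: pick a view at random. Later epochs: pick the most similar view."""
    if epoch < switch_epoch:
        return rng.choice(views)
    return max(views, key=lambda v: similarity(seq, v))


if __name__ == "__main__":
    rng = random.Random(0)
    seq, ratings = [1, 2, 3, 4, 5, 6], [5, 1, 4, 2, 5, 3]
    views = candidate_views(seq, ratings, rng, k=4)
    print(select_view(seq, views, epoch=0, switch_epoch=5, rng=rng))
    print(select_view(seq, views, epoch=10, switch_epoch=5, rng=rng))
```

In a full training loop, the selected views would feed a contrastive loss that is added to the next-item prediction loss. That part is omitted here because the abstract does not specify the loss form or its weighting.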

WANG Shuai (王帅), SHI Yancui (史艳翠). Self-Supervised Sequence Recommendation Algorithm Based on Personalized Data Augmentation[J]. Computer Engineering (计算机工程), 2025, 51(8): 190-202.

https://www.ecice06.com/CN/Y2025/V51/I8/190

Figures/Tables (13)

Fig.1 Self-supervised sequence recommendation model framework

Fig.2 Personalized data augmentation method

Fig.3 Effect of different proportions on each augmentation method

Fig.4 Effect of user preferences on the augmentation methods

Fig.5 Effect of the long/short sequence threshold K

Fig.6 Effect of different training methods

Fig.7 Effect of the training-method switching threshold E

Fig.8 Effect of data sparsity
