
Computer Engineering (计算机工程) ›› 2024, Vol. 50 ›› Issue (11): 98-106. doi: 10.19678/j.issn.1000-3428.0068532

• Artificial Intelligence and Pattern Recognition •

  • Funding:
    Natural Science Foundation of Xinjiang Uygur Autonomous Region (2023D01C17); Natural Science Foundation of Xinjiang Uygur Autonomous Region (2022D01C692); National Natural Science Foundation of China (62262064); National Natural Science Foundation of China (61862060); Xinjiang Uygur Autonomous Region Natural Science Foundation Resource-Sharing Platform Construction Project (PT2323); Xinjiang Meteorological Bureau Guiding Project (YD202212); Labor Dispatch Management Information System (202212140030)

Recommendation Algorithm Based on Interlayer Fusion Filter and Social Neural Citation Network

YANG Xingyao1,*(), LI Zhilin1, ZHANG Zulian2, YU Jiong1, CHEN Jiaying1, WANG Dongxiao1   

  1. School of Software, Xinjiang University, Urumqi 830091, Xinjiang, China
    2. Xinjiang Xinnong Network Information Center, Meteorological Bureau of Xinjiang Uygur Autonomous Region, Urumqi 830002, Xinjiang, China
  • Received: 2023-10-09 Online: 2024-11-15 Published: 2024-03-06
  • Contact: YANG Xingyao


Abstract:

Recommendation algorithms address information overload, and citation recommendation automatically matches a list of candidate papers to a given citation context. Existing neural citation network models suffer from text noise and insufficient context learning when preprocessing citation context data. Therefore, this study proposes FS-Rec, a recommendation algorithm based on an interlayer fusion filter and a social neural citation network. First, the citation context is preprocessed using a Bidirectional Encoder Representations from Transformers (BERT) model equipped with interlayer fusion filters, which extract meaningful features from all frequencies in the frequency domain, thereby mitigating noise in the citation context data; multilayer information is simultaneously fused in the frequency domain, enhancing the capability of contextual representation learning. Then, social relationships are introduced into the citation author embeddings, and these, together with the other citation information embeddings, are encoded into representations, which are fused with the BERT-pretrained citation context representation to obtain the final representation. Finally, citation text predictions are generated from the final representation. Experimental results demonstrate that, compared with existing context-based citation recommendation models, FS-Rec achieves higher Recall and Mean Reciprocal Rank (MRR) on the two benchmark datasets arXivCS and PubMed, proving the effectiveness of the model.
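The abstract describes filtering the citation context in the frequency domain (extracting features from all frequencies to suppress noise) and fusing multiple encoder layers there. A minimal sketch of these two ideas, assuming an FFT-based learnable filter applied to BERT hidden states (function names, shapes, and the softmax layer-weighting are illustrative, not the authors' implementation):

```python
import numpy as np

def frequency_filter_layer(x, w):
    """Filter a sequence of hidden states in the frequency domain.

    x: (seq_len, hidden) real-valued hidden states from one encoder layer.
    w: (seq_len // 2 + 1, hidden) complex learnable filter weights.
    """
    x_f = np.fft.rfft(x, axis=0)                     # to frequency domain
    x_f = x_f * w                                    # elementwise learnable filter
    return np.fft.irfft(x_f, n=x.shape[0], axis=0)   # back to the token domain

def fuse_layers(layers, alphas):
    """Fuse several encoder layers in the frequency domain.

    layers: list of (seq_len, hidden) hidden-state arrays.
    alphas: (n_layers,) learnable scores, softmax-normalized into fusion weights.
    """
    a = np.exp(alphas - alphas.max())
    a = a / a.sum()                                  # softmax over layers
    spec = sum(w * np.fft.rfft(h, axis=0) for w, h in zip(a, layers))
    return np.fft.irfft(spec, n=layers[0].shape[0], axis=0)
```

With an all-ones filter, `frequency_filter_layer` reduces to an identity map, which is a convenient sanity check; in practice `w` and `alphas` would be trained jointly with the rest of the model.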

Key words: filter, self-attention mechanism, Bidirectional Encoder Representations from Transformers (BERT), citation recommendation, pre-trained language model
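The evaluation metrics named in the abstract, Recall@k and Mean Reciprocal Rank (MRR), are standard ranking metrics; a sketch of how they are typically computed over per-query candidate rankings (function names and data layout are ours, not from the paper):

```python
def mean_reciprocal_rank(ranked_lists, relevant):
    """MRR: average of 1 / rank of the true cited paper in each ranking.

    ranked_lists: one list of candidate paper IDs per query, best first.
    relevant: the ground-truth cited paper ID for each query.
    """
    reciprocal_ranks = []
    for ranking, target in zip(ranked_lists, relevant):
        try:
            reciprocal_ranks.append(1.0 / (ranking.index(target) + 1))
        except ValueError:                 # target not retrieved at all
            reciprocal_ranks.append(0.0)
    return sum(reciprocal_ranks) / len(reciprocal_ranks)

def recall_at_k(ranked_lists, relevant, k):
    """Recall@k: fraction of queries whose true paper appears in the top k."""
    hits = sum(target in ranking[:k]
               for ranking, target in zip(ranked_lists, relevant))
    return hits / len(ranked_lists)
```

For example, if the true paper sits at rank 2 for one query and is missed entirely for another, MRR is (1/2 + 0) / 2 = 0.25.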