作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2019, Vol. 45 ›› Issue (10): 215-220. doi: 10.19678/j.issn.1000-3428.0052499

• 人工智能及识别技术 • 上一篇    下一篇

融合本体语义与用户属性的协同过滤算法

王光, 姜丽, 董帅含, 李丰   

  1. 辽宁工程技术大学 软件学院, 辽宁 葫芦岛 125105
  • 收稿日期:2018-08-27 修回日期:2018-10-23 出版日期:2019-10-15 发布日期:2018-11-01
  • 作者简介:王光(1979-),男,副教授、硕士,主研方向为智能数据处理、数据挖掘、自然语言处理;姜丽(通信作者)、董帅含,硕士研究生;李丰,学士。
  • 基金资助:
    国家自然科学基金(71371091);国家自然科学基金青年科学基金(61401185)。

Collaborative Filtering Algorithm Combining Ontology Semantics and User Attribute

WANG Guang, JIANG Li, DONG Shuaihan, LI Feng   

  1. School of Software, Liaoning Technical University, Huludao, Liaoning 125105, China
  • Received:2018-08-27 Revised:2018-10-23 Online:2019-10-15 Published:2018-11-01

摘要: 传统协同过滤推荐算法在处理海量数据时存在数据稀疏性和项目长尾效应,导致推荐精度较低。针对该问题,结合本体语义和用户属性,提出一种改进的协同过滤算法。利用本体计算项目之间的语义相似度,构建项目相似度矩阵,同时引入用户属性计算用户相似度矩阵。通过融合本体语义和用户属性形成用户-项目评分矩阵,并对该矩阵的预测评分进行加权处理,生成TOP-N推荐结果。实验结果表明,相比传统皮尔逊相似度计算协同过滤算法、基于本体语义的协同过滤算法和基于评分矩阵填充与用户兴趣的协同过滤算法,该算法的平均绝对误差较低,准确率较高,综合性能及新颖度较优。

关键词: 协同过滤, 本体语义, 用户属性, 相似度, 长尾效应

Abstract: When dealing with massive data,the traditional collaborative filtering recommendation algorithm has the data sparsity and the long tail effect of the items,resulting in low recommendation accuracy.Aiming at this problem,combining ontology semantics and user attributes,this paper proposes an improved collaborative filtering algorithm.The item similarity matrix is constructed by using ontology to calculate semantic similarity between items.User attributes are introduced to calculate user similarity matrix.The user-item scoring matrix is formed by integrating ontology semantics and user attributes,and the prediction score of the matrix is weighted to provide the TOP-N recommendation results.Experimental results show that compared with Pearson similarity calculation collaborative filtering algorithm,collaborative filtering algorithm based on ontology semantics,and collaborative filtering algorithm based on scoring matrix filling and user interest,the proposed algorithm has the lowest mean absolute error and the highest precision,and its integrity and novelty are superior.

Key words: collaborative filtering, ontology semantics, user attribute, similarity, long tail effect

中图分类号: