摘要: 目前关于用户兴趣方面的研究大多数是根据用户兴趣的相似性划分用户群,缺乏对用户兴趣分布模式的度量。为此,提出一种用户兴趣分布模式度量方法。根据向量空间模型进行用户兴趣建模,利用基尼系数和洛伦茨曲线划分用户兴趣分布模式。Movielens数据集上的实验结果验证了该方法的有效性。
关键词:
用户兴趣分布模式,
基尼系数,
洛伦茨曲线,
用户兴趣建模,
向量空间模型,
分布集中度
Abstract: According to that recently the research of user interests mostly focus on dividing user into groups according to similar interests. There are still lacking of considering user distribution interests pattern and lacking of methods to measure user distribution interests pattern. Aiming at the problem, this paper proposes a user interest distribution pattern measurement method. It uses Vector Space Model(VSM) to complete user model, quotes the Gini coefficient and Lurenz curve to measure user distribution interest pattern. Result on Movielens dataset verifies the effectiveness of this method.
Key words:
user interest distribution pattern,
Gini coefficient,
Lurenz curve,
user interest modeling,
Vector Space Model(VSM),
distribution concentration degree
中图分类号:
花青松, 刘海峰, 胡铮. 基于基尼系数的用户兴趣分布模式度量方法[J]. 计算机工程, 2012, 38(22): 39-42.
HUA Jing-Song, LIU Hai-Feng, HU Zheng. User Interest Distribution Pattern Measurement Method Based on Gini Coefficient
HUA Qing-song, LIU Hai-feng, HU Zheng
(Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, Beijing 100876, China)[J]. Computer Engineering, 2012, 38(22): 39-42.