作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (8): 93-95. doi: 10.3969/j.issn.1000-3428.2010.08.033

• 软件技术与数据库 • 上一篇    下一篇

序列模式挖掘支持度阈值的确定方法

王翠青,陈未如   

  1. (沈阳化工学院计算机科学与技术学院,沈阳 110142)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-04-20 发布日期:2010-04-20

Method of Determining Support Degree Threshold in Sequential Pattern Mining

WANG Cui-qing, CHEN Wei-ru   

  1. (School of Computer Science and Technology, Shenyang Institute of Chemical Technology, Shenyang 110142)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-04-20 Published:2010-04-20

摘要: 通过对不同支持度下序列模式挖掘产生模式个数分布的研究,利用曲线拟合技术,提出一种支持度与序列模式个数的关系模型。在对客户序列数据库子集进行预挖掘的基础上,利用该模型为用户在挖掘前确定支持度阈值提供参考。在不同类型数据集上采用该方法,得到预期结果,表明该方法是正确有效的。

关键词: 数据挖掘, 序列模式挖掘, 支持度

Abstract: By studying distribution of the pattern number in sequential pattern mining using different support degree, this paper proposes a relation model of support and numbers of sequential pattern. Based on mining on subset of custom sequential database, it uses the relation model to provide users with the reference for determining threshold of the support degree. It uses this method in several different data sets, which gets the expected results, and demonstrates this method is correct and efficient.

Key words: data mining, sequential pattern mining, support degree

中图分类号: