作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (2): 60-62. doi: 10.3969/j.issn.1000-3428.2011.02.021

• 软件技术与数据库 • 上一篇    下一篇

微阵列数据中Top-k频繁闭合项集挖掘

史建军1,缪裕青1,2   

  1. (1. 桂林电子科技大学计算机与控制学院,广西 桂林 541004;2. 阿德莱德大学计算机科学学院,南澳大利亚 5005)
  • 出版日期:2011-01-20 发布日期:2011-01-25
  • 作者简介:史建军(1977-),男,硕士研究生,主研方向:数据挖掘;缪裕青,副教授、博士研究生
  • 基金资助:
    国家留学基金资助项目

Top-k Frequent Closed Item Set Mining in Microarray Data

SHI Jian-jun1, MIAO Yu-qing 1,2   

  1. (1. School of Computer and Control, Guilin University of Electronic Technology, Guilin 541004, China;2. School of Computer Science, The University of Adelaide, South Australia 5005, Australia)
  • Online:2011-01-20 Published:2011-01-25

摘要: 现有大部分微阵列数据中频繁闭合项集的挖掘需要事先给定最小支持度,但在实际应用中该最小支持度很难确定。针对该问题,提出top-k频繁闭合项集挖掘算法,基于自顶向下宽度优先搜索策略挖掘项集长度不小于min_l的top-k频繁闭合项集,并对搜索空间进行有效修剪,从而提高搜索速度。实验结果表明,该算法的时间性能在多数情况下优于CARPENTER算法。

关键词: 微阵列数据, top-k频繁闭合项集, 自顶向下, 宽度优先

Abstract: Most previous mining frequent closed item sets require the specification of a minimum support threshold in microarry data. However, it is difficult for users to provide an appropriate minimum support threshold in practice. Aiming at this problem, this paper presents a top-k frequent closed item set and an algorithm in microarray data. The algorithm uses top-down breadth-first search strategy to mining top-k frequent closed item set of length no less than given value min_l and pruning the search space effectively to improve the search speed. Experimental result shows that the time performance of this algorithm outperforms the CARPENTER algorithm in most cases.

Key words: microarray data, top-k frequent closed item set, top-down, breadth-first

中图分类号: