作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (16): 50-52,5. doi: 10.3969/j.issn.1000-3428.2007.16.017

• 软件技术与数据库 • 上一篇    下一篇

MFCPLG:微阵列数据中频繁闭合模式挖掘

金 波1,缪裕青1,2   

  1. (1. 桂林电子科技大学计算机系,桂林 541004;2. 中国科学技术大学计算机科学与技术系,合肥 230026)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-08-20 发布日期:2007-08-20

MFCPLG: Mining Frequent Closed Patterns in Microarray

JIN Bo1, MIAO Yu-qing1,2   

  1. (1. Dept. of Computer Science, Guilin University of Electronic Technology, Guilin 541004; 2. Dept. of Computer Science and Technology, University of Science and Technology of China, Hefei 230026)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-08-20 Published:2007-08-20

摘要: 微阵列数据集行少列多的特征,使得传统基于列枚举空间的算法应用于其中进行频繁闭合模式挖掘时其复杂性迅速增长。基于行枚举的CARPENTER算法较好解决了该问题。但CARPENTER算法使用映射转置表(TT)来完成频繁闭合模式完全集的挖掘效率不高。该文在CARPENTER算法基础上,提出LG-tree数据结构,并基于此结构提出挖掘频繁闭合模式的新算法MFCPLG。真实数据集的实验表明,MFCPLG算法的时间性能优于CARPENTER算法。

关键词: 微阵列数据集, 频繁闭合模式, MFCPLG, LG-tree

Abstract: Because microarray dataset contains a large number of columns and a small number of rows, mining the complete set of frequent closed patterns in microarray poses a great challenge for traditional frequent closed patterns mining algorithms which are based on column-enumerate. CARPENTER, based on row-enumerate, has addressed this problem. However, CARPENTER is inefficient by using the transposed table (TT) to mine the complete set of frequent closed patterns in microarray. A new algorithm, MFCPLG, is proposed inspired by CARPENTER, and a LG-tree structure is suggested. Several experiments are performed on real-life microarray data to show that the MFCPLG algorithm is faster than the CARPENTER algorithm.

Key words: microarray dataset, frequent closed patterns, MFCPLG, LG-tree

中图分类号: