Computer Engineering (计算机工程) ›› 2011, Vol. 37 ›› Issue (20): 27-29. doi: 10.3969/j.issn.1000-3428.2011.20.010

• Software Technology and Database •

A New Frequent Subgraph Mining Algorithm

GUO Jing-feng (郭景峰), ZHANG Wei (张伟), CHAI Ran (柴然)

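  • Received: 2011-04-20 Published: 2011-10-20 Online: 2011-10-20
  • About the authors: GUO Jing-feng (born 1962), male, professor, Ph.D., CCF member; main research interests: data mining and database technology. ZHANG Wei and CHAI Ran: master's degree.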
  • Funding:

    National Natural Science Foundation of China (60673136); Key Applied Basic Research Project of Hebei Province (10963527D)

New Algorithm of Mining Frequent Subgraph

GUO Jing-feng, ZHANG Wei, CHAI Ran   

  1. (College of Information and Engineering, Yanshan University, Qinhuangdao 066004, China)
  • Received: 2011-04-20 Online: 2011-10-20 Published: 2011-10-20

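Abstract: Traditional Apriori-based frequent subgraph mining algorithms produce a large number of redundant subgraphs. To address this problem, a new frequent subgraph mining algorithm (GAI) is proposed. A three-level MADI index structure is introduced to store the information of the graph set and thereby reduce the number of scans over it. Frequent subgraphs are constructed by extending an ETree, and candidate subgraphs are stored in tables, which avoids both the generation of redundant graphs during extension and scans of the entire database. This simplifies support computation and improves the efficiency of graph and subgraph isomorphism queries. Experimental results show that GAI mines more efficiently than the Apriori algorithm.

Keywords: Apriori algorithm, data mining, subgraph isomorphism, frequent subgraph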

Abstract: The traditional Apriori algorithm generates redundant subgraphs when mining frequent subgraphs. To resolve this problem, a new frequent subgraph mining algorithm called GAI is proposed. To reduce the number of database scans, a three-level MADI index structure is proposed to store the information of the graphs. The algorithm constructs frequent subgraphs by expanding the ETree and uses tables to store candidate subgraphs. This avoids generating redundant subgraphs during expansion and avoids scanning the entire database. It greatly simplifies the calculation of support and improves the query efficiency of graph isomorphism and subgraph isomorphism. Experimental results show that GAI has higher mining efficiency than the Apriori algorithm.

Key words: Apriori algorithm, data mining, subgraph isomorphism, frequent subgraph

CLC number:
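
The abstract's central idea, using an index over the graph set so that support can be computed without rescanning every graph, can be illustrated with a much simpler structure than the one the paper proposes. The sketch below is not the MADI index or the ETree expansion, whose layouts are not described on this page; it is a plain inverted index from labeled edges to graph ids, and the toy graphs and all function names (`edge_key`, `build_edge_index`, `frequent_edges`, `candidate_graphs`) are assumptions made for illustration.

```python
from collections import defaultdict

# Toy graph set (made up for illustration): graph id -> (vertex labels, edges).
# Each edge is (u, v, edge_label) and is treated as undirected.
GRAPHS = {
    0: ({0: "A", 1: "B", 2: "C"}, [(0, 1, "x"), (1, 2, "y")]),
    1: ({0: "A", 1: "B"}, [(0, 1, "x")]),
    2: ({0: "B", 1: "C", 2: "A"}, [(0, 1, "y"), (0, 2, "x")]),
}


def edge_key(labels, u, v, edge_label):
    """Canonical key for an undirected labeled edge (endpoint labels sorted)."""
    a, b = sorted((labels[u], labels[v]))
    return (a, edge_label, b)


def build_edge_index(graphs):
    """One pass over the graph set: labeled edge -> ids of graphs containing it."""
    index = defaultdict(set)
    for gid, (labels, edges) in graphs.items():
        for u, v, edge_label in edges:
            index[edge_key(labels, u, v, edge_label)].add(gid)
    return index


def frequent_edges(index, min_support):
    """The support of a one-edge subgraph is the size of its graph-id set."""
    return {key: gids for key, gids in index.items() if len(gids) >= min_support}


def candidate_graphs(index, edge_keys):
    """Graphs containing every edge of a candidate pattern.

    This is only an upper bound on the pattern's supporting graphs: a
    subgraph isomorphism test is still needed, but only on these graphs.
    """
    sets = [index.get(key, set()) for key in edge_keys]
    return set.intersection(*sets) if sets else set()


index = build_edge_index(GRAPHS)
freq = frequent_edges(index, min_support=2)
```

After the single indexing pass, the support of each one-edge pattern is read directly from the index, and a larger candidate only has to be tested against the graphs returned by `candidate_graphs` rather than against the whole database.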