作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (11): 119-121,124.

• 软件技术与数据库 • 上一篇    下一篇

基于网格聚类技术的离群点挖掘算法

曹洪其1,余岚 2,孙志挥2   

  1. 1.南通职业大学电子工程系,南通 226007;2.东南大学计算机科学与工程系,南京 210096
  • 出版日期:2006-06-05 发布日期:2006-06-05

An Algorithm of Outliers Mining Based on GridClustering Techniques

CAO Hongqi1,YU Lan2,SUN Zhihui2   

  1. 1. Department of Electronic Engineering, Nantong Vocational College, Nantong 226007;2. Department of Computer Science and Engineering, Southeast University, Nanjing 210096
  • Online:2006-06-05 Published:2006-06-05

摘要: 针对离群点的挖掘,在现有的LOF 算法的基础上,提出了一种基于网格聚类技术的离群点挖掘算法AOMGC。该算法将离群点挖掘分成两步挖掘过程。此外,该算法对其网格的划分加以改进,并能根据数据信息自动生成划分间隔,从而提高了数据挖掘的效率。实验结果表明AOMGC 算法是可行的和有效的。

关键词: 数据挖掘;离群点;局部偏离因子;网格

Abstract: This paper aims at outlier mining, and proposes an algorithm of outlier mining called AOMGC based on grid clustering techniques, withthe existing algorithm of LOF. In this algorithm, the whole outlier mining is divided into two mining steps. In addition, this algorithm modifies themethods of grids partition cells. Also, it can automatically form partition intervals according to the data information, which enhances the efficiencyof data mining. The results of experiments indicate that AOMGC is adoptable and effective.

Key words: Data mining; Outliers; Local outlier factor; Grid