作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (3): 33-35. doi: 10.3969/j.issn.1000-3428.2011.03.012

• 软件技术与数据库 • 上一篇    下一篇

基于分形理论的离群点检测

孙金花1,胡 健2,李向阳2   

  1. (1. 哈尔滨理工大学管理学院,哈尔滨 150040;哈尔滨工业大学经济与管理学院,哈尔滨 150001)
  • 出版日期:2011-02-05 发布日期:2011-01-28
  • 作者简介:孙金花(1979-),女,讲师、博士,主研方向:分形理论,创新理论;胡 健,博士;李向阳,教授、博士生导师
  • 基金资助:
    国家教育部博士点基金资助项目(20060213004)

Outlier Detection Based on Fractal Theory

SUN Jin-hua 1, HU Jian 2, LI Xiang-yang 2   

  1. (1. School of Management, Harbin University of Science and Technology, Harbin 150040, China; 2. School of Economics and Management, Harbin Institute of Technology, Harbin 150001, China)
  • Online:2011-02-05 Published:2011-01-28

摘要: 现有离群点数据挖掘算法在高维空间效率比较低,针对上述不足,从离群点对数据集有序性的影响角度出发,在界定分形离群点含义的基础上,利用分形理论将离群数据挖掘作为一个优化分割问题进行处理。采用推广的G-P算法计算数据集的多重分形广义维数,利用贪婪算法的思想设计FDOM算法用于求解离群数据挖掘优化问题。实验结果证明,该算法能有效地解决离群点检测问题。

关键词: 数据挖掘, 离群点检测, 分形理论, 多重分形

Abstract: According to the weakness that traditional outlier data mining algorithms have lower efficiency in high-dimension space, from the viewpoint of outlier affecting orderliness of data set, this paper considers outlier mining as an optimization segmentation problem by using fractal theory. Based on the defining fractal outlier, Grassberger-Procaccia(GP) algorithm is used to calculate multi-fractal and general dimension. A greedy algorithm named FDOM is designed to solve the optimization problems of outlier mining. Experimental result shows that the algrithm is feasible to solve the problems of outlier mining.

Key words: data mining, outlier detection, fractal theory, multifractal

中图分类号: