Abstract:
Based on the idea of the law of gravity, the method measuring dissimilarity and the method measuring a cluster departure from the whole are presented. Based on these, an outlier detection approach based on clustering, named EOD, is introduced. The time complexity of the detection approach is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results on real datasets show that the approach is effective, robust and practicable.
Key words:
Clustering,
Outlier factor,
Outlier detection
摘要: 借鉴万有引力思想提出了一种差异性度量方法和度量类偏离程度的方法,以此为基础提出了一种基于聚类的异常检测方法。该异常检测方法关于数据集大小和属性个数具有近似线性时间复杂度,适合于大规模数据集。理论分析以及在真实数据集上的实验结果表明,该方法是有效的,稳健并且实用。
关键词:
聚类,
异常因子,
异常检测
CLC Number:
JIANG Shengyi; JIANG Lingmin.
Approach of Efficient Outlier Detection
[J]. Computer Engineering, 2007, 33(07): 166-168.
蒋盛益;姜灵敏. 一种高效异常检测方法[J]. 计算机工程, 2007, 33(07): 166-168.