Outliers Detection Based on Kernel Function-Principle Component Dimension Reduction

doi:10.3969/j.issn.1000-3428.2008.08.028

Computer Engineering, 2008, Vol. 34, Issue (8): 82-84.

XU Xue-song, LIU Yao-zong, ZHAO Xue-long, ZHANG Hong, LIU Feng-yu

(Department of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094)

Received: 1900-01-01 Revised: 1900-01-01 Online: 2008-04-20 Published: 2008-04-20

Abstract

Abstract: To improve the efficiency of outlier mining on high-dimensional data sets, this paper analyzes traditional outlier mining algorithms and proposes an outlier detection algorithm. The algorithm transforms a nonlinear problem into a linear problem in a high-dimensional feature space and reduces the data dimension using a kernel function together with principal components. It then scans the projection components of the data objects one by one to decide whether each data point is an outlier. The algorithm applies to outlier detection in both linearly separable and linearly inseparable data sets. Experimental results demonstrate the superiority of the algorithm.
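The pipeline described in the abstract (kernel mapping, principal-component dimension reduction, then a component-by-component scan) can be illustrated with a minimal sketch. The abstract does not state which kernel or which decision rule the paper uses, so everything specific here is an assumption made for illustration: the Gaussian (RBF) kernel, the `gamma` value, the median/MAD rule with its `threshold`, and the function names `rbf_kernel_matrix`, `kernel_pca_projections`, and `detect_outliers`.

```python
import numpy as np


def rbf_kernel_matrix(X, gamma):
    """Pairwise Gaussian (RBF) kernel values k(x, y) = exp(-gamma * ||x - y||^2)."""
    sq_norms = np.sum(X * X, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def kernel_pca_projections(X, n_components, gamma):
    """Project the training points onto the leading kernel principal components."""
    n = X.shape[0]
    K = rbf_kernel_matrix(X, gamma)
    # Center the kernel matrix, which centers the data in feature space.
    one_n = np.full((n, n), 1.0 / n)
    K_centered = K - one_n @ K - K @ one_n + one_n @ K @ one_n
    # eigh returns eigenvalues in ascending order; reverse to put the largest first.
    eigvals, eigvecs = np.linalg.eigh(K_centered)
    eigvals = eigvals[::-1][:n_components]
    eigvecs = eigvecs[:, ::-1][:, :n_components]
    # For unit-norm eigenvectors v_j, the projection of training point i onto
    # component j equals sqrt(lambda_j) * v_j[i].
    return eigvecs * np.sqrt(np.maximum(eigvals, 0.0))


def detect_outliers(X, n_components=2, gamma=0.5, threshold=3.5):
    """Flag points whose projection on any retained component is extreme.

    The median/MAD rule and the threshold are illustrative assumptions,
    not the decision rule from the paper.
    """
    Z = kernel_pca_projections(X, n_components, gamma)
    flags = np.zeros(X.shape[0], dtype=bool)
    for j in range(Z.shape[1]):  # scan the projection components one by one
        component = Z[:, j]
        median = np.median(component)
        mad = np.median(np.abs(component - median))
        scale = 1.4826 * mad if mad > 0 else 1e-12
        flags |= np.abs(component - median) / scale > threshold
    return flags


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    inliers = rng.normal(0.0, 0.1, size=(50, 2))  # tight synthetic cluster
    X = np.vstack([inliers, [[3.0, 3.0]]])        # one distant point, index 50
    print(np.flatnonzero(detect_outliers(X)))     # indices flagged as outliers
```

Scanning a few retained components costs far less than computing distances in the original high-dimensional space, which matches the efficiency motivation in the abstract. A robust rule such as median/MAD is used here only because a single extreme point would otherwise inflate a mean/standard-deviation estimate; other rules are equally plausible.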

Key words: dimension reduction, kernel function, principal component

CLC Number: TP311.5

XU Xue-song; LIU Yao-zong; ZHAO Xue-long; ZHANG Hong; LIU Feng-yu. Outliers Detection Based on Kernel Function-Principle Component Dimension Reduction[J]. Computer Engineering, 2008, 34(8): 82-84.

https://www.ecice06.com/CN/Y2008/V34/I8/82

