作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (21): 144-146. doi: 10.3969/j.issn.1000-3428.2008.21.052

• 安全技术 • 上一篇    下一篇

隐私保护聚类的独立噪音算法

王子亮,郑玉明,廖湖声   

  1. (北京工业大学计算机学院,北京 100022)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-11-05 发布日期:2008-11-05

Independent Noise Algorithm for Privacy Preserving Clustering

WANG Zi-liang, ZHENG Yu-ming, LIAO Hu-sheng   

  1. (Collage of Computer, Beijing University of Technology, Beijing 100022)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-11-05 Published:2008-11-05

摘要: 数据挖掘技术具有很多优点,但存在隐私威胁等不足。该文针对聚类分析时如何保护隐私的问题,提出独立噪音思想并设计独立噪音算法(INA)。该算法对原数据叠加噪音以保护原始数据不被泄漏,所用噪音不会对数据分布造成严重影响,使后期挖掘工作可以在修改后的数据上直接进行。实验结果证明,INA算法可以取得较高的隐私保护程度和挖掘正确率。

关键词: 隐私保护, 聚类挖掘, 独立噪音

Abstract: Data mining technique has a lot of merits, but it faces criticism like privacy threatening. This paper focuses on how to preserve privacy in clustering analysis and introduces the Independent Noise Algorithm(INA). This algorithm preserves original data by adding noises while keeping its distribution. The noises won’t influence the data distributing badly so that the mining can be done directly on the amended data later. The experimental results demonstrate that INA is effective and provide acceptable values for balancing privacy and accuracy.

Key words: privacy preserving, clustering mining, independent noise

中图分类号: