Abstract: Combining RSA public-key encryption with a pseudorandom number generator, this paper proposes PPD-ARBSM, a privacy-preserving association rule mining algorithm for distributed databases. The algorithm introduces a Cryptogram Management Server (CMS) and a Data Mining Server (DMS) to protect sensitive data. It uses a transaction similarity matrix to generate global frequent k-itemsets centrally and quickly, which reduces the communication cost of comparing local support counts among sites. Theoretical analysis and experimental results show that PPD-ARBSM achieves good privacy and accuracy together with high efficiency.
Key words:
RSA public-key encryption,
privacy preservation,
data mining,
association rule,
distributed database
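To make the notion of a global frequent k-itemset concrete, the following is a minimal, non-private baseline sketch. It does not reproduce PPD-ARBSM: the RSA encryption, the pseudorandom masking, the CMS/DMS roles, and the transaction similarity matrix are all omitted, and the function names, the sample transactions, and the `min_support` threshold are hypothetical. It only shows the standard rule that an itemset is globally frequent when its local support counts, summed over all sites, reach the minimum support fraction of all transactions.

```python
from collections import Counter
from itertools import combinations


def local_support_counts(transactions, k):
    """Count how many transactions at one site contain each k-itemset."""
    counts = Counter()
    for t in transactions:
        # Sort and de-duplicate so each itemset has one canonical tuple form.
        for itemset in combinations(sorted(set(t)), k):
            counts[itemset] += 1
    return counts


def global_frequent_itemsets(sites, k, min_support):
    """Return k-itemsets whose summed support meets min_support (a fraction)."""
    total = sum(len(site) for site in sites)
    merged = Counter()
    for site in sites:
        merged.update(local_support_counts(site, k))
    return {i: c for i, c in merged.items() if c >= min_support * total}


# Hypothetical data: two sites with three transactions each.
site_a = [["bread", "milk"], ["bread", "milk", "eggs"], ["milk"]]
site_b = [["bread", "milk"], ["eggs"], ["bread", "eggs"]]
result = global_frequent_itemsets([site_a, site_b], k=2, min_support=0.5)
print(result)
```

In this plain version every site reveals its local counts, which is exactly the exposure a privacy-preserving scheme is meant to avoid.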
GUI Qiong (桂琼); CHENG Xiao-hui (程小辉); RAO Jian-hui (饶建辉). Privacy Preservation Association Rule Mining Algorithm Based on RSA[J]. Computer Engineering (计算机工程), 2009, 35(17): 138-140.