作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (19): 49-52,5. doi: 10.3969/j.issn.1000-3428.2009.19.016

• 软件技术与数据库 • 上一篇    下一篇

基于P2P的自适应分布式k最近邻搜索算法

余小高1,余小鹏2   

  1. (1. 湖北经济学院信息管理学院,武汉 430205;2. 武汉工程大学经济管理学院,武汉 430073)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-10-05 发布日期:2009-10-05

P2P-based Self-adaptive Distributed k-nearest Neighbor Search Algorithm

YU Xiao-gao1, YU Xiao-peng2   

  1. (1. School of Information Management, Hubei University of Economics, Wuhan 430205; 2. School of Economic Management, Wuhan Institute of Technology, Wuhan 430073)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-10-05 Published:2009-10-05

摘要: k最近邻搜索算法无法满足数据挖掘的分布性、实时性和可扩展性要求,针对该问题提出基于P2P的自适应分布式k最近邻搜索算法(P2PAKNNs)。阐述GHT*结构,定义高维数据相似度函数HDSF(X,Y),论述GHT*中的插入算法、范围查找算法和搜索算法。给出P2PAKNNs的实现过程,通过实验证明其正确性。

关键词: k最近邻搜索算法, 度量空间, 相似性查询

Abstract: k-nearest Neighbor search algorithm(KNNs) can not satisfy the needs of distributing, real time performance and expansibility for data mining. Aiming at this problem, a P2P-based self-adaptive distributed KNNs(P2PAKNNs) is proposed. This paper expounds GHT* structure, and gives similarity measure function HDSF(X, Y). Insert algorithm, range find algorithm and search algorithm in GHT* are discussed. Implementation process of P2PAKNNs is given, and its correctness is validated by experiment.

Key words: k-nearest Neighbor search algorithm(KNNs), metric space, similarity query

中图分类号: