Computer Engineering ›› 2011, Vol. 37 ›› Issue (21): 43-45. doi: 10.3969/j.issn.1000-3428.2011.21.015

• Software Technology and Database •

Skyline Algorithm for Data Stream Based on Nearest Neighbor Filtering

WANG Yan-jie, XUE An-rong, WU Zheng-yin

  1. (School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang 212013, China)
  • Received: 2011-04-29 Online: 2011-11-05 Published: 2011-11-05
  • About the authors: WANG Yan-jie (born 1985), female, master's student, main research interest: data mining; XUE An-rong, professor, Ph.D., CCF member; WU Zheng-yin, master's student
  • Supported by:
    National Natural Science Foundation of China (60773049); Jiangsu University Start-up Fund for Senior Talents (09JDG041)

Abstract: Among Skyline query algorithms, the Lazy algorithm has limited data-processing capability. To address this problem, this paper proposes an improved algorithm, Nearest Neighbor-based Skyline Computation (NNSC), built on the idea of Nearest Neighbor (NN) filtering. In the pre-processing stage, a threshold is defined using Euclidean distance. If the Euclidean distance of a newly inserted data object is greater than the threshold, the object is processed ahead of time, which improves the algorithm's efficiency and reduces resource consumption. Experimental results show that NNSC reduces time and space complexity, and that the performance gain is most pronounced on positively correlated data.

Key words: data stream, Skyline algorithm, Nearest Neighbor-based Skyline Computation (NNSC) algorithm, Nearest Neighbor (NN) filtering, Euclidean distance
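
The abstract gives only the outline of the pre-processing step, so the following is a minimal sketch of one possible reading, not the authors' NNSC implementation. It assumes that smaller values are better in every dimension, that coordinates are non-negative, that the threshold is the Euclidean distance from the origin to the nearest skyline point seen so far, and that "processed ahead of time" means an early dominance test against that nearest point. The class name `StreamSkyline` and its methods are hypothetical, and window expiry of old stream objects is omitted.

```python
import math


def dominates(a, b):
    """Return True if a dominates b (smaller is better in every dimension)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def origin_distance(p):
    """Euclidean distance from the origin to point p."""
    return math.sqrt(sum(x * x for x in p))


class StreamSkyline:
    """Hypothetical incremental skyline with a distance-threshold pre-filter."""

    def __init__(self):
        self.skyline = []            # current skyline points
        self.nearest = None          # inserted point closest to the origin
        self.threshold = math.inf    # distance of self.nearest (assumed threshold)

    def insert(self, point):
        """Insert a stream object; return True if it joins the skyline."""
        p = tuple(point)
        d = origin_distance(p)

        # Pre-processing (assumption): a point farther away than the threshold
        # cannot dominate the nearest point, so test it against that single
        # point first and discard it early if it is dominated.
        if d > self.threshold and dominates(self.nearest, p):
            return False

        # Full check against the remaining skyline points.
        if any(dominates(s, p) for s in self.skyline):
            return False

        # p is a skyline point: evict everything it dominates.
        self.skyline = [s for s in self.skyline if not dominates(p, s)]
        self.skyline.append(p)

        # Tighten the threshold when p is closer to the origin.
        if d < self.threshold:
            self.nearest, self.threshold = p, d
        return True


if __name__ == "__main__":
    sky = StreamSkyline()
    for obj in [(4, 4), (1, 5), (5, 1), (2, 2), (6, 6), (3, 3)]:
        sky.insert(obj)
    print(sorted(sky.skyline))
```

In this sketch the early test is only an optimization: a point it rejects is dominated by a real data object, so it would also be rejected by the full check. On positively correlated data most objects lie far from the origin and are dominated by the nearest point, which is consistent with the abstract's claim that such data benefit most.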

CLC Number: