Computer Engineering ›› 2006, Vol. 32 ›› Issue (8): 92-94.

• Software Technology and Database •

Value Range Queries on Massive Data Based on Histogram Clustering

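ZHANG Xueping 1,2, QIN Fen 1,3, WANG Jiayao 1, FAN Zhongshan 4

  1. Institute of Surveying and Mapping, PLA Information Engineering University, Zhengzhou 450052; 2. Department of Computer Science, Henan University of Technology, Zhengzhou 450052; 3. Institute of Environment and Planning, Henan University, Kaifeng 475001; 4. Henan Communications Scientific Technology Research Institute, Zhengzhou 450052
  • Publication date: 2006-04-20 Release date: 2006-04-20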

Value Range Queries Using Histogram Clustering

ZHANG Xueping 1,2, QIN Fen 1,3, WANG Jiayao 1, FAN Zhongshan 4

  1. Institute of Surveying and Mapping, PLA Information Engineering University, Zhengzhou 450052; 2. Department of Computer Science, Henan University of Technology, Zhengzhou 450052; 3. Institute of Environment and Planning, Henan University, Kaifeng 475001; 4. Henan Communications Scientific Technology Research Institute, Zhengzhou 450052
  • Online: 2006-04-20 Published: 2006-04-20

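Abstract: With the development and application of modern technology and sensors, complex and changeable spatial data are growing rapidly. To use these massive data effectively, it is necessary to search not only the metadata but also the actual data. Answering value range queries by scanning such massive data is clearly impractical. This paper studies a histogram clustering technique for value range queries on raster earth science data. Experiments show that the method can answer statistical range queries quickly and approximately, and can also provide an accuracy assessment.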

Key words: histogram; clustering; data value range queries

Abstract: With the development and application of modern science and technology and of micro and macro sensors, tremendous amounts of spatial and non-spatial data have been stored in large spatial databases. To use these data efficiently, one needs to search not only metadata but also actual data values. Answering value range queries by scanning very large volumes of data is obviously unrealistic. This article studies a clustering technique on histograms of data values for value range queries on earth science data. The experimental results show that so-called statistical range queries can be answered quickly and approximately, together with an accuracy assessment.

Key words: Histogram; Clustering; Value range queries
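
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea, written under stated assumptions rather than taken from the paper. It assumes the raster is split into tiles, each tile is summarized by a histogram over shared bin edges, tiles with similar histograms are grouped by a simple greedy clustering (L1 distance to the first member of each cluster), and a query on the half-open range [lo, hi) is answered from the summed cluster histogram with a lower bound, an estimate (values assumed uniform within a bin), and an upper bound. All function names and the threshold parameter are hypothetical.

```python
from bisect import bisect_right

def build_histogram(values, edges):
    """Count values per bin; bin i covers [edges[i], edges[i+1]).
    Values outside [edges[0], edges[-1]) are ignored (assumption)."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        if edges[0] <= v < edges[-1]:
            counts[bisect_right(edges, v) - 1] += 1
    return counts

def l1_distance(h1, h2):
    """L1 distance between histograms normalized to relative frequencies."""
    n1, n2 = sum(h1) or 1, sum(h2) or 1
    return sum(abs(a / n1 - b / n2) for a, b in zip(h1, h2))

def cluster_histograms(hists, threshold):
    """Greedy clustering (an assumed stand-in for the paper's method):
    a histogram joins the first cluster whose leader is within `threshold`,
    otherwise it starts a new cluster."""
    clusters = []
    for idx, h in enumerate(hists):
        for c in clusters:
            if l1_distance(h, c["leader"]) <= threshold:
                c["total"] = [a + b for a, b in zip(c["total"], h)]
                c["members"].append(idx)
                break
        else:
            clusters.append({"leader": h, "total": list(h), "members": [idx]})
    return clusters

def range_count(hist, edges, lo, hi):
    """Return (lower, estimate, upper) for the number of values in [lo, hi)."""
    lower = upper = 0
    estimate = 0.0
    for i, n in enumerate(hist):
        left, right = edges[i], edges[i + 1]
        overlap = min(hi, right) - max(lo, left)
        if overlap <= 0 or n == 0:
            continue                      # bin cannot contribute
        upper += n                        # every value here might match
        if lo <= left and right <= hi:
            lower += n                    # bin fully inside: exact
            estimate += n
        else:
            estimate += n * overlap / (right - left)  # uniform-in-bin guess
    return lower, estimate, upper

# Toy data: three tiles of a raster, values in [0, 40).
edges = [0, 10, 20, 30, 40]
tiles = [[1, 2, 3, 12, 15], [2, 4, 6, 11, 18], [25, 31, 35, 38, 39]]
hists = [build_histogram(t, edges) for t in tiles]
clusters = cluster_histograms(hists, threshold=0.5)
answers = [range_count(c["total"], edges, 5, 20) for c in clusters]
print(answers)  # [(4, 7.0, 10), (0, 0.0, 0)]
```

In this toy run the true count for the first cluster is 5, which lies inside the reported bounds [4, 10]; the gap between the bounds serves as a rough accuracy assessment. The second cluster has an upper bound of 0, so its tiles never need to be scanned.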