Abstract: With the development and application of modern science, technology, and micro- and macro-sensors, tremendous amounts of complex, highly variable spatial and non-spatial data have been stored in large spatial databases. To use these data efficiently, users need to search not only the metadata but also the actual data values. Answering value range queries by scanning such large volumes of data is clearly impractical. This article studies a technique that clusters histograms of data values to answer value range queries on gridded earth science data. The experimental results show that the method answers statistical range queries quickly and approximately, and that each answer comes with an accuracy assessment.
Key words:
Histogram; Clustering; Value range queries
张雪萍,秦奋,王家耀,范中山. 基于直方图聚类技术的海量数据值域查询[J]. 计算机工程, 2006, 32(8): 92-94.
ZHANG Xueping, QIN Fen, WANG Jiayao, FAN Zhongshan. Value Range Queries Using Histogram Clustering[J]. Computer Engineering, 2006, 32(8): 92-94.