作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (5): 64-66. doi: 10.3969/j.issn.1000-3428.2010.05.024

• 软件技术与数据库 • 上一篇    下一篇

基于分布密度的直方图与选择率估计

朱 亮1,冯彦超1,刘椿年2,杨文柱1   

  1. (1. 河北大学数学与计算机学院河北省机器学习与计算智能重点实验室,保定 071002;2. 北京工业大学计算机学院,北京 100022)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-03-05 发布日期:2010-03-05

Histogram Based on Distribution Density and Selectivity Estimation

ZHU Liang1, FENG Yan-chao1, LIU Chun-nian2, YANG Wen-zhu1   

  1. (1. Hebei Province Key Laboratory of Machine Learning and Computational Intelligence, School of Mathematics and Computer Science, Hebei University, Baoding 071002; 2. College of Computer Science, Beijing University of Technology, Beijing 100022)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-03-05 Published:2010-03-05

摘要: 查询选择率估计是查询处理和优化中的关键之一。提出一种基于区域分布密度的方法,用于构造直方图,使其每个桶具有均匀分布或近似均匀分布,利用直方图估计查询选择率。实验结果表明,该方法对低维数据估计得到的查询选择率精度较高,并能对高维数据进行估计。

关键词: 选择率估计, 直方图, n维超矩形, 分布密度

Abstract: Query selectivity estimation is one of the key issues for query processing and optimization. This paper presents a method based on domain distribution density to establish histograms in which the distribution of buckets is uniform or nearly. It utilizes the histograms to estimate query selectivity. Experimental results indicate that this method gets query selectivity with high precision for low-dimensional data and can estimate high-dimensional data.

Key words: selectivity estimation, histogram, n-dimension hyperrectangle, distribution density

中图分类号: