Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2010, Vol. 36 ›› Issue (5): 64-66.

• Software Technology and Database • Previous Articles     Next Articles

Histogram Based on Distribution Density and Selectivity Estimation

ZHU Liang1, FENG Yan-chao1, LIU Chun-nian2, YANG Wen-zhu1   

  1. (1. Hebei Province Key Laboratory of Machine Learning and Computational Intelligence, School of Mathematics and Computer Science, Hebei University, Baoding 071002; 2. College of Computer Science, Beijing University of Technology, Beijing 100022)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-03-05 Published:2010-03-05

基于分布密度的直方图与选择率估计

朱 亮1,冯彦超1,刘椿年2,杨文柱1   

  1. (1. 河北大学数学与计算机学院河北省机器学习与计算智能重点实验室,保定 071002;2. 北京工业大学计算机学院,北京 100022)

Abstract: Query selectivity estimation is one of the key issues for query processing and optimization. This paper presents a method based on domain distribution density to establish histograms in which the distribution of buckets is uniform or nearly. It utilizes the histograms to estimate query selectivity. Experimental results indicate that this method gets query selectivity with high precision for low-dimensional data and can estimate high-dimensional data.

Key words: selectivity estimation, histogram, n-dimension hyperrectangle, distribution density

摘要: 查询选择率估计是查询处理和优化中的关键之一。提出一种基于区域分布密度的方法,用于构造直方图,使其每个桶具有均匀分布或近似均匀分布,利用直方图估计查询选择率。实验结果表明,该方法对低维数据估计得到的查询选择率精度较高,并能对高维数据进行估计。

关键词: 选择率估计, 直方图, n维超矩形, 分布密度

CLC Number: