作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (7): 5-7. doi: 10.3969/j.issn.1000-3428.2009.07.002

• 博士论文 • 上一篇    下一篇

典型Bloom过滤器的研究及其数据流应用

袁志坚,陈颖文,缪嘉嘉,贾 焰,杨树强   

  1. (国防科技大学计算机学院,长沙 410073)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-04-05 发布日期:2009-04-05

Research on Typical Bloom Filters and Their Data Stream Applications

YUAN Zhi-jian, CHEN Ying-wen, MIAO Jia-jia, JIA Yan, YANG Shu-qiang   

  1. (College of Computer, National University of Defense Technology, Changsha 410073)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-04-05 Published:2009-04-05

摘要: Bloom过滤器是一种空间高效但有一定假阳性的数据表示方法。该文分析比较计数型Bloom过滤器、光谱Bloom过滤器和动态计数过滤器的异同点及适用场合,介绍Bloom 过滤器在重复项检测及频繁项挖掘中的应用,总结Bloom过滤器给数据流带来的挑战,包括元素突发问题及数据流相异元素数目变化问题。

关键词: Bloom过滤器, 计数型Bloom过滤器, 光谱Bloom过滤器, 动态计数过滤器, 数据流

Abstract: Bloom Filter(BF) is a space-efficient randomized data structure for representing data set with a small false positive probability. This paper compares and analyzes the similarities and differences among Counting Bloom Filter(CBF), Spectral Bloom Filter(SBF) and Dynamic Counting Filter(DCF), gives some examples and applications in data stream which include duplicate items detecting and frequent items mining. It summarizes the challenges in data stream brought by BF, including element burst problem and number of distinguish element changing problem.

Key words: Bloom Filter(BF), Counting Bloom Filter(CBF), Spectral Bloom Filter(SBF), Dynamic Counting Filter(DCF), data stream

中图分类号: