作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (23): 31-33,35. doi: 10.3969/j.issn.1000-3428.2010.23.011

• 软件技术与数据库 • 上一篇    下一篇

一种面向数据流模型的流计数算法

廖豪1,2,梁峰3,谭建龙1   

  1. (1.中国科学院计算技术研究所, 北京 100190; 2.中国科学院研究生院, 北京 100049; 3.湘潭大学信息工程学院, 湖南 湘潭 411105)
  • 出版日期:2010-12-05 发布日期:2010-12-14
  • 作者简介:廖豪(1986-),男,硕士,主研方向:网络信息安全;梁峰,硕士;谭建龙,副研究员、博士
  • 基金资助:
    国家“973”计划基金资助项目(2007CB311100)

Flow Counting Algorithm for Data Stream Model

LIAO Hao1,2,LIANG Feng3,TAN Jianlong1   

  1. (1.Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 2.Graduate University of Chinese Academy of Sciences, Beijing 100049, China; 3.College of Information Engineering, Xiangtan University, Xiangtan 411105, China)
  • Online:2010-12-05 Published:2010-12-14

摘要: 在研究数据流过程中,基于现有的概要数据结构Bloom Filter,给出改进的KBloom Filter结构,从理论上对假阳性误判进行分析,得出两者具有相同的在误判率f0下表示集合规模的上限n0,因此,KBloom Filter的误判率在可控范围内。提出基于KBloom Filter的流计数算法,与基于Bloom Filter的流计数算法相比,在相同的空间复杂度O(m)和插入操作时间复杂度O(k)情况下,该算法降低了统计结果的误差。

关键词: 数据流, 布鲁姆过滤器, 概要数据结构

Abstract: During the process of studying data stream, this paper proposes the KBloom Filter structure which is based on the existing synopsis data structure, analyses the false positive misjudgment theoretically, concludes that both can present the upper limits of set scale n0under the same misjudgment rate f0, so the misjudgment rate of KBloom Filter is within the controllable range. The flow counting algorithm based on the KBloom Filter structure is gived, under the same space complexity O(m) and insert time complexity O(k), compared to the flow counting algorithm based on the Bloom Filter structure, the error of the counting result is lower.

Key words: data stream, Bloom Filter, synopsis data structure

中图分类号: