
Computer Engineering, 2009, Vol. 35, Issue 21: 39-41. doi: 10.3969/j.issn.1000-3428.2009.21.013

• Software Technology and Database •

Automaton Compact Representation Technology in String Matching Algorithm

YANG Yi-fu1,2, LIU Yan-bing1,2, LIU Ping1, GUO Li1

  1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
  2. Graduate University of Chinese Academy of Sciences, Beijing 100039
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2009-11-05 Published: 2009-11-05

Abstract: Automata are a data structure commonly used in string matching algorithms, and storing an automaton compactly reduces the memory the algorithm needs. This paper surveys the commonly used compact storage methods for automata, analyzes their principles, time efficiency, space efficiency, strengths, and weaknesses, and relates each method to the sparsity of the data. The basic AC (Aho-Corasick) algorithm is implemented with a compact storage method, and experiments on random and real data show that the approach is effective.

Key words: compact representation, automaton, string matching

CLC number:
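The abstract does not say which compact storage methods the paper evaluates, so the following is only an illustrative sketch of one common option: an Aho-Corasick automaton whose goto function is packed into sorted per-state edge arrays (a row-compressed layout) instead of a dense 256-entry row per state. The class name `CompactAC` and its fields `offset`, `label`, and `target` are hypothetical names chosen for this example, not taken from the paper.

```python
from bisect import bisect_left
from collections import deque


class CompactAC:
    """Aho-Corasick automaton with a sparse, row-compressed goto function.

    Illustrative sketch only; the layout is an assumption, not the paper's method.
    """

    def __init__(self, patterns):
        # Build the trie with one small dict per state (existing edges only).
        edges = [{}]            # edges[s][byte] -> child state
        self.out = [[]]         # patterns recognized on entering each state
        for p in patterns:
            s = 0
            for b in p.encode():
                if b not in edges[s]:
                    edges.append({})
                    self.out.append([])
                    edges[s][b] = len(edges) - 1
                s = edges[s][b]
            self.out[s].append(p)

        # Compute failure links breadth-first; depth-1 states fail to the root.
        self.fail = [0] * len(edges)
        queue = deque(edges[0].values())
        while queue:
            s = queue.popleft()
            for b, t in edges[s].items():
                f = self.fail[s]
                while f and b not in edges[f]:
                    f = self.fail[f]
                self.fail[t] = edges[f].get(b, 0)
                self.out[t] = self.out[t] + self.out[self.fail[t]]
                queue.append(t)

        # Pack the edges: state s owns label[offset[s]:offset[s + 1]],
        # sorted by byte value, with target[] holding the matching next states.
        self.offset = [0]
        self.label = bytearray()
        self.target = []
        for e in edges:
            for b in sorted(e):
                self.label.append(b)
                self.target.append(e[b])
            self.offset.append(len(self.label))

    def _goto(self, s, b):
        # Binary search inside the slice owned by state s; -1 means no edge.
        lo, hi = self.offset[s], self.offset[s + 1]
        i = bisect_left(self.label, b, lo, hi)
        if i < hi and self.label[i] == b:
            return self.target[i]
        return -1

    def search(self, text):
        # Returns (byte offset of match start, pattern) pairs in scan order.
        s, hits = 0, []
        for i, b in enumerate(text.encode()):
            t = self._goto(s, b)
            while t < 0 and s:
                s = self.fail[s]
                t = self._goto(s, b)
            s = max(t, 0)
            for p in self.out[s]:
                hits.append((i - len(p.encode()) + 1, p))
        return hits


ac = CompactAC(["he", "she", "his", "hers"])
print(ac.search("ushers"))  # [(1, 'she'), (2, 'he'), (2, 'hers')]
```

In this sketch the four patterns produce 10 states and 9 edges, so the packed arrays hold 9 labels and 9 targets where a dense table would reserve 10 × 256 entries. The price is a binary search per transition instead of a single array lookup, which illustrates the kind of time/space trade-off, dependent on how sparse the transitions are, that the abstract refers to.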