Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2010, Vol. 36 ›› Issue (19): 62-64. doi: 10.3969/j.issn.1000-3428.2010.19.021

• Networks and Communications • Previous Articles     Next Articles

Dewey Encoding Generation Algorithm of XML Document

WU Hai-tao1,2, TANG Zhen-min1   

  1. (1. School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094, China; 2. School of Communication Engineering, Nanjing Institute of Technology, Nanjing 211167, China)
  • Online:2010-10-05 Published:2010-09-27

XML文档的Dewey编码生成算法

吴海涛1,2,唐振民1   

  1. (1. 南京理工大学计算机科学与技术学院,南京 210094;2. 南京工程学院通信工程学院,南京 211167)
  • 作者简介:吴海涛(1974-),男,博士研究生,主研方向:XML数据管理,数据挖掘;唐振民,教授、博士生导师
  • 基金资助:
    国家自然科学基金资助项目(90820306)

Abstract: Dewey code is a kind of important coding scheme of XML document, it is an important preprocessing step in XML keyword search. This paper proposes two algorithms for Dewey code of XML document, that is the recursive algorithm based on DOM and the event generation algorithm based on SAX, and compares the running time, memory consumption of two algorithms. Experimental result shows that the SAX event generation algorithm is with higher speed and lower memory consumption for very large XML document.

Key words: XML document, Dewey encoding, DOM

摘要: Dewey编码是一种重要的XML文档编码方式,是对XML文档进行关键字检索等操作的重要预处理步骤。提出2种XML文档Dewey编码的生成算法:基于DOM的递归算法和基于SAX的事件生成算法,并比较2种算法的执行时间和内存使用率。实验结果证明,对于超大XML文档,采用基于SAX的事件生成算法具有较快的生成速度和较低的内存使用率。

关键词: XML文档, Dewey编码, 文档对象模型

CLC Number: