作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (3): 32-34,3. doi: 10.3969/j.issn.1000-3428.2008.03.012

• 博士论文 • 上一篇    下一篇

元素路径模型:高效的XML Schema提取方法

张海威,袁晓洁,杨 娜,王 鑫   

  1. (南开大学计算机科学与技术系,天津 300071)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-02-05 发布日期:2008-02-05

Element Path Model: Efficient Method for XML Schema Extraction

ZHANG Hai-wei, YUAN Xiao-jie, YANG Na, WANG Xin   

  1. (Department of Computer Science and Technology, Nankai University, Tianjin 300071)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-02-05 Published:2008-02-05

摘要: 提出基于元素路径模型(EPM)的XML Schema提取方法,旨在提高Hegewald等提出的XStruct系统的运行效率。基于EPM的方法使用SAX解析XML文档,提取XML元素路径模型并根据规则进行合并,得到XML元素序列表达式进而生成XML Schema。实验结果表明,基于元素路径模型方法的时间空间代价均优于XStruct系统。

关键词: 元素路径模型, 元素序列表达式, 合并操作

Abstract: This paper presents a method based on Element Path Models(EPM) for extracting XML Schema to enhance the efficiency of XStruct supposed by Hegewald, etc. XML Schema is generated by element sequence expression which is merged from element path models while parsing XML documents with SAX. Experiment shows that the method based on EPM uses less time and space than XStruct.

Key words: Element Path Model(EPM), element sequence expression, merging operation

中图分类号: