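Abstract: A method for extracting XML Schema based on the Element Path Model (EPM) is proposed, aiming to improve the running efficiency of the XStruct system proposed by Hegewald et al. The EPM-based method parses XML documents with SAX, extracts XML element path models and merges them according to a set of rules, yielding XML element sequence expressions from which the XML Schema is generated. Experimental results show that the EPM-based method outperforms XStruct in both time and space cost.
Key words:
element path model,
element sequence expression,
merging operation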
Abstract: This paper presents a method based on the Element Path Model (EPM) for extracting XML Schema, intended to improve on the efficiency of the XStruct system proposed by Hegewald et al. While parsing XML documents with SAX, the method extracts element path models and merges them into element sequence expressions, from which the XML Schema is generated. Experiments show that the EPM-based method uses less time and space than XStruct.
Key words:
Element Path Model (EPM),
element sequence expression,
merging operation
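The SAX-based extraction step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' EPM algorithm: the class `PathCollector`, the function `collect_paths`, and the choice to record each element's child-name sequence per root-to-element path are assumptions made purely for illustration, and the paper's merging rules are not reproduced here.

```python
import xml.sax
from collections import defaultdict


class PathCollector(xml.sax.ContentHandler):
    """Hypothetical helper: records, for every root-to-element path,
    the distinct sequences of child element names seen under it."""

    def __init__(self):
        super().__init__()
        self.stack = []      # names of the currently open elements
        self.children = []   # child names collected for each open element
        self.sequences = defaultdict(set)  # path -> set of child-name tuples

    def startElement(self, name, attrs):
        # Register this element as a child of its parent, if it has one.
        if self.children:
            self.children[-1].append(name)
        self.stack.append(name)
        self.children.append([])

    def endElement(self, name):
        # The element is complete: store its observed child sequence.
        path = "/" + "/".join(self.stack)
        self.sequences[path].add(tuple(self.children.pop()))
        self.stack.pop()


def collect_paths(xml_text):
    """Parse xml_text in a single SAX pass and return path -> child sequences."""
    handler = PathCollector()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return dict(handler.sequences)


if __name__ == "__main__":
    doc = "<lib><book><title/><author/></book><book><title/></book></lib>"
    for path, seqs in sorted(collect_paths(doc).items()):
        print(path, sorted(seqs))
```

Because SAX streams events instead of building a tree, memory use in this sketch grows with the number of distinct paths and child sequences, not with document size. A schema extractor would still need a merging step that generalizes the collected sequences (for example, treating `title author` and `title` as one expression with an optional `author`).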
ZHANG Hai-wei (张海威); YUAN Xiao-jie (袁晓洁); YANG Na (杨娜); WANG Xin (王鑫). Element Path Model: Efficient Method for XML Schema Extraction (元素路径模型:高效的XML Schema提取方法)[J]. Computer Engineering (计算机工程), 2008, 34(3): 32-34,3.