Abstract:
Aiming at some problems of present XML queryable compressors, this paper proposes the definition and detailed algorithm of structure sign tree, which can simplify the structure of XML data by removing the repeated paths. On the basis, a new XML queryable compressor SSTQC is put forward to compress XML data and organize queries. SSTQC requires only a single pass over the XML document, and it has excellent compression performance and better query efficiency.
Key words:
XML data,
data compression,
query processing,
repeated paths,
Structure Sign Tree(SST)
摘要: 针对支持查询的XML数据压缩方法存在的路径和数据重复等问题,通过去除XML数据中的重复路径,简化XML数据结构,提出结构标记树的概念及其生成算法,设计一种基于结构标记树的可查询XML数据压缩方法SSTQC,对XML数据进行压缩和组织查询。SSTQC一次扫描XML文档,具有较好的的压缩性能和查询效率。
关键词:
XML数据,
数据压缩,
查询处理,
重复路径,
结构标记树
CLC Number:
WEI Dong-Beng, XU Rui-Min, GU Nan. XML Queryable Compression Method Based on Structure Sign Tree[J]. Computer Engineering, 2011, 37(15): 34-36.
魏东平, 徐瑞敏, 贾楠. 基于结构标记树的XML可查询压缩方法[J]. 计算机工程, 2011, 37(15): 34-36.