Abstract:
XML is easy to create and parse, but it is verbose and difficult to search. This paper introduces the XBW transformation, which compresses XML data into three arrays, moves query processing from the tree structure onto those arrays, and implements search with the Rank&Select method. It also proposes navigation, subpath search, and content search algorithms based on the XBW transformation. Experimental results show that, in compression ratio and compression time, XBWZIP approaches or surpasses some queriable XML compressors and some general-purpose compressors.
Key words:
XBW transformation,
Rank&Select method,
XBWZIP method
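The Rank&Select method named in the abstract rests on two primitives over a sequence: rank counts the occurrences of a symbol before a position, and select locates the k-th occurrence of a symbol. The following is a minimal sketch of those two primitives only, not the paper's implementation. The class name `RankSelect`, the sample sequence `"abacab"`, and the use of sorted position lists are all assumptions made for illustration; the abstract does not describe the data structures actually used.

```python
from bisect import bisect_left


class RankSelect:
    """Naive rank/select over a symbol sequence (hypothetical helper, illustration only)."""

    def __init__(self, seq):
        self.seq = list(seq)
        # positions[c] holds, in increasing order, the 0-based indices where c occurs.
        self.positions = {}
        for i, c in enumerate(self.seq):
            self.positions.setdefault(c, []).append(i)

    def rank(self, c, i):
        """Return the number of occurrences of c among the first i symbols, seq[0:i]."""
        # bisect_left counts how many stored positions are strictly less than i.
        return bisect_left(self.positions.get(c, []), i)

    def select(self, c, k):
        """Return the 0-based index of the k-th occurrence of c (k counts from 1)."""
        occurrences = self.positions.get(c, [])
        if not 1 <= k <= len(occurrences):
            raise ValueError(f"symbol {c!r} has no occurrence number {k}")
        return occurrences[k - 1]


# Arbitrary sample sequence, chosen only to exercise the two operations.
rs = RankSelect("abacab")
print(rs.rank("a", 4))    # occurrences of 'a' in "abac"
print(rs.select("a", 3))  # index of the third 'a'
```

The two operations are inverse in the sense that `rs.rank(c, rs.select(c, k))` equals `k - 1`. This sketch stores one integer per symbol and is meant only to show the semantics; it makes no attempt at the compact representation a compressor would need.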
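Abstract: Although XML is easy to create and parse, the format is overly verbose and hard to query. This paper introduces the XBW transformation, which compresses XML data into three linear sequences so that query processing moves from the tree structure onto these three sequences. Navigation, subpath queries, and content queries based on the XBW transformation are implemented with the Rank&Select method. Experimental results show that, in terms of compression ratio and compression time, the performance of XBWZIP approaches or surpasses that of some query-supporting XML compression methods and some general-purpose compression methods.
Key words:
XBW transformation,
Rank&Select method,
XBWZIP method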
HU Zhi-fei; YANG Lu-ming; LIU Bo; LI Jian-jun. XML Data Compression Query Method Based on XBW Transformation[J]. Computer Engineering, 2008, 34(19): 67-69.
胡智飞;杨路明;刘 波;李建军. 基于XBW变换的XML数据压缩查询方法[J]. 计算机工程, 2008, 34(19): 67-69.