Abstract: To improve the query efficiency of XML documents, this paper proposes a combined index technique based on inverted tables and B+ trees. The DTD structure index and the content index use inverted tables as their index units, while the XML document index uses a B+ tree as its basic organization. Identification information is embedded in the node codes of the DTD structure index, which helps determine the documents that need to be queried. Queries over hybrid XML documents are carried out by building the DTD structure index, the XML document index, and the content index. Theoretical analysis and experimental results show that the proposed technique has lower space overhead and higher query efficiency.
Key words: eXtensible Markup Language (XML) document; coding; inverted table; B+ tree; index; query performance
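The three-part organization described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `CombinedIndex`, its methods, the use of element paths as structure keys, and the sorted list that stands in for B+ tree leaf order are all assumptions made for illustration, and the paper's node coding scheme is not reproduced.

```python
from bisect import bisect_left, insort
from collections import defaultdict


class CombinedIndex:
    """Toy combined index (hypothetical): two inverted tables plus an ordered document index."""

    def __init__(self):
        # Structure index (inverted table): element path -> ids of documents containing it.
        # Assumption: the doc-id set plays the role of the identification information.
        self.structure = defaultdict(set)
        # Content index (inverted table): term -> set of (doc_id, path) postings.
        self.content = defaultdict(set)
        # Document index: sorted doc ids, a stand-in for B+ tree leaf order.
        self.doc_keys = []
        self.docs = {}

    def add_document(self, doc_id, elements):
        """Index one document given as (path, text) pairs."""
        if doc_id in self.docs:
            raise ValueError(f"document {doc_id} is already indexed")
        insort(self.doc_keys, doc_id)
        self.docs[doc_id] = list(elements)
        for path, text in self.docs[doc_id]:
            self.structure[path].add(doc_id)
            for term in text.lower().split():
                self.content[term].add((doc_id, path))

    def query(self, path, term):
        """Return sorted ids of documents in which `term` occurs under `path`."""
        # Step 1: the structure index narrows the candidate documents.
        candidates = self.structure.get(path, set())
        # Step 2: the content index filters the candidates by term and path.
        postings = self.content.get(term.lower(), set())
        return sorted({d for d, p in postings if p == path and d in candidates})

    def doc_range(self, low, high):
        """Ordered range scan over doc ids, as a B+ tree leaf scan would allow."""
        start = bisect_left(self.doc_keys, low)
        result = []
        for key in self.doc_keys[start:]:
            if key > high:
                break
            result.append(key)
        return result


idx = CombinedIndex()
idx.add_document(2, [("/book/title", "XML Indexing"), ("/book/author", "Liu")])
idx.add_document(1, [("/book/title", "B+ tree basics")])
idx.add_document(3, [("/article/title", "XML query")])
print(idx.query("/book/title", "xml"))
print(idx.doc_range(2, 3))
```

In this sketch the structure lookup prunes documents before the content postings are examined, which mirrors the role the abstract assigns to the identification information; a real B+ tree would replace the sorted list to keep insertions and range scans logarithmic on disk-resident data.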
LIU Gao-Song, WAN Li-Yong, LONG Jun. Combined Index Techniques Based on Inverted Table and B+ Tree[J]. Computer Engineering, 2012, 38(16): 49-51.