作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (14): 69-70,8. doi: 10.3969/j.issn.1000-3428.2007.14.024

• 软件技术与数据库 • 上一篇    下一篇

面向相似数据集的关系数据库压缩

邓文平,朱培栋,卢锡城   

  1. (国防科技大学计算机学院,长沙 410073)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-07-20 发布日期:2007-07-20

Relational Database Compression for Similar Data Sets

DENG Wenping, ZHU Peidong, LU Xicheng   

  1. (School of Computer, National University of Defense Technology, Changsha 410073)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-07-20 Published:2007-07-20

摘要: 采用关系数据库模型进行建模,对于同一关系框架上的数据定义了相似数据集。对单个数据集,通过关系拆分对数据库模型进行规范化处理,去除了关系内部的数据冗余;对多个数据集之间的压缩提出了一种基于0-1状态标记序列的增量式无损压缩算法,压缩后的数据可以快速地完全解压缩。试验结果表明,算法可以实现对相似数据集的高效无损压缩和快速查询。

关键词: 数据库压缩, 冗余度, 相似数据集, 无损压缩, 压缩比

Abstract: A new definition of similar data set is proposed for some special data sets which have the same attributes in a relational database model. For compression of a single data set, the database normalization is performed by partitioning the relations; for multiplet similar data sets, a data lossless compression algorithm is proposed, which is based on a 0-1 status tag sequence. With the compression method, redundancies among similar data sets evidently decrease, and decompression finishe fast and completely. Experimental results show that with the method compression on similar data sets is efficient without any loss and access to the data is fast.

Key words: database compression, redundancy, similar data set, lossless compression, compression ratio

中图分类号: