作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (11): 281-282.

• 开发研究与设计技术 • 上一篇    下一篇

金融业务数据库的数据压缩方法

贾永洁,王耀强,郑 骏   

  1. (中国建设银行上海开发中心,上海 201204)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-06-05 发布日期:2008-06-05

Data Compression Method of Financial Business Database

JIA Yong-jie, WANG Yao-qiang, ZHENG Jun   

  1. (Shanghai Development Center, China Construction Bank, Shanghai 201204)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-06-05 Published:2008-06-05

摘要: 针对金融业务中实时数据库的数据存储特点,提出结构混合压缩(SMC)算法。SMC算法利用金融数据具有纯文本、数据分散和数据项内重复少的特点,以哈夫曼编码作为算法基础,根据词频将单字和词组混合,在哈夫曼树中引入数组结构,对文本数据进行压缩。测试结果表明,SMC算法的平均数据压缩率比原始哈夫曼算法提高了约13%。

关键词: 数据压缩, 压缩算法, 哈夫曼编码

Abstract: This paper gives a new data compression algorithm——Structured Mixed Compression(SMC) algorithm, which can adapt to the characteristics of real-time database used in financial business. In financial business, the data are dispersed, formatted as text, and there are few duplicate fields in a same data item, so the SMC algorithm mixes words and phrases according to the word frequency, and imports the array to the Huffman tree to compress the business data. According to the test, result shows that the average compression ratio by using SMC algorithm is 15% more than that by using Huffman encoding.

Key words: data compression, compression algorithm, Huffman encoding

中图分类号: