作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (7): 83-85,93. doi: 10.3969/j.issn.1000-3428.2013.07.018

• 先进计算与数据处理 • 上一篇    下一篇

分布式环境中基于改进RS编码的数据存储

周福平,谢 强,丁秋林   

  1. (南京航空航天大学计算机科???学与技术学院,南京 210016)
  • 收稿日期:2012-07-16 出版日期:2013-07-15 发布日期:2013-07-12
  • 作者简介:周福平(1986-),男,硕士研究生,主研方向:海量数据处理;谢 强,副教授;丁秋林,教授、博士生导师

Data Storage Based on Improved RS Coding in Distributed Environment

ZHOU Fu-ping, XIE Qiang, DING Qiu-lin   

  1. (College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China)
  • Received:2012-07-16 Online:2013-07-15 Published:2013-07-12

摘要: 针对目前主流分布式处理平台数据冗余因子过大、数据可用性不高的问题,提出基于改进RS编码的二次分块数据存储方法。将分布式环境中的文件块分成多个数据子块,利用RS编码对其进行编码,并分布存储到不同机器节点中,以减少数据冗余。实验结果表明,该方法能有效降低数据冗余度,提高数据可用性,减少任务执行时间。

关键词: RS编码, 分布式处理, 二次分块, 数据存储, 数据可用性

Abstract: In this paper, a second block storage method is proposed to solve the problems in current distributed processing environments in which data redundancy factor is too high while data availability is low. With the algorithm based on improved Reed-Solomon(RS) coding, the blocks in distributed system can be divided into sub-blocks, then the sub-blocks are encoded and stored in different computers to complete the redundancy of data. Experimental results show that data redundancy and running time are effectively reduced and data availability is increased by this method.

Key words: Reed-Solomon(RS) coding, distributed processing, second block, data storage, data availability

中图分类号: