作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (4): 36-38,43. doi: 10.3969/j.issn.1000-3428.2013.04.009

• 先进计算与数据处理 • 上一篇    下一篇

一种分布式非结构化数据副本管理模型

林 菲1,张万军1,孙 勇2   

  1. (1. 杭州电子科技大学软件工程学院,杭州 310018;2. 浙江交通职业技术学院信息学院,杭州 311112)
  • 收稿日期:2012-05-02 出版日期:2013-04-15 发布日期:2013-04-12
  • 作者简介:林 菲(1977-),女,副教授,主研方向:分布式计算,软件工程;张万军,讲师;孙 勇,副教授
  • 基金资助:
    浙江省自然科学基金资助项目(LY12F02017);浙江省教育厅科研基金资助项目(Y201119644)

A Management Model of Distributed Unstructured Data Replica

LIN Fei 1, ZHANG Wan-jun 1, SUN Yong 2   

  1. (1. College of Software Engineering, Hangzhou Dianzi University, Hangzhou 310018, China; 2. Institute of Information Technology, Zhejiang Institute of Communications, Hangzhou 311112, China)
  • Received:2012-05-02 Online:2013-04-15 Published:2013-04-12

摘要: 针对云存储系统中数据副本管理的延时响应等问题,提出一种面向非结构化数据的分布式副本管理模型。该模型采用机架选举算法,通过提高每个机架能源利用率的方法降低系统整体能耗,为绿色数据中心提供技术保障。运用多路线性散列算法,将数据副本动态均匀地分布到不同机架的不同节点中,以提高系统性能、平衡负载和资源利用率。仿真实验结果证明,与传统的全局映射法相比,该模型可以达到较高的存储与负载平衡,具有良好的扩展性和可用性。

关键词: 分布式, 非结构化, 数据副本, 机架, 线性散, 软件事务内存

Abstract: Aiming at the problem of data replicas management time-delay in the cloud storage system, this paper puts forward a distributed replica management model for unstructured data. The model reduces the system energy consumption through carefully designed rack-selection algorithm, which is the technical basis of green data center. To improve system performance and balance utilization of resources, the model uses a multi-linear-hashing algorithm, which can balance the data replicas distribution in different nodes intelligently. Simulation experimental results show that this model can reach higher storage and load balance compared with traditional global mapping method, it has excellent system performance, scalability and availability.

Key words: distributed, unstructured, data replica, rack, linear hashing, software transactional memory

中图分类号: