作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (14): 52-54. doi: 10.3969/j.issn.1000-3428.2009.14.018

• 软件技术与数据库 • 上一篇    下一篇

面向恢复的集群计算技术

鲁晓佩,廖湘科,卢宇彤   

  1. (国防科技大学计算机学院,长沙 410073)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-07-20 发布日期:2009-07-20

Recovery-Oriented Cluster Computing Technology

LU Xiao-pei, LIAO Xiang-ke, LU Yu-tong   

  1. (School of Computer, National University of Defense Technology, Changsha 410073)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-07-20 Published:2009-07-20

摘要: 针对面向恢复计算(ROC)技术致力于在故障发生后使系统尽快恢复,从而提高系统可用性,而非从根本上避免故障发生的特点,对面向恢复的相关技术进行研究,给出ROC技术在集群系统中的应用,提出基于节点组的递归重启方法和基于Checkpoint的Undo恢复模型,用以提高集群系统的可用性,并对2种方法的改善效果进行评估。

关键词: 可用性, 面向恢复计算, 递归重启, Undo模型, 集群

Abstract: Recovery-Oriented Computing(ROC) improves system availability by repairing the system as soon as possible, instead of avoiding failure. This paper studies ROC techniques, gives its application in cluster system, proposes the method of Recursive Restartability(RR) based on node group and Undo recovery model based on Checkpoint to improve system availability. It evaluates the improvement effect of the methods.

Key words: availability, Recovery-Oriented Computing(ROC), recursive restartability, Undo model, cluster

中图分类号: