Computer Engineering ›› 2007, Vol. 33 ›› Issue (05): 283-285. doi: 10.3969/j.issn.1000-3428.2007.05.100

• Development Research and Design Technology •

Analysis of Best Checkpoint Interval of Duplicated Fault Tolerance System

YAN Xiai (鄢喜爱)1,2, YANG Jinmin (杨金民)1, TIAN Hua (田华)2

  1. Software College, Hunan University, Changsha 410082; 2. Hunan Public Security College, Changsha 410138
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2007-03-05 Published: 2007-03-05

Abstract: Checkpointing is an important means by which a fault-tolerant computer system recovers from faults. A checkpoint interval that is too large or too small degrades system performance, so choosing an appropriate interval is an important factor in optimizing that performance. For a duplicated (dual-machine) fault-tolerant system, this paper proposes a system model based on checkpoint setting and rollback recovery, and uses a Markov chain to derive an equation for the best checkpoint interval. Experiments confirm the correctness of the equation.

Key words: Duplicated fault tolerance, Rollback recovery, Checkpoint interval