
Computer Engineering ›› 2012, Vol. 38 ›› Issue (7): 4-6. doi: 10.3969/j.issn.1000-3428.2012.07.002

• Doctoral Dissertation •

面向容灾的自适应故障检测框架研究

毛秀青,陈性元,杨英杰,李俊峰   

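  1. (Institute of Electronic Technology, PLA Information Engineering University, Zhengzhou 450004, China)
  • Received: 2011-08-19 Online: 2012-04-05 Published: 2012-04-05
  • About the authors: MAO Xiu-qing (born 1980), male, lecturer and Ph.D. candidate, research interests: information security, disaster-tolerant backup; CHEN Xing-yuan, professor and doctoral supervisor; YANG Ying-jie, associate professor, Ph.D.; LI Jun-feng, master's student
  • Supported by:
    National "863" Program of China (2009AA01Z438); Henan Province Basic and Frontier Technology Research Program (102300413203)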

Research on Adaptive Failure Detection Framework Oriented on Disaster-tolerance

MAO Xiu-qing, CHEN Xing-yuan, YANG Ying-jie, LI Jun-feng   

  1. (Institute of Electronic Technology, PLA Information Engineering University, Zhengzhou 450004, China)
  • Received:2011-08-19 Online:2012-04-05 Published:2012-04-05

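Abstract: Based on the reliability requirements of distributed computing, an adaptive failure detection framework oriented to disaster tolerance is proposed. The framework adopts a layered, modular design and, from a system perspective, builds a disaster-tolerance-oriented adaptive failure detection algorithm that follows the order of data flow. It consists mainly of three modules: a monitoring module, a processing module, and a response module. By monitoring and collecting data about nodes or processes, and by applying the established evaluation metrics and configuration policies together with the failure detection algorithm, the framework determines whether a host is alive. Based on that decision, it carries out the take-over response and system migration when a disaster occurs. Implementation results show that the framework improves the generality and independence of the failure detection component.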

Keywords: disaster tolerance, failure detection, adaptive, monitoring, system migration

Abstract: Taking the QoS of failure detection into consideration, this paper presents a Disaster-Tolerance-Oriented Adaptive Failure Detection Framework (DTO-FDF). The framework adopts a hierarchical, modular design, and both the framework and its algorithm are constructed from a system-level view, following the order of data flow. DTO-FDF consists mainly of three modules: a monitoring module, a processing module, and a responding module. By monitoring and gathering node data, and by applying the evaluation indices and configuration policies together with the designed failure detection algorithm, the system judges whether a machine is alive. Based on that decision, the system performs the take-over response and system migration. The framework is a useful contribution to the study of the universality and independence of failure detection modules.
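The three-module data flow described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the detection algorithm, so the adaptive timeout used here (mean heartbeat interval plus `k` standard deviations) is a common stand-in and not the authors' method. The class names `Monitor`, `Processor`, and `Responder`, the helper `check`, and the default values of `k`, `window`, and `initial_timeout` are all hypothetical.

```python
from collections import deque
from statistics import mean, pstdev


class Monitor:
    """Monitoring module: keeps a sliding window of heartbeat arrival times."""

    def __init__(self, window=100):
        self.arrivals = deque(maxlen=window)

    def record(self, timestamp):
        self.arrivals.append(timestamp)

    def intervals(self):
        # Gaps between consecutive heartbeats in the window.
        times = list(self.arrivals)
        return [later - earlier for earlier, later in zip(times, times[1:])]


class Processor:
    """Processing module: decides liveness with an adaptive timeout.

    Assumption: timeout = mean(gaps) + k * stddev(gaps). This is a
    stand-in technique, not the algorithm from the paper.
    """

    def __init__(self, k=3.0, initial_timeout=5.0):
        self.k = k
        self.initial_timeout = initial_timeout

    def timeout(self, monitor):
        gaps = monitor.intervals()
        if len(gaps) < 2:
            # Too little history to adapt; fall back to the configured value.
            return self.initial_timeout
        return mean(gaps) + self.k * pstdev(gaps)

    def is_alive(self, monitor, now):
        if not monitor.arrivals:
            # No heartbeat has ever been seen: treat the node as not alive.
            return False
        return now - monitor.arrivals[-1] <= self.timeout(monitor)


class Responder:
    """Response module: requests a take-over once per detected failure."""

    def __init__(self):
        self.taken_over = set()

    def respond(self, node, alive):
        if alive:
            self.taken_over.discard(node)
            return "none"
        if node in self.taken_over:
            return "none"  # take-over already requested for this failure
        self.taken_over.add(node)
        return "takeover"


def check(node, monitor, processor, responder, now):
    """Run one monitoring -> processing -> response pass for a node."""
    return responder.respond(node, processor.is_alive(monitor, now))
```

With heartbeats recorded at regular one-second intervals, the timeout settles at the observed interval; with jittery heartbeats it widens, which is the sense in which this sketch is adaptive. Keeping the three classes separate mirrors the modular split in the abstract, so the detection rule in `Processor` could be swapped without touching monitoring or response.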

Key words: disaster-tolerance, failure detection, adaptive, monitoring, system migration

CLC Number: