MAO Xiu-Jing, CHEN Xing-Yuan, YANG Yang-Jie, LI Dun-Feng
Taking the QoS of failure detection into consideration, a Framework of Disaster-Tolerance Oriented Adaptive Failure Detection (DTO-FDF) is presented, which adopts hierarchical modularity design and constructs the DTO-FDF and algorithm from the system view and data circulation sequence. DTO-FDF mainly includes three modules, monitoring module, processing module, responding module. By monitoring and gathering the data of node, according to evaluation index and configuration policy, combining with the designed failure detection algorithm, the system judges the machine is alive or not. Based on the result of decision, the system makes the take-over response and the system migration. The framework makes some benefic work for the study of failure detection module of universality and independence.