作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2018, Vol. 44 ›› Issue (7): 67-73. doi: 10.19678/j.issn.1000-3428.0047726

• 体系结构与软件技术 • 上一篇    下一篇

一种基于日志聚类的多类型故障预测方法

王卫华,应时,贾向阳,王冰明,程国力   

  1. 武汉大学 计算机学院 软件工程国家重点实验室,武汉 430072
  • 收稿日期:2017-06-26 出版日期:2018-07-15 发布日期:2018-07-15
  • 作者简介:王卫华(1994—),女,硕士研究生,主研方向为计算机软件与理论;应时,教授;贾向阳,讲师;王冰明、程国力,博士研究生。
  • 基金资助:

    国家自然科学基金“基于SaaS软件运行日志分析的软件性能问题的在线识别和诊断方法”(61672392);国家重点研发计划项目“国产化高等级病原微生物模式实验室建设及管理体系研究”(2016YFC1202204)。

A Multi-type Failure Prediction Method Based on Log Clustering

WANG Weihua,YING Shi,JIA Xiangyang,WANG Bingming,CHENG Guoli   

  1. State Key Laboratory of Software Engineering,Computer School,Wuhan University,Wuhan 430072,China
  • Received:2017-06-26 Online:2018-07-15 Published:2018-07-15

摘要:

现有故障预测方法的日志事件多数是无规律交错存在的,且不同类型的故障事件所涉及的事件数量与时间范围存在一定差异。为在故障预测时能够提供故障相关信息,提出一种基于频繁日志事件序列对多种不同类型的故障进行预测的方法。以日志事件序列之间的最长公共子序列作为其相似性度量,使用聚合层次聚类算法挖掘与故障事件相关的频繁事件序列。在频繁事件序列的基础上生成故障事件预测规则,并给出一种对故障事件预测规则进行过滤的方法,将过滤后的规则应用到测试集上进行故障预测。实验结果表明,该方法不仅能够进行有效的故障预测,而且可以平衡故障预测的准确率和召回率。

关键词: 故障事件预测, 频繁事件序列, 规则过滤, 聚合层次聚类算法, 超级计算机

Abstract:

Most of the existing fault prediction methods have log event staggered,and there are certain difference in the number of events and the time range involved in different types of faule events.In order to provide fault-related information during fault prediction,a method for predicting multiple different types of faults based on frequent log event sequences is proposed.Taking the Longest Common Subsequence(LCS) between log event sequences as its similarity measure,an aggregational hierarchical algorithm is used to mine frequent event sequences related to failure events.A fault event prediction rule is generated on the basis of a sequence of frequent events,and a method of filtering a fault event prediction rule is provided,and the filtered rule is applied to a test set for fault prediction.Experimental results show that this method can not only effectively predict failures,but also balance the accuracy and recall rate of failure prediction.

Key words: failure event prediction, frequent event sequence, rule filtering, aggregational hierarchical clustering algorithm, supercomputer

中图分类号: