计算机工程 ›› 2009, Vol. 35 ›› Issue (17): 92-93,9.doi: 10.3969/j.issn.1000-3428.2009.17.031

• 软件技术与数据库 • 上一篇    下一篇

代价敏感的缺失数据有序填充算法

苏毅娟,钟 智   

  1. (广西师范学院计算机与信息工程学院,南宁 530023)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-09-05 发布日期:2009-09-05

Cost-sensitive Missing Data Imputing Algorithm with Ordering

SU Yi-juan, ZHONG Zhi   

  1. (College of Computer and Information Engineering, Guangxi Teachers Education University, Nanning 530023)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-09-05 Published:2009-09-05

摘要: 缺失数据填充效果会对学习算法和挖掘算法的后续处理过程产生影响。针对代价敏感决策树方法没有同时考虑填充顺序和填充代价的问题,提出一种有序填充缺失数据的算法,综合考虑经济因素和建立填充器所需的有效信息。实验结果表明其预测准确率和分类准确率高于现有算法。

关键词: 代价敏感学习, 缺失数据填充, 填充顺序

Abstract: Missing data imputing effect affects the following processes of the learning algorithms and mining algorithms. Cost-sensitive decision tree method does not consider the imputing order and imputing cost at the same time. Aiming at this problem, this paper proposes a new algorithm to impute missing data with ordering. This algorithm considers the economic factor and effective information for imputing machine establishment synthetically. Experimental results demonstrate that this algorithm has high prediction accuracy and classification accuracy than existing algorithms.

Key words: cost-sensitive learning, missing data imputing, imputing order

中图分类号: