
Computer Engineering ›› 2007, Vol. 33 ›› Issue (03): 48-49. doi: 10.3969/j.issn.1000-3428.2007.03.018

• Software Technology and Database •

Multiple Nearest Neighbor Algorithm Based on Data Mining

ZHENG Hongzhen1, LIU Yang2, ZHAN Dechen2

  1. College of Computer Science and Technology, Harbin Institute of Technology, Weihai 264209; 2. College of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001
  • Online: 2007-02-05 Published: 2007-02-05

Abstract: To address the model combination problem in data mining, this paper reviews the theory and techniques of combined models and analyzes how combination theory is currently applied to the nearest neighbor (NN) method. It proposes MNN (multiple nearest neighbor), an algorithm that builds multiple NN classifiers, each on a random subset of attributes, and combines their outputs by simple voting. MNN effectively improves the classification accuracy of the nearest neighbor method. Compared with NN-ECOC, MNN has two main advantages: (1) it is a simpler method; (2) it is not limited by multi-class problems.

Key words: Data mining, Classification model, Multiple model
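
The abstract describes MNN only at a high level: several NN classifiers, each restricted to a random attribute subset, combined by simple voting. The following Python sketch illustrates that idea under stated assumptions and is not the authors' implementation. The function names (`nn_predict`, `mnn_predict`), the squared Euclidean distance, the number of classifiers, the subset size, and the tie-breaking rule are all hypothetical choices that the abstract does not specify.

```python
import random
from collections import Counter


def nn_predict(train_X, train_y, x, attrs):
    """1-NN prediction using only the attribute indices in `attrs`.

    Assumption: squared Euclidean distance; the abstract names no metric.
    """
    nearest = min(
        range(len(train_X)),
        key=lambda i: sum((train_X[i][a] - x[a]) ** 2 for a in attrs),
    )
    return train_y[nearest]


def mnn_predict(train_X, train_y, x, n_classifiers=15, subset_size=2, seed=0):
    """Combine several NN classifiers, each on a random attribute subset.

    Assumptions: `n_classifiers`, `subset_size`, and `seed` are illustrative
    parameters. Ties in the vote go to the label that was counted first.
    """
    rng = random.Random(seed)
    n_attrs = len(train_X[0])
    votes = Counter()
    for _ in range(n_classifiers):
        # Each component classifier sees its own random subset of attributes.
        attrs = rng.sample(range(n_attrs), subset_size)
        votes[nn_predict(train_X, train_y, x, attrs)] += 1
    # Simple (unweighted) majority vote over the component predictions.
    return votes.most_common(1)[0][0]


# Toy three-class data with four numeric attributes (made up for illustration).
train_X = [
    [0, 0, 0, 0], [1, 0, 1, 0],
    [5, 5, 5, 5], [6, 5, 6, 5],
    [10, 10, 10, 10], [11, 10, 11, 10],
]
train_y = ["a", "a", "b", "b", "c", "c"]

print(mnn_predict(train_X, train_y, [0.4, 0.2, 0.3, 0.1]))  # prints "a"
```

Because the vote is taken over class labels directly, the same code handles two-class and multi-class data without any change, which matches the abstract's remark about multi-class problems.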