作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (12): 17-18,21.

• 博士论文 • 上一篇    下一篇

一种新的通用概念层次编码方法

袁军鹏 1,陈铿 2,黄进 2,李连宏2,杨雨 2,朱东华2,李俊峰2   

  1. 1. 清华大学公共管理学院,北京 100084;2. 北京理工大学管理与经济学院,北京 100081
  • 出版日期:2006-06-20 发布日期:2006-06-20

A Novel Generic Concept Hierarchy Encoding Algorithm

YUAN Junpeng1, CHEN Keng2, HUANG Jin2, LI Lianhong2, YANG Yu2, ZHU Donghua2, LI Junfeng2   

  1. 1. School of Public Policy & Management, Tsinghua University, Beijing 100084;2. School of Management & Economics, Beijing Institute of Technology, Beijing 100081
  • Online:2006-06-20 Published:2006-06-20

摘要: 概念层次是目前数据挖掘和知识发现的前沿性方法。为了把概念层次用于数据挖掘,需要解决如何从现有数据集自动生成概念层次,如何存储和处理网状概念层次结构及如何提高概念层次结构的搜索效率等问题。文章提出适用于任何数据挖掘功能的通用编码方法――基于层次域的概念层次实数编码法,该方法有效地解决了概念层次的存储和检索问题,并在微机电系统领域进行了与典型算法的对比分析。

关键词: 概念层次;编码算法;数据挖掘

Abstract: As one of the useful background knowledge, concept hierarchies organize data or concepts in hierarchical forms or in certain partial order, which are used for expressing knowledge in concise, high-level terms, and facilitating mining knowledge at multiple levels of abstraction. To incorporate the concept hierarchies into a data mining system, encoding plays a key role. A novel generic encoding algorithm is proposed which can be treated as a generic purpose encoding strategy suitable for any data mining functionalities. The partial order of the hierarchy is exactly represented by the codes so that it only needs to manipulate the codes when processing mining tasks.

Key words: Concept hierarchy; Encoding algorithm; Data mining