作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (05): 62-64. doi: 10.3969/j.issn.1000-3428.2007.05.021

• 软件技术与数据库 • 上一篇    下一篇

冗余数据约简的研究与设计

纪 霞,李龙澍   

  1. (1. 安徽大学计算智能与信号处理教育部重点实验室,合肥 230039;2. 安徽大学计算机科学与技术学院,合肥 230039)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-03-05 发布日期:2007-03-05

Study and Design of Reduction for Redundant Data

JI Xia, LI Longshu   

  1. (1. Key Laboratory of Ministry of Education for Computational Intelligence and Signal Processing, Anhui University, Hefei 230039; 2. School of Computer Science and Technology, Anhui University, Hefei 230039)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-03-05 Published:2007-03-05

摘要: Z. Pawlak于1982年提出的Rough集理论有效地分析了不确定、不精确、不一致等各种不完备信息,其优点是无需任何关于数据的初始的或附加的信息,如统计学中的概率分布。该文介绍了Rough集的基本理论在数据约简中的应用。在分析基于信息系统的粗糙集理论的基础上,描述了一种基于核与重要度的约简算法,从降低约简算法计算复杂度角度出发,修改了属性约简算法,计算了算法修改前后的复杂度。实验结果表明,修改后的算法在降低时间复杂度的同时得出了次优属性集的约简。

关键词: 粗集理论, 约简算法, 重要度

Abstract: Advanced by Z. Pawlak in 1982, the rough set theory analyzes inadequate information efficiently such as inexactitude or dissonant information. The advantage of the rough set theory is no need of any initial or extra information such as the probability dispersal of statistics and so on. This article mainly introduces the applications of the rough set theory in data reduction. On the basis of analyzing the rough set theory based on information system, description of attribute set reduction algorithm based on the core and importance degree is given. In order to reduce the computational complexity of the algorithm, an improved algorithm is put forward. A brief comparison of computation complexity of the old algorithm with the improved one is made. This paper tests the improved algorithm with practical data. The result shows that the improved algorithm reduces the computational complexity and gains the solution inferior to the best attribute reduction in most cases.

Key words: Approximation space theory, Reduction algorithm, Importance degree