作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (11): 10-12. doi: 10.3969/j.issn.1000-3428.2011.11.004

• 博士论文 • 上一篇    下一篇

基于多项式加工树模型的列联表数据挖掘

游 源1,齐 欢1,胡祥恩2   

  1. (1. 华中科技大学控制科学与工程系,武汉 430074;2. 孟菲斯大学心理系,田纳西 孟菲斯 38152)
  • 收稿日期:2010-12-04 出版日期:2011-06-05 发布日期:2011-06-05
  • 作者简介:游 源(1984-),男,博士研究生,主研方向:多项式加工树模型理论,数据分析;齐 欢,教授、博士生导师;胡祥恩,教授、博士
  • 基金资助:

    国家自然科学基金资助项目(60774036);美国国家自然科学基金资助项目(0616657);湖北省自然科学基金资助重点项目(2008CDA063)

Data Mining of Contingence Table Based on Multinomial Processing Tree Model

YOU Yuan1, QI Huan1, HU Xiang-en2   

  1. (1. Department of Control Science & Engineering, Huazhong University of Science and Technology, Wuhan 430074, China; 2. Department of Psychology, University of Memphis, Memphis 38152, USA)
  • Received:2010-12-04 Online:2011-06-05 Published:2011-06-05

摘要:

针对表格数据挖掘中优势点信息缺失的问题,提出一种列联表自动数据挖掘方法。依据原表中优势点位置,应用多项式加工树模型相关理论对原始数据进行自动树状模型拟合与聚类分组,生成待选假设关系集合,并最终完成参数估计以及拟合优度检验。通过实例证明该算法能够有效提取出优势点的隐含信息与特异规则。

关键词: 多项式加工树模型, 列联表, 数据挖掘, 优势点分析, 规则提取

Abstract:

According to the existing problems of dominate cells information missing, an automatic data mining method of contingence table is purposed. According to the location of dominate cells, the categories in the original table are classified and represented by tree structure models on the basis of relevant theories of Multinomial Processing Tree(MPT) models, the following processes including hypothesis generations, parameter estimation and goodness-of-fit test is performed automatically. Application result shows the approach can effectively extract the latent interactions and peculiarity association rules.

Key words: Multinomial Processing Tree(MPT) model, contingence table, data mining, dominate cell analysis, rules extraction

中图分类号: