作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (9): 28-30.

• 博士论文 • 上一篇    下一篇

基于依赖结构和 Gibbs Sampling 的离散数据聚类

王双成1,2,俞时权1,2,程新章1,2   

  1. 1.上海立信会计学院信息科学系,上海 201600;2. 上海立信会计学院中国立信风险管理研究院,上海201600
  • 出版日期:2006-05-05 发布日期:2006-05-05

Clustering of Discrete Data Based on Dependency Structure and Gibbs Sampling

WANG Shuangcheng1,2, YU Shiquan1,2, CHENG Xinzhang1,2   

  1. 1. Department of Information Science, Shanghai Lixin University of Commerce, Shanghai 201600;2. Risk management Research Institute, Shanghai Lixin University of Commerce, Shanghai Lixin University of Commerce, Shanghai 201600
  • Online:2006-05-05 Published:2006-05-05

摘要: 建立了一种新的离散数据聚类方法,该方法结合变量之间的依赖结构和Gibbs sampling 进行离散数据聚类,能够显著提高抽样效率,并且避免使用EM 算法进行聚类所带来的问题。试验结果表明,该方法能够有效地进行离散数据的聚类。

关键词: 聚类;离散数据;依赖结构;Gibbs 抽样;MDL 标准

Abstract: In this paper, a new method of clustering discrete data is presented. The dependency structure is combined with the Gibbs sampling to cluster. The efficiency of sampling can be markedly improved and the problems resulted from EM algorithm can be avoided. Experimental results show that this method can effectively cluster discrete data.

Key words: Clustering; Discrete data; Dependency structure; Gibbs sampling; MDL criterion