作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (1): 178-179,. doi: 10.3969/j.issn.1000-3428.2009.01.061

• 人工智能及识别技术 • 上一篇    下一篇

基于数据挖掘的符号序列聚类相似度量模型

郑宏珍,初佃辉,战德臣,徐晓飞   

  1. (哈尔滨工业大学智能计算中心,264209)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-01-05 发布日期:2009-01-05

Symbolic Sequence Clustering Regular Similarity Model Based on Data Mining

ZHENG Hong-zhen, CHU Dian-hui, ZHAN De-chen, XU Xiao-fei   

  1. (Intelligent Computing Center, Harbin Institute of Technology, Harbin 264209)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-01-05 Published:2009-01-05

摘要: 为了从消费者偏好序列中发现市场细分结构,采用数据挖掘领域中的符号序列聚类方法,提出一种符号序列聚类的研究方法和框架,给出RSM相似性度量模型。调整RSM模型参数,使得RSM可以变为与编辑距离、海明距离等价的相似性度量。通过RSM与其他序列相似性度量的比较,表明RSM具有更强的表达相似性概念的能力。由于RSM能够表达不同的相似性概念,从而使之能适用于不同的应用环境,并在其基础上提出自组织特征映射退火符号聚类模型,使得从消费者偏好进行市场细分结构研究的研究途径在实际应用中得以实现。

关键词: 符号序列聚类, 数据挖掘, 相似性模型

Abstract: From a consumer point of the sequence of preference, data mining is used in the field of symbolic sequence clustering methods to detect market segmentation structure. This paper proposes a symbolic sequence clustering methodology and framework, gives the similarity metric RSM model. By adjusting RSM model, parameters can be changed into RSM and edit distance, Hamming distance equivalent to the similarity metric. RSM is compared with other sequence similarity metric, and is more similar to the expression of the concept of capacity. As to express different similarity, the concept of RSM can be applied to different applications environment. Based on the SOM annealing symbol clustering model, the consumer preference for market segmentation can be studied in the structure, which means it is realized in practical application.

Key words: symbolic sequence clustering, data mining, similarity model

中图分类号: