
Computer Engineering, 2011, Vol. 37, Issue (4): 67-69. doi: 10.3969/j.issn.1000-3428.2011.04.024

• Software Technology and Database •

Improved Method of Component Clustering Based on Latent Semantic Analysis

REN Yao-peng 1,2, CHEN Li-chao 1, ZHANG Ying-jun 1, XIE Bin-hong 1

  1. School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
  2. Department of Computer Science and Technology, Yuncheng University, Yuncheng 044000, China
  • Online: 2011-02-20 Published: 2011-02-17
  • About the authors: REN Yao-peng (born 1982), female, master's student, research interests: semantic analysis and data mining; CHEN Li-chao, professor; ZHANG Ying-jun, senior engineer, M.S.; XIE Bin-hong, lecturer, M.S.
  • Supported by:
    Natural Science Foundation of Shanxi Province project "Research on Key Technologies of Semantics-Based Component Retrieval" (2009011022-1)

Abstract: Component clustering methods based on the Vector Space Model (VSM) suffer from high-dimensional, sparse representations and cannot handle synonyms. To address these problems, the Latent Semantic Analysis (LSA) model is used to cluster components. Starting from the aspects that users care about, a grade strategy is introduced, yielding an improved LSA-based component clustering algorithm. Experimental results show that the method improves the quality of component clustering, makes the clustering results better match user requirements and more user-friendly, and improves the efficiency and accuracy of component retrieval.

Key words: faceted classification, Latent Semantic Analysis (LSA), grade strategy, component clustering
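
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it builds on: projecting VSM term vectors into a low-rank latent space so that components described with different but related terms end up close together. Everything here is an assumption made for illustration: the toy component descriptions, the function names, the choice of k = 2, and the greedy threshold grouping (a simple stand-in, not the paper's clustering method). The grade strategy is not reproduced because the abstract gives no details of it. To stay dependency-free, the truncated SVD is obtained from the top eigenpairs of the Gram matrix by power iteration with deflation, in place of a library SVD routine.

```python
import math

# Hypothetical component descriptions (illustrative data, not from the paper).
components = [
    ["sort", "array"],     # c0
    ["order", "list"],     # c1: same intent as c0, but no shared terms
    ["sort", "list"],      # c2
    ["order", "array"],    # c3
    ["encrypt", "file"],   # c4
    ["cipher", "file"],    # c5
]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def vsm_vectors(docs):
    """Binary term vectors over the sorted vocabulary (plain VSM)."""
    vocab = sorted({term for doc in docs for term in doc})
    return [[1.0 if term in doc else 0.0 for term in vocab] for doc in docs]

def top_eigenpairs(matrix, k, iterations=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(matrix)
    work = [row[:] for row in matrix]
    pairs = []
    for _component in range(k):
        v = [1.0] * n
        for _step in range(iterations):
            w = [sum(work[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue for the converged unit vector.
        lam = sum(v[i] * sum(work[i][j] * v[j] for j in range(n)) for i in range(n))
        pairs.append((lam, v))
        # Deflate so the next pass converges to the next-largest eigenpair.
        for i in range(n):
            for j in range(n):
                work[i][j] -= lam * v[i] * v[j]
    return pairs

def lsa_vectors(docs, k):
    """Rank-k latent coordinates of each document (rows of V_k * Sigma_k)."""
    vectors = vsm_vectors(docs)
    n = len(vectors)
    gram = [[sum(a * b for a, b in zip(vectors[i], vectors[j])) for j in range(n)]
            for i in range(n)]
    pairs = top_eigenpairs(gram, k)
    return [[math.sqrt(max(lam, 0.0)) * v[i] for lam, v in pairs] for i in range(n)]

def threshold_clusters(vectors, threshold=0.9):
    """Greedy grouping: join the first cluster whose seed is similar enough."""
    clusters = []
    for idx, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vectors[cluster[0]], vec) >= threshold:
                cluster.append(idx)
                break
        else:
            clusters.append([idx])
    return clusters

vsm = vsm_vectors(components)
lsa = lsa_vectors(components, k=2)
print("VSM clusters:", threshold_clusters(vsm))
print("LSA clusters:", threshold_clusters(lsa))
```

On this toy data, c0 ("sort array") and c1 ("order list") share no terms, so their VSM cosine is zero and the threshold grouping leaves every component in its own cluster. In the rank-2 latent space the co-occurrences through c2 and c3 pull the four sorting components together, while the two encryption components form a second group.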

CLC Number: