Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2013, Vol. 39 ›› Issue (2): 207-210. doi: 10.3969/j.issn.1000-3428.2013.02.042

• Networks and Communications • Previous Articles     Next Articles

Chinese Keyword Extraction Algorithm Based on Competitive Learning Network

SHEN Xue-li, CHENG Yu-wei   

  1. (School of Electronics and Information Engineering, Liaoning Technical University, Huludao 125105, China)
  • Received:2012-03-27 Revised:2012-05-17 Online:2013-02-15 Published:2013-02-13

基于竞争学习网络的中文关键字提取算法

沈学利,程宇伟   

  1. (辽宁工程技术大学电子与信息工程学院,辽宁 葫芦岛 125105)
  • 作者简介:沈学利(1969-),男,教授,主研方向:人工神经网络,信息检索;程宇伟,硕士研究生

Abstract: To solve this problem about the accuracy of the present Chinese keyword extraction algorithm, this paper presents a new keyword extraction algorithm based on competitive learning network. The algorithm adopts the method that it takes the divided word which comes from the Chinese article as the single neuron. And it can get one or more active neurons after these neurons are input the input layer and compete with each other on the competition layer. The keywords of the Chinese article are obtained through merging the weights and clustering analysis. Experimental results show that the hit rate of extracting keywords with this algorithm is higher than the algorithm of Term Frequency-inverse Document Frequency(TF-IDE) and the traditional algorithm named Term Frequency(TF), and has a good robustness.

Key words: keyword extraction, average hit rate, competitive learning network, neuron, input layer, competitive layer

摘要: 为提高中文关键字的提取准确率,提出一种基于竞争学习网络的中文关键字提取算法。对文章进行分词,得到单个词组或短语,视其为单个神经元,将神经元输入竞争学习网络的输入层,通过竞争层上神经元的相互竞争,获得一个或几个活跃的神经元,使用合并权值及聚类分析方法得到文章的关键字。实验结果表明,该算法提取关键字的平均命中率高于词频-逆文档频率算法和传统的词频算法,鲁棒性较好。

关键词: 关键字提取, 平均命中率, 竞争学习网络, 神经元, 输入层, 竞争层

CLC Number: