Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2011, Vol. 37 ›› Issue (23): 43-45. doi: 10.3969/j.issn.1000-3428.2011.23.014

• Networks and Communications • Previous Articles     Next Articles

Research on Pruning Algorithm of Product Feature Mining in Chinese Review

LI Shi a, LI Qiu-shi b   

  1. (a. School of Information and Computer Engineering; b. School of Civil Engineering, Northeast Forestry University, Harbin 150041, China)
  • Received:2011-06-20 Online:2011-12-05 Published:2011-12-05

中文评论中产品特征挖掘的剪枝算法研究

李 实a,李秋实b   

  1. (东北林业大学 a. 信息与计算机工程学院;b. 土木工程学院,哈尔滨 150041)
  • 作者简介:李 实(1976-),女,讲师、博士,主研方向:电子商务,数据挖掘;李秋实,讲师、硕士
  • 基金资助:
    国家自然科学基金资助项目(71001023);黑龙江省教育厅科研基金资助项目(11553023);中央高校基本科研业务费专项基金资助项目(DL11BB25)

Abstract: This paper focuses on product features mining from reviews of Chinese network customers and proposes a method based on Apriori algorithm which is an unsupervised mining method. It extracts the candidate features collection by Apriori algorithm, and takes redundancy pruning and compactness pruning algorithms. According to the experimental research results, it establishes adjacent words value and p-support value. Results show that the precision and recall of mining method are effective improved by two proposed pruning algorithms.

Key words: review mining, association rule, product feature, pruning, unstructured information, unsupervised learning

摘要: 针对中文网络客户评论中的产品特征挖掘问题,提出一种基于Apriori算法的非监督挖掘方法。利用Apriori算法挖掘候选特征集合,设计邻近规则剪枝算法和最小独立支持度剪枝算法,并通过实验确定邻近规则距离值和最小独立支持度。实验结果表明,这2种剪枝算法均能有效提高产品特征挖掘的查准率和查全率。

关键词: 评论挖掘, 关联规则, 产品特征, 剪枝, 非结构化信息, 非监督学习

CLC Number: