作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (13): 199-201. doi: 10.3969/j.issn.1000-3428.2009.13.069

• 人工智能及识别技术 • 上一篇    下一篇

基于弱监督学习的产品特征抽取

伍 星,何中市,黄永文   

  1. (重庆大学计算机学院,重庆 400044)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-07-05 发布日期:2009-07-05

Product Feature Extraction Based on Weakly Supervised Learning

WU Xing, HE Zhong-shi, HUANG Yong-wen   

  1. (College of Computer, Chongqing University, Chongqing 400044)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-07-05 Published:2009-07-05

摘要: 产品评论挖掘是从自然语言描述的用户评论中获取信息的过程,产品特征抽取是产品评论挖掘的第1个阶段,产品特征的好坏决定了产品评论挖掘中后续阶段的质量。采用弱监督的学习方法,只需要提供少量的产品特征作为种子,从这些种子出现的语句中抽取文本模式,利用文本模式来发现新的产品特征。实验结果表明,从英文文本中自动抽取产品特征的实验系统,取得了较好的效果。

关键词: BootStrapping算法, 文本模式抽取, 产品评论挖掘

Abstract: The mining of product reviews is the process of extracting information in reviews which is expressed by natural language, the extraction of product feature is the first phrase of the mining of product reviews. The quality of product feature decides the quality of subsequent phrases. This paper adopts weakly supervised methods, which just need a hand of product features as the seeds, using the occurrence sentences of seed to extract text patterns, and using the text patterns to find new product features. Experimental results show that it can extract product feature from English plain text and receive good result.

Key words: BootStrapping algorithm, text pattern extraction, product review mining

中图分类号: