作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (16): 201-202. doi: 10.3969/j.issn.1000-3428.2009.16.072

• 人工智能及识别技术 • 上一篇    下一篇

基于树桩网络的贝叶斯文本分类算法

杨延娇,王治和   

  1. (西北师范大学数学与信息科学学院,兰州 730070)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-08-20 发布日期:2009-08-20

Bayes Text Classification Algorithm Based on Stump Network

YANG Yan-jiao, WANG Zhi-he   

  1. (College of Mathematics and Information Science, Northwest Normal University, Lanzhou 730070)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-08-20 Published:2009-08-20

摘要: 分析贝叶斯文本分类算法的不足,提出相应的改进算法。放宽朴素贝叶斯文本分类模型中的属性独立性假设,采用一种改进的基于贝叶斯定理的文本分类模型“树桩网络”,改进朴素贝叶斯文本分类模型。实验证明,改进后的文本分类模型适合于文本分类的需要,改善了原有分类器的性能。

关键词: 文本分类, 朴素贝叶斯, 属性独立性假设, 树桩网络

Abstract: This paper analyzes the shortcomings of Bayes and puts forward a better method to improve it. It releases attribute independence assumption of Naive Bayes text classifier. An improved text classification model based on Bayes theorem called stump network is presented to amend the Naive Bayes text classifier. Experiment shows that the revised text categorization model meets the need of text categorization, and improves the performance of former one.

Key words: text classification, Naive Bayes, attribute independence assumption, stump network

中图分类号: