作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (11): 98-99. doi: 10.3969/j.issn.1000-3428.2007.11.036

• 软件技术与数据库 • 上一篇    下一篇

防干扰的不良网页过滤算法研究

赖勇浩1,2,谢赞福1   

  1. (1. 广东技术师范学院计算机科学系,广州 510665;2. 广州网易互动娱乐有限公司,广州 510665)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-06-05 发布日期:2007-06-05

Research on Anti-jamming Bad Web Filter Algorithm

LAI Yonghao1,2, XIE Zanfu1   

  1. (1. Dept. of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665; 2. Guangzhou Netease Interactive Entertainment Ltd., Guangzhou 510665)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-06-05 Published:2007-06-05

摘要: 提出了一种通过优化词典匹配判定文本性质的改进算法。通过基于实时分析文本内容来判定文本性质,每秒可分析20万个汉字,实时有效地识别网页上的不良文本。可抗干扰的不良网页过滤器是基于防干扰预处理原理和防误判算法设计开发的,使识别率95%以上、误判率降低1%以下,为进一步防堵垃圾信息提供了基础。

关键词: 网页过滤算法, 防至扰预处理, 词典匹配算法

Abstract: Bring forward an improved algorithm which judge the kind of text through optimizing dictionary match. This algorithm judges the kind of text basing on real-time analysis of the content of text, by analyzing two-hundred thousand characters per second, it judges bad text in Web real-time and effectively. The bad Web filter based on the algorithm of anti-interferential pretreatment and anti-miscarriage of justice improves the recognition rate(above 95%) and reduces the anti-miscarriage of justice rate. It offers more substantial foundation of keeping away unuseful information.

Key words: Web filter algorithm, Anti-jamming preprocessing, Dictionary match algorithm

中图分类号: