作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (21): 176-178,181. doi: 10.3969/j.issn.1000-3428.2011.21.060

• 人工智能及识别技术 • 上一篇    下一篇

基于聚类和MRF模型的场景文字提取方法

章天则,赵宇明   

  1. (上海交通大学电子信息与电气工程学院,上海 200240)
  • 收稿日期:2011-03-11 出版日期:2011-11-05 发布日期:2011-11-05
  • 作者简介:章天则(1986-),男,硕士研究生,主研方向:数字图像处理;赵宇明,副教授、博士

Scene Text Extraction Method Based on Clustering and MRF Model

ZHANG Tian-ze, ZHAO Yu-ming   

  1. (School of Electronic Information and Electrical Engineering, Shanghai Jiaotong University, Shanghai 200240, China)
  • Received:2011-03-11 Online:2011-11-05 Published:2011-11-05

摘要: 提出一种从自然场景中提取文本区域的方法。该方法包括候选文本区域的提取,以及候选区域是否为文字区域的判定。候选文字区域的提取,主要利用图像的纹理特征和HSL颜色空间信息,通过改进的模糊C均值聚类函数,结合拉普拉斯掩膜与计算最大梯度差来实现。由连通域边缘密度信息、形状信息的马尔科夫随机场模型,判定候选文字区域是否为文字区域。经ICDAR2003数据库测试结果表明,该方法具有较高的精确度。

关键词: 模糊C均值聚类, HSL颜色空间, 拉普拉斯掩膜, 最大梯度差, 马尔科夫随机场模型

Abstract: This paper proposes a method for extracting text regions from natural scene images. This method includes two parts, text region candidates extraction and candidate regions further classification of text region or non-text region. The text region candidates are extracted through a modified fuzzy C-means clustering algorithm combined with Laplacian mask and maximum gradient difference value, which involves texture features and HSL color space information. The candidate regions are checked by edge density information and shape information of the connected components based on Markov Random Field(MRF) model. The proposed method achieves reasonable accuracy for text extraction from examples of the ICDAR 2003 database.

Key words: fuzzy C-means clustering, HSL color space, Laplacian mask, maximum gradient difference, Markov Random Field(MRF) model

中图分类号: