Abstract: A form recognition method is presented for identifying multiple types of forms; it can be applied to processing documents based on a particular style of application form. The system uses the power spectral density of a form's projection profile as an invariant feature vector. To handle forms whose structures are symmetric to one another, a new feature extraction approach is proposed: the form image is partitioned into regions, its features in both the horizontal and the vertical direction are taken into account, and the power spectral density of each region's projection profile is used as the feature vector of the form image. Experiments show that this approach effectively solves the recognition of forms with symmetric structures.
Key words:
Feature extraction; Area division; Form identification; Power spectral density
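The feature described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: a binary image stored as nested lists, a simple grid partition (`n_rows` by `n_cols`), and a periodogram-style estimate (squared DFT magnitude divided by the profile length) standing in for the power spectral density. The function names are hypothetical and this is not the authors' implementation. One plausible reading of the symmetry problem is that the power spectrum of a whole-image profile does not change when the profile is reversed, so a form and its mirror image produce the same global feature vector, whereas per-region profiles do not.

```python
import cmath


def power_spectrum(profile):
    """Periodogram of a 1-D profile: |DFT|^2 / N for bins 0 .. N//2."""
    n = len(profile)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, x in enumerate(profile))
        spectrum.append(abs(coeff) ** 2 / n)
    return spectrum


def projection_profiles(region):
    """Return (pixel count per row, pixel count per column) of a region."""
    per_row = [sum(row) for row in region]
    per_col = [sum(col) for col in zip(*region)]
    return per_row, per_col


def zoned_psd_features(image, n_rows=2, n_cols=2):
    """Concatenate the power spectra of both profiles of every grid region.

    The grid size is an assumption; n_rows = n_cols = 1 reduces to the
    whole-image (non-partitioned) feature.
    """
    height, width = len(image), len(image[0])
    features = []
    for i in range(n_rows):
        top, bottom = i * height // n_rows, (i + 1) * height // n_rows
        for j in range(n_cols):
            left, right = j * width // n_cols, (j + 1) * width // n_cols
            region = [row[left:right] for row in image[top:bottom]]
            for profile in projection_profiles(region):
                features.extend(power_spectrum(profile))
    return features


# Toy "form": 1 marks a ruling-line pixel. The mirrored copy has the
# same layout flipped left to right.
form = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 0, 1, 0, 0, 0, 0, 1],
    [1, 0, 1, 0, 0, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1, 1],
]
mirrored = [row[::-1] for row in form]


def max_difference(a, b):
    """Largest absolute element-wise difference between two feature vectors."""
    return max(abs(x - y) for x, y in zip(a, b))


# Whole-image features versus 2 x 2 partitioned features.
global_gap = max_difference(zoned_psd_features(form, 1, 1),
                            zoned_psd_features(mirrored, 1, 1))
zoned_gap = max_difference(zoned_psd_features(form, 2, 2),
                           zoned_psd_features(mirrored, 2, 2))
print("whole-image feature gap:", global_gap)
print("partitioned feature gap:", zoned_gap)
```

In this toy case the whole-image gap is zero up to floating-point rounding, while the partitioned gap is clearly nonzero, which is the kind of separation between mirror-symmetric forms that the abstract attributes to region division. A practical system would likely use an FFT routine and a normalization step, neither of which is specified in the abstract.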
黄锦德,郝红卫,张冬霞. 一种新的表格识别特征提取方法[J]. 计算机工程, 2006, 32(6): 215-217.
HUANG Jinde, HAO Hongwei, ZHANG Dongxia. A New Feature Extraction of Forms Recognition[J]. Computer Engineering, 2006, 32(6): 215-217.