作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (8): 186-189. doi: 10.3969/j.issn.1000-3428.2011.08.064

• 人工智能及识别技术 • 上一篇    下一篇

哈萨克文脱机手写字符识别系统的研究与实现

达吾勒?阿布都哈依尔 1,2,海拉提?克孜尔别克 1   

  1. (1. 新疆大学信息科学与工程学院,乌鲁木齐 830046;2. 新疆多语种信息技术重点实验室,乌鲁木齐 830046)
  • 出版日期:2011-04-20 发布日期:2012-10-31
  • 作者简介:达吾勒?阿布都哈依尔(1973-),男,副教授、硕士,主研方向:自然语言与信息处理;海拉提?克孜尔别克,讲师、硕士
  • 基金资助:
    国家自然科学基金资助项目(60763005);新疆大学自然科学基金资助项目(XY080125)

Study and Implementation of Kazakh Off-line Handwritten Character Recognition System

Dawel?Abilhayer 1,2, Hayrat?Kezerbek 1   

  1. (1. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China; 2. Key Laboratory of Multilingual Information Technology of Xinjiang, Urumqi 830046, China)
  • Online:2011-04-20 Published:2012-10-31

摘要: 由于手写哈萨克字符结构的特殊性,仅提取几种单一的字符特征进行识别时正确率较低,识别效果较差。由此采用改进的PCA方法定位单词基线位置,对每个字符提取包括笔画密度特征、投影特征、轮廓特征等在内的36种特征,使用K-W检验对各特征的分类能力进行比较,并采用线性判别函数进行分类,取得了较高的识别精度。实验结果表明,该系统针对脱机字符识别率达到94%以上。

关键词: 哈萨克字符, 改进的PCA方法, 字符特征, K-W检验, 线性判别函数

Abstract: Owing to the special structure of handwritten Kazakh characters, just using several simple features to recognize Kazakh characters encounter the problems of low recognition rate and poor recognition effects. This paper determines the word baseline by using improved PCA method, extracts 36 features for each character and the K-W checking is used to compare the classification ability of each feature. It uses the linear discriminant function to classify the characters and acquires a higher recognition rate. Experimental result shows that to off-line handwritten character the recognition accuracy achieves 94%.

Key words: Kazakh character, improved PCA method, character feature, K-W checking, linear discriminant function

中图分类号: