基于HOG的目标分类特征深度学习模型

doi:10.3969/j.issn.1000-3428.2016.12.031

计算机工程

基于HOG的目标分类特征深度学习模型

何希平^a,b,c,张琼华^d,刘波 ^a,b,c

(重庆工商大学 a.电子商务及供应链系统重庆市重点实验室;b.重庆市检测控制集成系统工程实验室; c.计算机科学与信息工程学院; d.图书馆,重庆 400067)

收稿日期:2015-10-19 出版日期:2016-12-15 发布日期:2016-12-15
作者简介:何希平(1968—),男,教授、博士,主研方向为计算机视觉、机器学习、信息安全;张琼华,馆员;刘波,讲师、博士。
基金资助:
重庆市教委科学技术研究计划项目(KJ1400612)。

Deep Learning Model of Object Classification Feature Based on HOG

HE Xiping^a,b,c,ZHANG Qionghua ^d,LIU Bo ^a,b,c

(a.Chongqing Key Laboratory of Electronic Commerce and Supply Chain System;b.Chongqing Engineering Laboratory for Detection,Control and Integrated System;c.School of Computer Science and Information Engineering; d.Library,Chongqing Technology and Business University,Chongqing 400067,China)

Received:2015-10-19 Online:2016-12-15 Published:2016-12-15

摘要/Abstract

摘要： 为提高低配置计算环境中的视觉目标实时在线分类特征提取的时效性和分类准确率,提出一种新的目标分类特征深度学习模型。根据高时效性要求,选用分类器模型离线深度学习的策略,以节约在线训练时间。针对网络深度受限和高识别率要求,提取图像的局部方向梯度直方图(HOG)特征,构建稀疏自编码器栈对HOG特征进行深层次编码,设计Softmax多分类器对所抽取的特征进行分类。在深度神经网络模型学习过程中,引入最小化各层结构风险和微调全网参数的二阶段最优化策略。利用场景图像库Caltech101和手写数字库MNIST的训练样本与测试样本进行对比实验,结果表明,该模型在局部特征提取方面的时效优于单层卷积神经网络(CNN)模型,分类准确率高于CNN、栈式自编码器等对比模型。

关键词: 计算机视觉, 目标分类, 方向梯度直方图特征, 栈式自编码器, 深度学习

Abstract: To improve the feature extraction timeliness and classification validity of real-time classification of visual object in low computing profile,a Histogram of Oriented Gradients(HOG)-based feature deep learning model for object classification is proposed.For the requirements of high timeliness,offline deep learning strategy is applied to the classifier model to save its online training time.In view of the requirements of network depth limitation and high recognition rate,the local feature of HOG feature of an image is extracted to be used as the input of the sparse autoencoder stack so as to output the high level feature code of the sample image.The Softmax multiple classifier is designed to classify the extracted features.During the learning process of the deep neural network model,the two-stage optimization strategy is introduced,which minimizes the structural risk of every layer and fine-tune the parameters of the whole model.Using some samples of the scene image database Caltech101 and that of the handwritten digits database MNIST as the training set and the others as the test set to perform the comparative experiment,results show that the time performance of the proposed model is better than that of one-layer only Convolutional Neural Network(CNN),and the classification accuracy of the trained model is higher than that of CNN,Stacked Autoencoder(SAE) comparative models.

Key words: computer vision, object classification, Histogram of Oriented Gradients(HOG) feature, Stacked Autoencoder(SAE), deep learning

中图分类号:

TP391

何希平,张琼华,刘波. 基于HOG的目标分类特征深度学习模型[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2016.12.031.

HE Xiping,ZHANG Qionghua,LIU Bo. Deep Learning Model of Object Classification Feature Based on HOG[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2016.12.031.

http://www.ecice06.com/CN/Y2016/V42/I12/176

参考文献

参考文献［1］Hinton G E,Osindero S,Teh Y.A Fast Learning Algorithm for Deep Belief Nets［J］.Neural Com-putation,2006,18(1):1527-1554. ［2］Hinton G E,Salakhutdinov R R.Reducing the Dimensionality of Data with Neural Networks［J］.Science,2006,313(5786):504-507. ［3］潘泓,金立左,夏思宇,等.基于多层次互补特征的通用目标检测模型［J］.电子与信息学报,2012,34(7):1531-1537. ［4］Socher R,Pennington J,Huang E,et al.Manning.Semi-supervised Recursive Autoencoders for Predicting Sentiment Distributions［C］//Proceedings of 2011 Con-ference on Empirical Methods in Natural Language Processing.Edinburgh,UK:［s.n. ］,2011:151-161. ［5］LeCun Y,Boser B,Denker J S,et al.Backpropagation Applied to Handwritten Zip Code Recognition［J］.Neural Computation,1989,1(4):541-551. ［6］LeCun Y L,Boser B J.Denker,et al.Handwritten Digit Recognition with a Backpropagation Neural Net-work［J］.Advances in Neural Information Processing Systems,1990,2(1):299-304. ［7］Lowe D G.Object Recognition from Local Scale-invariant Features［EB/OL］.(1999-09-21).http://dx.doi.org/10.1109/iccv. ［8］Lowe D G.Distinctive Image Features from Scale-invariant Keypoints［J］.International Journal of Computer Vision,2004,60(2):91-110. (下转第187页) (上接第180页) ［9］Sánchez J,Perronnin F,Mensink T,et al.Image Classification with the Fisher Vector:Theory and Practice［J］.International Journal of Computer Vision,2013,105(3):222-245. ［10］Sydorov V,Sakurada M,Lampert C H.Deep Fisher Kernels——End to End Learning of the Fisher Kernel GMM Parameters［C］//Proceedings of IEEE Con-ference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2014:1402-1409. ［11］Sánchez J,Redolfi J.Exponential Family Fisher Vector for Image Classification［J］.Pattern Recognition Letters,2015,59(7):26-32. ［12］Dalal N,Triggs B.Histograms of Oriented Gradients for Human Detection［C］//Proceedings of IEEE Conference on Computer Vision & Pattern Recognition.Washington D.C.,USA:IEEE Press,2005:886-893. ［13］Ma Ding,Pang Xin,Qu Mingzhi,et al.A Target Tracking Method Combining HOG and Shape Con-text［J］.Journal of Computational Information Systems,2014,10(8):7053-7060. ［14］王知音,禹龙,田生伟,等.基于栈式自编码的水体提取方法［J］.计算机应用,2015,35(9):2706-2709. ［15］Bengio Y,Lamblin P,Popovici P,et al.Greedy Layer-wise Training of Deep Networks［J］.Advances in Neural Information Processing Systems,2007,19(1):153-160. ［16］李海峰,李纯果.深度学习结构和算法比较分析［J］.河北大学学报(自然科学版),2012,32(5):538-544. ［17］Li Feifei,Fergus R,Perona P.Learning Generative Visual Models from Few Training Examples:An Incremental Bayesian Approach Tested on 101 Object Categories［J］.Computer Vision & Image Under-standing,2007,106(1):59-70. ［18］Yann Lecun,Cortes C,Christopher J C.The MNIST Ddatabase of Handwritten Digits［EB/OL］.(2013-10-30).http://yann.lecun.com/exdb/mnist/. 编辑索书志

[1]	江雨燕, 陶承凤, 李平. 数据增强和自适应自步学习的深度子空间聚类算法[J]. 计算机工程, 2023, 49(8): 96-103, 110.
[2]	李泽水, 冀俊忠, 杨翠翠. 基于边权重信息深度网络嵌入的PPIN功能模块检测[J]. 计算机工程, 2023, 49(8): 69-76.
[3]	王可铮, 徐玉芬, 周尚波. 结合对比感知损失和融合注意力的图像去雾模型[J]. 计算机工程, 2023, 49(8): 207-214.
[4]	刘俊豪, 王美林, 谢兴, 宋烨兴, 许莉花. 基于改进YOLOv5的皮革瑕疵检测算法[J]. 计算机工程, 2023, 49(8): 240-249.
[5]	闫兴亚, 匡娅茜, 白光睿, 李月. 基于深度学习的学生课堂行为识别方法[J]. 计算机工程, 2023, 49(7): 251-258.
[6]	李军侠, 王星驰, 殷梓, 石德硕. 边缘深度挖掘的弱监督显著性目标检测[J]. 计算机工程, 2023, 49(7): 169-178.
[7]	吴珊, 周凤. 基于改进SSD算法的小目标检测[J]. 计算机工程, 2023, 49(7): 179-188.
[8]	席建锐, 唐红梅, 梁春阳, 刘鑫. 基于改进隐函数的点云物体重建[J]. 计算机工程, 2023, 49(7): 214-222.
[9]	齐咏生, 杜晓旭, 朱俊峰, 高胜利, 刘利强. 基于增强型轻量深度网络的牧区牲畜高效检测[J]. 计算机工程, 2023, 49(7): 278-287.
[10]	谌雨章, 黄逸姿, 张钧涵. 基于多速率空洞卷积的多尺度水下小目标检测[J]. 计算机工程, 2023, 49(6): 257-264.
[11]	周逸云, 万新军, 胡伏原, 陈昊. 基于联合注意与特征关联的实例分割算法[J]. 计算机工程, 2023, 49(6): 217-226.
[12]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[13]	于海洋, 景鹏, 张文涛, 谢赛飞, 滑志华, 宋草原. 基于残差与注意力机制的道路裂缝检测U-Net改进模型[J]. 计算机工程, 2023, 49(6): 265-273.
[14]	王爱玲, 马文臻, 邹自明, 钟佳. 基于领域自适应的卫星工程参数异常检测[J]. 计算机工程, 2023, 49(5): 29-37,47.
[15]	宋羽凯, 谢江. 基于多任务学习的轻量级语音情感识别模型[J]. 计算机工程, 2023, 49(5): 122-128.

选择文件类型/文献管理软件名称

选择包含的内容

基于HOG的目标分类特征深度学习模型

Deep Learning Model of Object Classification Feature Based on HOG

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于HOG的目标分类特征深度学习模型

Deep Learning Model of Object Classification Feature Based on HOG

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价