基于RGB特征与深度特征融合的物体识别算法

doi:10.3969/j.issn.1000-3428.2016.05.032

计算机工程

基于RGB特征与深度特征融合的物体识别算法

卢良锋 ¹,谢志军¹,叶宏武 ²

(1.宁波大学信息学院,浙江宁波 315211; 2.浙江纺织服装职业技术学院,浙江宁波 315211)

收稿日期:2015-04-24 出版日期:2016-05-15 发布日期:2016-05-13
作者简介:卢良锋(1990-),男,硕士研究生,主研方向为深度学习、物体识别;谢志军、叶宏武,副教授。
基金资助:
国家自然科学基金资助项目(60902097);宁波市自然科学基金资助项目(2013A610044);浙江省重中之重学科开放基金资助项目“信息与通信工程”(xkx11422);宁波国家高新区海外人才创业基金资助项目。

Object Recognition Algorithm Based on RGB Feature and Depth Feature Fusing

LU Liangfeng ¹,XIE Zhijun ¹,YE Hongwu ²

(1.College of Information Science,Ningbo University,Ningbo,Zhejiang 315211,China; 2.Zhejiang Fashion Institute of Technology,Ningbo,Zhejiang 315211,China)

Received:2015-04-24 Online:2016-05-15 Published:2016-05-13

摘要/Abstract

摘要： RGB图像和深度图像的同时使用能有效提高物体识别的准确率。然而,已有研究仅将RGB图像和深度图像的特征进行简单的线性连接,没有根据RGB特征和深度特征的差异性进行特征提取和融合,充分发挥RGB-D图像的优势。为此,提出一种多模态稀疏自编码算法,在进行差异性特征提取的同时完成RGB特征和深度特征的有效融合。结合多模态稀疏自编码算法和空间金字塔最大池化算法,给出一个全新的深度学习模型。该模型能够提取有辨别力的特征并完成基于RGB-D图像的物体识别工作。在2个标准的RGB-D数据库上的实验结果表明,与基于RGB-D的物体识别算法相比,该算法能够有效融合RGB特征和深度特征,取得更高的识别准确率。

关键词: RGB特征与深度特征融合, 稀疏自编码, 多模态稀疏自编码, 空间金字塔最大池化, 深度学习, 物体识别

Abstract: Combining RGB image and depth image can effectively improve the RGB-D image recognition accuracy.However,prior researchers only do simple linear connect with the RGB image and depth features and do not extract and fuse the RGB and depth features according to their difference,and do not take full advantage of RGB-D image.This paper proposes a multi-model sparse auto encoder algorithm.Multi-model sparse auto encoder algorithm can extract and fuse the RGB and depth features at the same time.By combining multi-model sparse auto encoder algorithms with spatial pyramid max pooling algorithms,it proposes a new deep learning model.New depth learning model can extract recognizable features and complete the RGB-D based object recognition.It uses two standard RGB-D databases to verify the new proposed algorithm and deep learning model.Experimental results show that compared with previous RGB-D image based object recognition algorithm,the newly proposed algorithm effectively fuses the RGB and depth features and achieves higher recognition accuracy.

Key words: RGB feature and depth feature fusing, Sparse Auto Encoding(SAE), Multi-model Sparse Auto Encoding(MMSAE), spatial pyramid max pooling, deep learning, object recognition

中图分类号:

TP391.06

卢良锋,谢志军,叶宏武. 基于RGB特征与深度特征融合的物体识别算法[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2016.05.032.

LU Liangfeng,XIE Zhijun,YE Hongwu. Object Recognition Algorithm Based on RGB Feature and Depth Feature Fusing[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2016.05.032.

http://www.ecice06.com/CN/Y2016/V42/I5/186

参考文献

参考文献［1］Glorot X,Bordes A,Bengio Y.DomainAdaptation for Large-scale Sentiment Classification:A Deep Learning Approach［C］//Proceedings of the 28th International Conference on Machine Learning.Washington D.C.,USA:IMLS Press,2011:513-520. ［2］Ngiam J,Khosla A,Kim M,et al.Multimodal Deep Learning［C］//Proceedings of the 28th International Conference on Machine Learning.Washington D.C.,USA:IMLS Press,2011:689-696. ［3］孙志军,薛磊,许阳明,等.深度学习研究综述［J］.计算机应用研究,2012,29(8):2806-2810. ［4］Baccouche M,Mamalet F,Wolf C,et al.Sequential Deep Learning for Human Action Recognition［C］//Proceedings of the 2nd International Workshop on Human Behavior Understanding.Berlin,Germany:Springer,2011:29-39. ［5］李晓龙,张兆翔,王蕴红,等.深度学习在航拍场景分类中的应用［J］.计算机科学与探索,2014,8(3):305-312. ［6］Lai K,Bo Liefeng,Ren Xiaofeng,et al.A Large-scale Hierarchical Multi-view RGB-d Object Dataset［C］//Proceedings of IEEE International Conference on Robotics and Automation.Washington D.C.,USA:IEEE Press,2011:1817-1824. ［7］Blum M,Springenberg J T,Wulfing J,et al.A Learned Feature Descriptor for Object Recognition in RGB-d Data［C］//Proceedings of IEEE International Conference on Robotics and Automation.Washington D.C.,USA:IEEE Press,2012:1298-1303. ［8］Bo Liefeng,Ren Xiaofeng,Fox D.Unsupervised Feature Learning for RGB-D Based Object Recognition［C］//Proceedings of the 13th International Symposium on Experimental Robotics.Berlin,Germany:Springer,2013:387-402. ［9］Socher R,Huval B,Bath B P,et al.Convolutional-recursive Deep Learning for 3D Object Classification［M］.Nevada,USA:NIPS Foundation,2012. ［10］Browatzki B,Fischer J,Graf B,et al.Going into Depth:Evaluating 2D and 3D Cues for Object Classification on a New,Large-scale Object Dataset［C］//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2011:1189-1195. ［11］Deng Jun,Zhang Zixing,Marchi E,et al.Sparse Autoencoder-based Feature Transfer Learning for Speech Emotion Recognition［C］//Proceedings of Humaine Association Conference on Affective Computing and Intelligent Interaction.New York,USA:ACM Press,2013:511-516. ［12］Bo Liefeng,Ren Xiaofeng,Fox D.Hierarchical Matching Pursuit for Image Classification:Architecture and Fast Algorithms［M］.Granada,Spain:NIPS Foundation,2011. ［13］邱龙金,贺昌政．神经网络稳定性的交叉验证模型［J］.计算机工程与应用,2010,46(34):43-45. ［14］胡局新,张功杰.基于K折交叉验证的选择性集成分类算法［J］.科技通报,2013,29(12):115-117. ［15］张琳,陈燕,李桃迎,等.决策树分类算法研究［J］.计算机工程,2011,37(13):66-67,70. 编辑顾逸斐

[1]	江雨燕, 陶承凤, 李平. 数据增强和自适应自步学习的深度子空间聚类算法[J]. 计算机工程, 2023, 49(8): 96-103, 110.
[2]	李泽水, 冀俊忠, 杨翠翠. 基于边权重信息深度网络嵌入的PPIN功能模块检测[J]. 计算机工程, 2023, 49(8): 69-76.
[3]	王可铮, 徐玉芬, 周尚波. 结合对比感知损失和融合注意力的图像去雾模型[J]. 计算机工程, 2023, 49(8): 207-214.
[4]	刘俊豪, 王美林, 谢兴, 宋烨兴, 许莉花. 基于改进YOLOv5的皮革瑕疵检测算法[J]. 计算机工程, 2023, 49(8): 240-249.
[5]	闫兴亚, 匡娅茜, 白光睿, 李月. 基于深度学习的学生课堂行为识别方法[J]. 计算机工程, 2023, 49(7): 251-258.
[6]	李军侠, 王星驰, 殷梓, 石德硕. 边缘深度挖掘的弱监督显著性目标检测[J]. 计算机工程, 2023, 49(7): 169-178.
[7]	吴珊, 周凤. 基于改进SSD算法的小目标检测[J]. 计算机工程, 2023, 49(7): 179-188.
[8]	席建锐, 唐红梅, 梁春阳, 刘鑫. 基于改进隐函数的点云物体重建[J]. 计算机工程, 2023, 49(7): 214-222.
[9]	齐咏生, 杜晓旭, 朱俊峰, 高胜利, 刘利强. 基于增强型轻量深度网络的牧区牲畜高效检测[J]. 计算机工程, 2023, 49(7): 278-287.
[10]	谌雨章, 黄逸姿, 张钧涵. 基于多速率空洞卷积的多尺度水下小目标检测[J]. 计算机工程, 2023, 49(6): 257-264.
[11]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[12]	于海洋, 景鹏, 张文涛, 谢赛飞, 滑志华, 宋草原. 基于残差与注意力机制的道路裂缝检测U-Net改进模型[J]. 计算机工程, 2023, 49(6): 265-273.
[13]	王爱玲, 马文臻, 邹自明, 钟佳. 基于领域自适应的卫星工程参数异常检测[J]. 计算机工程, 2023, 49(5): 29-37,47.
[14]	李静雯, 赵奎. 基于改进PCFG算法的口令猜测方法[J]. 计算机工程, 2023, 49(5): 38-47.
[15]	李雪松, 张锲石, 宋呈群, 康宇航, 程俊. 自动驾驶场景下的轨迹预测技术综述[J]. 计算机工程, 2023, 49(5): 1-11.

选择文件类型/文献管理软件名称

选择包含的内容

基于RGB特征与深度特征融合的物体识别算法

Object Recognition Algorithm Based on RGB Feature and Depth Feature Fusing

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于RGB特征与深度特征融合的物体识别算法

Object Recognition Algorithm Based on RGB Feature and Depth Feature Fusing

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价