室内场景的布局估计与目标区域提取算法

doi:10.19678/j.issn.1000-3428.0047659

计算机工程 ›› 2018, Vol. 44 ›› Issue (8): 257-262,267. doi: 10.19678/j.issn.1000-3428.0047659

室内场景的布局估计与目标区域提取算法

吴晓秋^a,霍智勇 ^a,b

南京邮电大学 a.通信与信息工程学院; b.江苏省图像处理与图像通信重点实验室,南京 210003

收稿日期:2017-06-21 出版日期:2018-08-15 发布日期:2018-08-15
作者简介:吴晓秋(1993—),女,硕士研究生,主研方向为图像处理、多媒体通信;霍智勇,教授。
基金资助:
国家自然科学基金(61471201,61501260);江苏省高校自然科学研究重点项目(13KJA510004);江苏省自然科学基金青年基金(BK20130867);江苏省“六大人才高峰”项目(2014-DZXX-008)。

Layout Estimation and Object Region Extraction Algorithm for Indoor Scene

WU Xiaoqiu^a,HUO Zhiyong^a,b

a.College of Telecommunications and Information Engineering; b.Jiangsu Provincial Key Lab of Image Processing and Image Communication,Nanjing University of Posts and Telecommunications,Nanjing 210003,China

Received:2017-06-21 Online:2018-08-15 Published:2018-08-15

摘要/Abstract

摘要：

现有的目标提取方法在应用于复杂的室内场景图像时,容易出现小尺寸物体与平面区域中物体被忽视,以及因遮挡造成大物体提取错误等问题。为此,提出一种针对室内RGB-D场景的无监督布局估计与目标区域提取算法。利用3D点云进行平面分割与分类以完成布局估计,采用 2种图像分割方法对RGB-D图像做过分割处理,并利用4种相似度衡量方式进行层次分组。在此基础上,根据布局估计的结果,对不同类别的区域采取不同的边界框匹配策略。实验结果表明,该方法无需预训练即可改善目标区域提取效果,在产生较少目标候选区的情况下提高边界框召回率,加快计算速度。

关键词: 深度信息, 特征融合, 室内场景, 布局估计, 图像分割, 目标提取

Abstract:

When applying most existing object proposal methods on complex indoor scenes,the results show that there are some problems such as ignoring the small size object and objects in planar regions and detection inaccuracies of big objects caused by occlusion.Aiming at above these problems,this paper proposes a layout estimation and object region extraction algorithm for indoor RGB-D scenes.Firstly,it uses the 3D point cloud for plane segmentation and classification.Secondly,it adopts two segmentation methods using RGB-D data for obtaining crude object segments and then utilizes four similarity measures for hierarchical grouping.Finally,based on the results of layout estimation,it takes diversification strategies to fit bounding boxes for different regions.Experimental result shows that the proposed algorithm can improve extraction efficiency obviously and improve bounding box proposal recall score with fewer object candidates.In addition,it does not need pre-training and has fast calculation speed.

Key words: depth information, feature fusing, indoor scene, layout estimation, image segmentation, object extraction

中图分类号:

TP391.4

吴晓秋,霍智勇. 室内场景的布局估计与目标区域提取算法[J]. 计算机工程, 2018, 44(8): 257-262,267.

WU Xiaoqiu,HUO Zhiyong. Layout Estimation and Object Region Extraction Algorithm for Indoor Scene[J]. Computer Engineering, 2018, 44(8): 257-262,267.

https://www.ecice06.com/CN/Y2018/V44/I8/257

参考文献

［1］VIOLA P,JONES M.Robust real-time face detection［J］.International Journal of Computer Vision,2004,57(2):137-154.br/ ［2］FELZENSZWALB P F,GIRSHICK R B,MCALLESTER D,et al.Object detection with discriminatively trained part-based models［J］.IEEE Transactions on Software Engineering,2010,32(9):1627-1645.br/ ［3］DALAL N,TRIGGS B.Histograms of oriented gradients for human detection［C］//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2005:886-893.br/ ［4］SMOLA A J,SCHLKOPF B.A tutorial on support vector regression［J］.Statistics and Computing,2004,14(3):199-222.br/ ［5］曾接贤,程潇.结合单双行人DPM模型的交通场景行人检测［J］.电子学报,2016,44(11):2668-2675.br/ ［6］FELZENSZWALB P F,HUTTENLOCHER D P.Efficient graph-based image segmentation［J］.International Journal of Computer Vision,2004,59(2):167-181.br/ ［7］张灵,李静立,陈思平,等.异常宫颈细胞核的自适应局部分割［J］.中国图象图形学报,2013,18(10):1329-1335.br/ ［8］葛婷,牟宁,李黎.基于softmax回归与图割法的脑肿瘤分割算法［J］.电子学报,2017,45(3):644-649.br/ ［9］ARBELAEZ P,MAIRE M,FOWLKES C,et al.Contour detection and hierarchical image segmentation［J］.IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(5):898-916.br/ ［10］SILBERMAN N,HOIEM D,KOHLI P,et al.Indoor segmentation and support inference from RGBD images［C］//Proceedings of European Conference on Computer Vision.Berlin,Germany:Springer,2012:746-760.br/ ［11］CARREIRA J,SMINCHISESCU C.CPMC:automatic object segmentation using constrained parametric min-cuts［J］.IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,34(7):1312-1328.br/ ［12］GUPTA S,ARBELAEZ P,MALIK J.Perceptual organization and recognition of indoor scenes from RGB-D images［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2013:564-571.br/ ［13］LIN D,FIDLER S,URTASUN R.Holistic scene understanding for 3D object detection with RGB-D cameras［C］//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2013:1417-1424.br/ (下转第267页) (上接第262页) ［14］ARBELAEZ P,PONTTUSET J,BARRON J,et al.Multiscale combinatorial grouping［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2014:328-335.br/ ［15］GUPTA S,GIRSHICK R,ARBELEZ P,et al.Learning rich features from RGB-D images for object detection and segmentation［C］//Proceedings of ECCV’14.Berlin,Germany:Springer,2014:345-360.br/ ［16］GEIGER A,WANG C.Joint 3D object and layout inference from a single RGB-D image［C］//Proceedings of the 37th Conference on Pattern Recognition.Berlin,Germany:Springer,2015:183-195.br/ ［17］DENG Z,TODOROVIC S,LATECKI L J.Unsupervised object region proposals for RGB-D indoor scenes［J］.Computer Vision and Image Understanding,2016,21(5):79-87.br/ ［18］LEVIN A,LISCHINSKI D,WEISS Y.Colorization using optimization［J］.ACM Transactions on Graphics,2004,23(3):686-691.br/

[1]	郭敏, 张熙涵, 李阳. 融合注意力的教师互一致性半监督医学图像分割[J]. 计算机工程, 2024, 50(9): 313-323.
[2]	李俊仪, 李向阳, 龙朝勋, 李海燕, 李红松, 余鹏飞. 基于多级区域选择与跨层特征融合的野生菌分类[J]. 计算机工程, 2024, 50(9): 179-188.
[3]	赵婉秋, 张俊虎, 李海涛. 用于建筑物分割的平行结构特征融合网络[J]. 计算机工程, 2024, 50(8): 239-248.
[4]	赵宏, 王枭. 基于Swin-Transformer的黑色素瘤图像病灶分割研究[J]. 计算机工程, 2024, 50(8): 249-258.
[5]	王富平, 刘鸿玮, 张锲石, 段冠庄. 基于深度特征抑制的遮挡人脸识别网络[J]. 计算机工程, 2024, 50(8): 259-269.
[6]	闵莉, 董冰洁, 安冬. 基于多注意力机制与跨特征融合的语义分割算法[J]. 计算机工程, 2024, 50(8): 282-289.
[7]	高爽, 史轶伦, 徐巧枝, 于磊. 基于对比学习的非对称编解码结构的心脏MRI分割研究[J]. 计算机工程, 2024, 50(8): 290-300.
[8]	陈宇航, 杨勇, 先木斯亚·买买提明, 帕力旦·吐尔逊, 樊小超, 任鸽, 刁宇峰. 基于主题感知和语义增强的作文自动评分方法[J]. 计算机工程, 2024, 50(8): 363-371.
[9]	张华青, 夏张涛, 陆晓庆, 童基均. 基于字形特征的血管外科命名实体识别[J]. 计算机工程, 2024, 50(8): 13-21.
[10]	李华昱, 张智康, 闫阳, 岳阳. 基于知识图谱增强的领域多模态实体识别[J]. 计算机工程, 2024, 50(8): 31-39.
[11]	刘锁兰, 王炎, 王洪元, 朱生升. 基于多流语义图卷积网络的人体行为识别[J]. 计算机工程, 2024, 50(8): 64-74.
[12]	王晋涛, 秦昂, 张元, 陈一飞, 王廷凤, 谢承霖, 邹刚. 基于注意力增强与特征融合的中文医学实体识别[J]. 计算机工程, 2024, 50(7): 324-332.
[13]	谭巨全, 王然. 特征融合下田径录像3D人体动作DTW捕捉算法[J]. 计算机工程, 2024, 50(7): 71-78.
[14]	张溢文, 蔡满春, 陈咏豪, 朱懿, 姚利峰. 融合空间特征的多尺度深度伪造检测方法[J]. 计算机工程, 2024, 50(7): 240-250.
[15]	李亚康, 陈刚. 小角中子散射物理模型自动化筛选[J]. 计算机工程, 2024, 50(6): 56-64.

选择文件类型/文献管理软件名称

选择包含的内容

室内场景的布局估计与目标区域提取算法

Layout Estimation and Object Region Extraction Algorithm for Indoor Scene

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

室内场景的布局估计与目标区域提取算法

Layout Estimation and Object Region Extraction Algorithm for Indoor Scene

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价