作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (6): 217-224. doi: 10.19678/j.issn.1000-3428.0058091

• 图形图像处理 • 上一篇    下一篇

基于深度感知特征提取的室内场景理解

陈苏婷, 张良臣   

  1. 南京信息工程大学 江苏省气象探测与信息处理重点实验室, 南京 210044
  • 收稿日期:2020-04-16 修回日期:2020-05-25 发布日期:2021-06-11
  • 作者简介:陈苏婷(1980-),女,教授、博士,主研方向为图像处理、机器学习;张良臣,硕士研究生。

Indoor Scene Understanding Based on Depth-Aware Feature Extraction

CHEN Suting, ZHANG Liangchen   

  1. Jiangsu Key Laboratory of Meteorological Detection and Information Processing, Nanjing University of Information Science and Technology, Nanjing 210044, China
  • Received:2020-04-16 Revised:2020-05-25 Published:2021-06-11
  • Contact: 国家自然科学基金(61906097)。 E-mail:sutingchen@nuist.edu.cn

摘要: 从深度图RGB-D域中联合学习RGB图像特征与3D几何信息有利于室内场景语义分割,然而传统分割方法通常需要精确的深度图作为输入,严重限制了其应用范围。提出一种新的室内场景理解网络框架,建立基于语义特征与深度特征提取网络的联合学习网络模型提取深度感知特征,通过几何信息指导的深度特征传输模块与金字塔特征融合模块将学习到的深度特征、多尺度空间信息与语义特征相结合,生成具有更强表达能力的特征表示,实现更准确的室内场景语义分割。实验结果表明,联合学习网络模型在NYU-Dv2与SUN RGBD数据集上分别取得了69.5%与68.4%的平均分割准确度,相比传统分割方法具有更好的室内场景语义分割性能及更强的适用性。

关键词: 语义特征, 深度特征, 特征融合, 室内场景理解, 几何信息, 深度感知特征

Abstract: The semantic segmentation for indoor scenes can be improved by the joint learning of RGB image features and 3D geometric information from RGB-D domain.However, the traditional segmentation methods require precise depth maps as the inputs, which seriously limits their application.To address the problem, this paper proposes a new network framework for indoor scene understanding.Based on the network for semantic feature and depth feature extraction, a joint learning network model is built to extract the depth-aware features.Additionally, the proposed model effectively combines learned depth features, multi-scale spatial information and the semantic features to generate more representative features, implementing more accurate semantic segmentation for indoor scenes.Experimental results show that the average segmentation accuracy of the proposed joint learning network model reaches 69.5% on NYU-Dv2 and 68.4% on SUN RGBD.The model provides better semantic segmentation performance and applicability for indoor scenes than traditional segmentation methods.

Key words: semantic feature, depth feature, feature fusion, indoor scene understanding, geometric information, depth-aware feature

中图分类号: