作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (3): 243-248. doi: 10.19678/j.issn.1000-3428.0056715

• 图形图像处理 • 上一篇    下一篇

基于多尺度和多层级特征融合的立体匹配方法

王金鹤, 车志龙, 张楠, 孟凡云, 苏翠丽, 谭浩   

  1. 青岛理工大学 信息与控制工程学院, 山东 青岛 266000
  • 收稿日期:2019-11-26 修回日期:2020-03-08 发布日期:2020-06-18
  • 作者简介:王金鹤(1963-),男,教授、博士,主研方向为模式识别与智能系统;车志龙,硕士研究生;张楠(通信作者),讲师、硕士;孟凡云,讲师、博士;苏翠丽、谭浩,硕士研究生。
  • 基金资助:
    国家自然科学基金(31271077);山东省重点研发项目(2019GGX104089);山东省高等学校科技计划项目(J17KA061)。

Stereo Matching Method Based on Multi-Scale and Multi-Level Features Fusion

WANG Jinhe, CHE Zhilong, ZHANG Nan, MENG Fanyun, SU Cuili, TAN Hao   

  1. School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Shandong 266000, China
  • Received:2019-11-26 Revised:2020-03-08 Published:2020-06-18

摘要: 基于卷积神经网络的立体匹配方法未充分利用图像中各个层级的特征图信息,造成对图像在不适定区域的特征提取能力较差。提出一种融合多尺度与多层级特征的立体匹配方法。通过在双塔结构卷积神经网络模型的前端设计一个池化金字塔层,提取图像的多尺度低层结构特征。在该网络模型的后端融合最后三层网络的高级语义特征来提取图像特征,并对图像特征进行相似性度量后输出视差图。在KITTI 2015数据集上的实验结果表明,与LUO和Anita方法相比,该方法的像素误差精度分别由14.65%、8.30%降至8.02%,且可得到细节信息更好的视差图。

关键词: 立体匹配, 卷积神经网络, 特征图信息, 多层级特征融合, 视差

Abstract: The stereo matching method based on Convolutional Neural Network(CNN) does not make full use of the feature map information of each level in the image,resulting in the poor feature extraction performance in the ill posed region in the image.This paper proposes a stereo matching method based on multi-scale and multi-level features.A pooled pyramid layer is designed at the front end of the CNN model with double tower structure to extract the multi-scale low-level structural features of the image.In the back end of the network model,the high-level semantic features of the last three layers of network are fused to extract image features,and the disparity map is output after similarity measurement of image features.The experimental results on KITTI 2015 dataset show that compared with the LUO and Anita methods,the proposed method reduces the pixel error accuracy from 14.65% and 8.30% to 8.02%,and can obtain a disparity map with better detail information.

Key words: stereo matching, Convolutional Neural Network(CNN), feature map information, multi-level features fusion, disparity

中图分类号: