作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (3): 220-228. doi: 10.19678/j.issn.1000-3428.0060806

• 图形图像处理 • 上一篇    下一篇

基于视差优化的立体匹配网络

刘建国1,2,3,4, 纪郭1,2,3,4, 颜伏伍1,2,3,4, 沈建宏5, 孙云飞5   

  1. 1. 先进能源科学与技术广东省实验室佛山分中心(佛山仙湖实验室), 广东 佛山 528200;
    2. 武汉理工大学 现代汽车零部件技术湖北省重点实验室, 武汉 430070;
    3. 汽车零部件技术湖北省协同创新中心, 武汉 430070;
    4. 湖北省新能源与智能网联车工程技术研究中心, 武汉 430070;
    5. 宁波华德汽车零部件有限公司, 浙江宁波 315000
  • 收稿日期:2021-02-03 修回日期:2021-03-13 发布日期:2021-03-18
  • 作者简介:刘建国(1971-),男,副教授、博士,主研方向为机器视觉、智能驾驶;纪郭,硕士;颜伏伍,教授、博士;沈建宏、孙云飞,学士。
  • 基金资助:
    国家自然科学基金(51975434);先进能源科学与技术广东省实验室佛山分中心(佛山仙湖实验室)开放基金(XHD2020-003)。

Stereo Matching Network Based on Disparity Optimization

LIU Jianguo1,2,3,4, JI Guo1,2,3,4, YAN Fuwu1,2,3,4, SHEN Jianhong5, SUN Yunfei5   

  1. 1. Foshan Xianhu Laboratory of the Advanced Energy Science and Technology Guangdong Laboratory, Foshan, Guangdong 528200, China;
    2. Hubei Key Laboratory of Advanced Technology for Automotive Components, Wuhan University of Technology, Wuhan 430070, China;
    3. Hubei Collaborative Innovation Center for Automotive Components Technology, Wuhan 430070, China;
    4. Hubei Research Center for New Energy & Intelligent Connected Vehicle, Wuhan 430070, China;
    5. Ningbo Huade Automobile Parts Co., Ltd., Ningbo, Zhejiang 315000, China
  • Received:2021-02-03 Revised:2021-03-13 Published:2021-03-18

摘要: 现有的立体匹配算法通常采用深层卷积神经网络提取特征,对前景物体的检测更加精细,但对背景中的小物体及边缘区域匹配效果较差。为提高视差估计质量,构建一个基于视差优化的立体匹配网络CTFNet。分别提取浅层与深层特征,并基于深层特征构建全局稀疏代价卷,从而预测初始视差图。在预测的初始视差图和浅层特征的基础上构建局部稠密代价卷并进行视差优化,以细化预测视差值邻域的概率分布,提高特征不明显区域的匹配精度。此外,引入新的概率分布损失函数,监督softmax函数计算的视差值概率分布在真实视差值附近成单峰分布,提高算法的鲁棒性。实验结果表明,该网络在SceneFlow和KITTI数据集上的误匹配率分别为0.768%和1.485%,在KITTI测评网站上的误差率仅为2.20%,与PSMNet网络相比,精度和速度均得到一定提升。

关键词: 立体匹配, 视差优化, 浅层特征, 匹配代价卷, 损失函数

Abstract: Existing stereo matching algorithms usually use deep convolutional networks to extract features, and can improve the accuracy of foreground object detection, but display poor matching results for small objects and boundary areas in the background.In order to improve the quality of disparity estimation in these areas, a stereo matching network named Coarse To Fine Net(CTFNet) is proposed based on disparity optimization.The network extracts shallow and deep features separately and a global sparse cost volume is constructed based on the deep features to predict the initial disparity map.Then a local dense cost volume is constructed based on the predicted initial disparity map and shallow features, which optimizes the disparity and refines the probability distribution of the neighborhood of predicted disparity value to improve the matching accuracy of areas with less obvious features.At the same time, a new loss function for probability distribution is introduced to supervise the probability distribution calculated by the softmax function in a unimodal distribution near the true disparity value and improve the robustness of the algorithm.The experimental results show that the mismatching rate of the proposed network is 0.768% on the SceneFlow dataset and 1.485% on the KITTI data set, and its error rate on the KITTI evaluation website is only 2.20%.Compared with the PSMNet network, the proposed algorithm displays an improvement in both accuracy and speed.

Key words: stereo matching, disparity optimization, shallow feature, matching cost volume, loss function

中图分类号: