Stereo Matching Network Based on Disparity Optimization

Computer Engineering, 2022, Vol. 48, Issue (3): 220-228. doi: 10.19678/j.issn.1000-3428.0060806

LIU Jianguo^1,2,3,4, JI Guo^1,2,3,4, YAN Fuwu^1,2,3,4, SHEN Jianhong^5, SUN Yunfei^5

1. Foshan Xianhu Laboratory of the Advanced Energy Science and Technology Guangdong Laboratory, Foshan, Guangdong 528200, China;
2. Hubei Key Laboratory of Advanced Technology for Automotive Components, Wuhan University of Technology, Wuhan 430070, China;
3. Hubei Collaborative Innovation Center for Automotive Components Technology, Wuhan 430070, China;
4. Hubei Research Center for New Energy & Intelligent Connected Vehicle, Wuhan 430070, China;
5. Ningbo Huade Automobile Parts Co., Ltd., Ningbo, Zhejiang 315000, China

Received: 2021-02-03 Revised: 2021-03-13 Published: 2021-03-18
About the authors: LIU Jianguo (born 1971), male, associate professor, Ph.D., whose main research interests are machine vision and intelligent driving; JI Guo, master's degree; YAN Fuwu, professor, Ph.D.; SHEN Jianhong and SUN Yunfei, bachelor's degree.
Funding:
National Natural Science Foundation of China (51975434); Open Fund of the Foshan Xianhu Laboratory of the Advanced Energy Science and Technology Guangdong Laboratory (XHD2020-003).

Abstract

Abstract: Existing stereo matching algorithms usually extract features with deep convolutional neural networks. This yields finer results on foreground objects but poor matching on small background objects and edge regions. To improve disparity estimation in these areas, a stereo matching network based on disparity optimization, named Coarse To Fine Net (CTFNet), is proposed. The network extracts shallow and deep features separately and builds a global sparse cost volume from the deep features to predict an initial disparity map. A local dense cost volume is then built from the initial disparity map and the shallow features for disparity optimization, which refines the probability distribution in the neighborhood of each predicted disparity value and improves matching accuracy in regions with indistinct features. In addition, a new probability distribution loss function is introduced to supervise the disparity probability distribution computed by the softmax function so that it forms a unimodal distribution near the true disparity value, which improves the robustness of the algorithm. Experimental results show that the mismatching rate of the network is 0.768% on the SceneFlow dataset and 1.485% on the KITTI dataset, and that its error rate on the KITTI evaluation website is only 2.20%. Compared with PSMNet, the network improves both accuracy and speed.

Key words: stereo matching, disparity optimization, shallow feature, matching cost volume, loss function
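
The abstract does not give formulas, so the coarse-to-fine idea and the unimodal distribution loss can only be sketched here under assumptions. The NumPy sketch below assumes soft-argmin regression over softmax probabilities (common in cost-volume networks), a Laplace-shaped target distribution with a hypothetical width `sigma`, cross-entropy as the distribution loss, and hypothetical `radius` and `step` values for the local window. All function names are illustrative and are not taken from the paper.

```python
import numpy as np

def _as_volume(candidates):
    # Accept (D,) global candidates or (D, H, W) per-pixel candidates.
    cand = np.asarray(candidates, dtype=float)
    return cand[:, None, None] if cand.ndim == 1 else cand

def softmax(x, axis=0):
    # Numerically stable softmax along the disparity axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def regress_disparity(cost, candidates):
    """Soft-argmin: expected disparity under softmax(-cost).

    cost:       (D, H, W) matching cost, lower means a better match.
    candidates: disparity value of each cost slice, (D,) or (D, H, W).
    """
    prob = softmax(-cost, axis=0)
    disp = (_as_volume(candidates) * prob).sum(axis=0)
    return disp, prob

def local_candidates(init_disp, radius=2.0, step=0.5):
    # Dense per-pixel candidates in a small window around the initial disparity.
    n = int(round(2 * radius / step)) + 1
    offsets = np.linspace(-radius, radius, n)
    return init_disp[None, :, :] + offsets[:, None, None]

def unimodal_target(gt_disp, candidates, sigma=1.0):
    # Assumed target: a single Laplace-shaped peak centered on the true disparity.
    dist = np.abs(_as_volume(candidates) - gt_disp[None, :, :])
    return softmax(-dist / sigma, axis=0)

def distribution_loss(prob, target, eps=1e-12):
    # Cross-entropy between target and predicted distributions, averaged over pixels.
    return float(-(target * np.log(prob + eps)).sum(axis=0).mean())
```

In this sketch the coarse stage would call `regress_disparity` with a few widely spaced candidates, and the fine stage would call it again with the output of `local_candidates`, so the second softmax only has to resolve a narrow range around the initial estimate.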

CLC number: TP391

LIU Jianguo, JI Guo, YAN Fuwu, SHEN Jianhong, SUN Yunfei. Stereo Matching Network Based on Disparity Optimization[J]. Computer Engineering, 2022, 48(3): 220-228.

https://www.ecice06.com/CN/Y2022/V48/I3/220

Figures/Tables: 13




