No-Reference Stereoscopic Image Quality Assessment Simulating Binocular Rivalry

doi:10.19678/j.issn.1000-3428.0EC0252123

Abstract

Abstract: The human visual cortex is hierarchically organized, and binocular fusion and binocular rivalry first occur in its low-level visual areas. However, current deep learning-based stereoscopic image quality assessment (SIQA) models generally estimate quality by fusing left- and right-view features at different levels of the network, and therefore simulate perception in the low-level visual areas inadequately. To address this issue, this paper proposes an SIQA method that simulates binocular rivalry. First, to simulate the binocular rivalry phenomenon, we build an unsupervised binocular image fusion model. The gradient magnitude responses of the left and right views are used to assess the degree of image degradation, which determines the fusion weights of the two views. Exploiting the ability of deep convolutional neural networks to capture prior knowledge of the input image, we build an unsupervised image generation network with an encoder-decoder architecture that takes the left and right views as learning targets and fuses them. Second, a ResNet50 model pre-trained on a large-scale image database extracts quality-aware features from the fused image, and a feature-quality mapping model based on support vector regression estimates the quality value of the stereoscopic image. Experimental results show that, on four classic stereoscopic image benchmark databases, the proposed method exceeds 0.96 on both the Pearson linear correlation coefficient (PLCC) and the Spearman rank order correlation coefficient (SROCC), and its root mean square error is better than that of the compared methods. This indicates that the proposed method based on unsupervised binocular image fusion effectively simulates binocular visual effects and thereby significantly improves the accuracy of stereoscopic image quality assessment.
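
The rivalry-weighting idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the weighting formula, so the sketch assumes, purely for illustration, that each view's weight is its share of the total mean gradient magnitude. A plain pixel-wise weighted average stands in for the paper's encoder-decoder generation network. The function names `mean_gradient_magnitude`, `rivalry_weights`, and `fuse` are hypothetical and do not come from the paper.

```python
import math


def mean_gradient_magnitude(img):
    """Mean gradient magnitude of a grayscale image given as a list of rows.

    Uses forward differences; the last row and column are skipped so that
    every difference stays inside the image.
    """
    h, w = len(img), len(img[0])
    total = 0.0
    for y in range(h - 1):
        for x in range(w - 1):
            gx = img[y][x + 1] - img[y][x]
            gy = img[y + 1][x] - img[y][x]
            total += math.hypot(gx, gy)
    return total / ((h - 1) * (w - 1))


def rivalry_weights(left, right, eps=1e-12):
    """Assumed weighting: each view's share of the total gradient magnitude.

    eps keeps the division defined when both views are perfectly flat.
    """
    gl = mean_gradient_magnitude(left)
    gr = mean_gradient_magnitude(right)
    wl = (gl + eps) / (gl + gr + 2 * eps)
    return wl, 1.0 - wl


def fuse(left, right):
    """Pixel-wise weighted average (a stand-in for the generation network)."""
    wl, wr = rivalry_weights(left, right)
    return [[wl * l + wr * r for l, r in zip(lrow, rrow)]
            for lrow, rrow in zip(left, right)]


if __name__ == "__main__":
    sharp = [[0, 10, 0], [10, 0, 10], [0, 10, 0]]    # high-contrast view
    blurred = [[4, 6, 4], [6, 4, 6], [4, 6, 4]]      # low-contrast view
    wl, wr = rivalry_weights(sharp, blurred)
    print(round(wl, 3), round(wr, 3))
```

In this toy example the higher-contrast view receives the larger weight. Whether the actual method maps gradient magnitude to weights in this direction, or with some other normalization, is not stated in the abstract.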

徐少平, 王子超, 唐祎玲, 熊思龙. 仿双目竞争的无参考立体图像质量评价[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0EC0252123.

XU Shaoping, WANG Zichao, TANG Yiling, XIONG Silong. The No-Reference Stereoscopic Image Quality Assessment Simulating Binocular Rivalry[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0EC0252123.

