Edge-Preserving Image Restoration Based on Pixel-by-Pixel Reinforcement Learning

doi:10.19678/j.issn.1000-3428.0068490

Abstract:

High-intensity Gaussian noise tends to blur or destroy the details and structure of an image, resulting in the loss of edge information. To address this, an edge-preserving image restoration algorithm based on pixel-by-pixel reinforcement learning is proposed. First, a pixel-level agent is constructed for each pixel, and a side-window mean filter designed for edge regions is added to the action space. All pixel-level agents share the parameters of the advantage actor-critic algorithm, so the model can output the state-transition probabilities of all positions simultaneously and select an appropriate policy for each state transition, thereby restoring the image. Second, coordinate attention is incorporated into the shared feature-extraction network to focus on the global information of all pixel positions across feature channels while retaining position embedding information. Third, to alleviate the sparse-reward problem, an auxiliary loss based on graph Laplacian regularization is designed. It attends to the local smoothness of the image and penalizes locally non-smooth regions, which helps the pixel-level agents learn the correct policy more effectively and thus preserve edges. Experimental results show that the Peak Signal-to-Noise Ratio (PSNR) of the proposed algorithm reaches 32.97 dB on the Middlebury2005 dataset and 28.26 dB on the MNIST dataset, which is 0.23 dB and 0.75 dB higher, respectively, than that of the Pixel-RL algorithm. The number of parameters and the total training time are reduced by 44.9% and 18.2%, respectively, so the algorithm lowers model complexity while preserving edges.

Key words: image restoration, deep reinforcement learning, pixel-by-pixel reinforcement learning, coordinate attention, graph Laplacian, edge-preserving
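The abstract does not state the exact form of the graph Laplacian auxiliary loss. As a rough illustration of the general idea only, the sketch below assumes a 4-connected pixel graph with Gaussian edge weights and evaluates the standard graph Laplacian quadratic form, x^T L x = sum over edges of w_ij (x_i - x_j)^2. The function name `graph_laplacian_penalty`, the `guide` argument, and the default `sigma` are hypothetical choices for this example and are not taken from the paper.

```python
import math


def graph_laplacian_penalty(image, guide=None, sigma=0.1):
    """Return x^T L x = sum over 4-neighbour edges of w_ij * (x_i - x_j)^2.

    image: 2-D list of floats (pixel intensities, e.g. in [0, 1]).
    guide: image used to compute the edge weights; defaults to `image`.
    sigma: bandwidth of the Gaussian weight kernel (assumed value).
    """
    guide = image if guide is None else guide
    height, width = len(image), len(image[0])
    total = 0.0
    for r in range(height):
        for c in range(width):
            # Visit each undirected edge once: right and down neighbours.
            for dr, dc in ((0, 1), (1, 0)):
                nr, nc = r + dr, c + dc
                if nr >= height or nc >= width:
                    continue
                # Assumed weighting: similar guide pixels get a weight near 1,
                # pixels on opposite sides of a strong edge get a weight near 0.
                guide_diff = guide[r][c] - guide[nr][nc]
                weight = math.exp(-(guide_diff ** 2) / (2.0 * sigma ** 2))
                total += weight * (image[r][c] - image[nr][nc]) ** 2
    return total


# A sharp step edge and a flat region carrying low-amplitude noise.
step = [[0.0, 0.0, 1.0, 1.0] for _ in range(4)]
noisy = [[0.5 + 0.05 * ((r + c) % 2) for c in range(4)] for r in range(4)]
print(graph_laplacian_penalty(step), graph_laplacian_penalty(noisy))
```

Under these assumed weights, the clean step edge contributes almost nothing to the penalty because the weight across the edge is close to zero, whereas small fluctuations in an otherwise flat region are penalized. This is the sense in which such a term can discourage local non-smoothness without blurring edges.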

JIANG Min, CHEN Fei, CHENG Hang, WANG Meiqing. Edge-Preserving Image Restoration Based on Pixel-by-Pixel Reinforcement Learning[J]. Computer Engineering, 2024, 50(12): 224-232.

https://www.ecice06.com/CN/Y2024/V50/I12/224

Figures/Tables 14

Fig.1 Image restoration results after removing Gaussian white noise (σ=50)

Fig.2 Coordinate attention module

Fig.3 Overall architecture of the algorithm

Fig.4 Visualization results of image restoration on the Middlebury2005 dataset

Fig.5 Visualization of restoration results on the MNIST dataset

Fig.6 Comparison of visualized restoration results for the image "8"

Fig.7 Step-by-step recovery of noisy images on the Set12 dataset

Fig.8 Variation of the cumulative discounted total reward during training

Fig.9 Variation in the number of pixels with zero reward
