基于多粒度特征引导的简牍文字图像修复网络

doi:10.19678/j.issn.1000-3428.0253501

摘要/Abstract

摘要： 简牍文字图像中存在的结构和纹理语义混淆、退化类型复杂、文字像素与背景噪音对比度低等问题，现有图像修复方法在处理具有复杂退化场景的简牍文字图像时普遍存在结构与纹理语义耦合、难以区分建模不同退化程度像素以及掩膜感知能力不足等问题，导致文字结构破坏、修复不稳定及伪影现象频发。本文提出了一种基于多粒度特征引导的简牍文字图像修复——AmdmaNet。首先，在纹理修复网络和结构修复网络中分别重建受结构边缘约束的纹理特征和基于相对全变分量（RTV）的结构特征，避免结构和纹理语义混淆的问题；随后，在图像细化阶段引入多尺度动态范围分布图自注意力机制（Mdma），对不同退化程度的像素进行分类处理，有效缓解修复过度或修复不充分的问题；进一步，采用自适应掩膜感知像素洗牌下采样方法（Ampd），通过受损像素对周围完整区域自适应地分配权重，增强模型对破损区域的置信度，再根据破损区域的位置信息引导图像下采样，确保掩码位置不发生偏移，显著减少了伪影、模糊及马赛克等现象。最后，在自建的简牍文字图像数据集上进行实验验证，实验结果表明，所提出方法在主观视觉感受和客观评价指标上均优于当前主流图像修复算法，尤其在处理文字笔画断裂、背景噪声干扰等复杂场景时表现出更强的鲁棒性。

Abstract: Existing methods for inpainting bamboo slip text images struggle with structural-texture confusion, complex degradation, and low text-background contrast, often causing structural damage, instability, and artifacts. This paper proposes AmdmaNet, a multi-granularity feature-guided inpainting network. It separately reconstructs texture and structural features to avoid semantic confusion. A Multi-scale Dynamic-range Map Attention (Mdma) mechanism classifies pixels by degradation level, preventing over/under-inpainting. An Adaptive Mask-aware Pixel-shuffle Downsampling (Ampd) method weights damaged pixels using surrounding information and guides downsampling to prevent mask shift, reducing artifacts, blur, and mosaics. Experiments on a custom dataset show our method outperforms state-of-the-art approaches in both visual quality and metrics, demonstrating superior robustness for complex cases like broken strokes and background noise.

王铁君, 鲁子怡, 胡晓燕, 康梦洋, 王文昊, 王恺彦, 徐成杰. 基于多粒度特征引导的简牍文字图像修复网络[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0253501.

Tiejun Wang, Ziyi Lu, Xiaoyan Hu, Mengyang Kang, Wenhao Wang, Kaiyan Wang, Chengjie Xu. A Multi-Granularity Feature Fusion Network for the Restoration of Jiandu Text Images[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0253501.

参考文献

[1] 屈路明. 武汉大学历史学院2008年度学术动态[J]. 历史教学问题, 2009(6): 104-106. Qu, L.(2009).Academic Activities of Wuhan University’s History College in 2008.Historical Research Issues, 6, 104–106.
[2] LIU G, REDA F A, SHIH K J, et al. Image Inpainting for Irregular Holes Using Partial Convolutions[C/OL]//Proceedings of the European Conference on Computer Vision (ECCV). 2018: 85-100[2025-12-25]. https://openaccess.thecvf.com/content_ECCV_2018/html/Guilin_Liu_Image_Inpainting_for_ECCV_2018_paper.html.
[3] YAN Z, LI X, LI M, et al. Shift-Net: Image Inpainting via Deep Feature Rearrangement[C/OL]//Proceedings of the European Conference on Computer Vision (ECCV). 2018: 1-17[2025-12-25].https://openaccess.thecvf.com/content_ECCV_2018/html/Zhaoyi_Yan_Shift-Net_Image_Inpainting_ECCV_2018_paper.html.
[4] NAZERI K, NG E, JOSEPH T, et al. EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning[A/OL]. arXiv, 2019[2025-12-25]. http://arxiv.org/abs/1901.00212. DOI:10.48550/arXiv.1901.00212.
[5] RARES A, REINDERS M J T, BIEMOND J. Edge-based image restoration[J/OL]. IEEE Transactions on Image Processing, 2005, 14(10): 1454-1468. DOI:10.1109/TIP.2005.854466.
[6] GUO X, YANG H, HUANG D. Image Inpainting via Conditional Texture and Structure Dual Generation[C/OL]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 14134-14143[2025-12-25].https://openaccess.thecvf.com/content/ICCV2021/html/Guo_Image_Inpainting_via_Conditional_Texture_and_Structure_Dual_Generation_ICCV_2021_paper.html.
[7] LI Z, ZHANG Y, DU Y, et al. STNet: Structure and texture-guided network for image inpainting[J/OL]. Pattern Recognition, 2024, 156: 110786. DOI:10.1016/j.patcog.2024.110786.
[8] CHEN W, YUE H, WANG J, et al. An improved edge detection algorithm for depth map inpainting[J/OL]. Optics and Lasers in Engineering, 2014, 55: 69-77. DOI:10.1016/j.optlaseng.2013.10.025.
[9] WEI Z, MIN W, WANG Q, et al. ECNFP: Edge-constrained network using a feature pyramid for image inpainting[J/OL]. Expert Systems with Applications, 2022, 207: 118070. DOI:10.1016/j.eswa.2022.118070.
[10] CAO C, FU Y. Learning a Sketch Tensor Space for Image Inpainting of Man-Made Scenes[C/OL]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 14509-14518[2025-12-25]. https://openaccess.thecvf.com/content/ICCV2021/html/Cao_Learning_a_Sketch_Tensor_Space_for_Image_Inpainting_of_Man-Made_ICCV_2021_paper.html?ref=https://githubhelp.com.
[11] DONG Q, CAO C, FU Y. Incremental Transformer Structure Enhanced Image Inpainting With Masking Positional Encoding[C/OL]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022:11358-11368[2025-12-25].https://openaccess.thecvf.com/content/CVPR2022/html/Dong_Incremental_Transformer_Structure_Enhanced_Image_Inpainting_With_Masking_Positional_Encoding_CVPR_2022_paper.html.
[12] CAO C, DONG Q, FU Y. ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors[J/OL]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(10): 12667-12684. DOI:10.1109/TPAMI.2023.3280222.
[13] REN Y, YU X, ZHANG R, et al. StructureFlow: Image Inpainting via Structure-Aware Appearance Flow[C/OL]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 181-190[2025-12-25].https://openaccess.thecvf.com/content_ICCV_2019/html/Ren_StructureFlow_Image_Inpainting_via_Structure-Aware_Appearance_Flow_ICCV_2019_paper.html.
[14] LIU H, JIANG B, SONG Y, et al. Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations[A/OL]. arXiv, 2020[2025-12-25]. http://arxiv.org/abs/2007.06929.DOI:10.48550/arXiv.2007.06929.
[15] DENG Y, HUI S, ZHOU S, et al. Context Adaptive Network for Image Inpainting[J/OL]. IEEE Transactions on Image Processing, 2023, 32: 6332-6345. DOI:10.1109/TIP.2023.3298560.
[16] ZHU M, HE D, LI X, et al. Image Inpainting by End-to-End Cascaded Refinement With Mask Awareness[J/OL]. IEEE Transactions on Image Processing, 2021, 30: 4855-4866. DOI:10.1109/TIP.2021.3076310.
[17] ISOGAWA M, MIKAMI D, IWAI D, et al. Mask Optimization for Image Inpainting[J/OL]. IEEE Access, 2018, 6: 69728-69741. DOI:10.1109/ACCESS.2018.2877401.
[18] YU J, LIN Z, YANG J, et al. Free-Form Image Inpainting With Gated Convolution[C/OL]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019:4471-4480[2025-12-25].https://openaccess.thecvf.com/content_ICCV_2019/html/Yu_Free-Form_Image_Inpainting_With_Gated_Convolution_ICCV_2019_paper.html.
[19] CHEN S, ATAPOUR-ABARGHOUEI A, SHUM H P H. HINT: High-Quality INpainting Transformer With Mask-Aware Encoding and Enhanced Attention[J/OL]. IEEE Transactions on Multimedia, 2024, 26: 7649-7660. DOI:10.1109/TMM.2024.3369897.
[20] MIAO W, WANG L, LU H, et al. ITrans: generative image inpainting with transformers[J/OL]. Multimedia Systems, 2024, 30(1): 21. DOI:10.1007/s00530-023-01211-w.
[21] NADERI M, GIVKASHI M, KARIMI N, et al. SFI-Swin: Symmetric Face Inpainting with Swin Transformer by Distinctly Learning Face Components Distributions[A/OL]. arXiv, 2023[2025-12-25]. http://arxiv.org/abs/2301.03130. DOI:10.48550/arXiv.2301.03130.
[22]Xing C, Ren Z. Binary inscription character inpainting based on improved context encoders[J]. IEEE Access, 2023, 11: 55834-55843.
[23] Zhu S, Fang P, Zhu C, et al. Text image inpainting via global structure-guided diffusion models[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2024, 38(7): 7775-7783.
[24] Liu Y, Zhang E, Lin G, et al. A structural information-guided cross-modal method for damaged inscription inpainting via vision-language models[J]. npj Heritage Science, 2025, 13(1): 485.
[25] 陈善雄, 朱世宇, 熊海灵, 等. 一种双判别器GAN的古彝文字符修复方法[J/OL]. 自动化学报, 2022, 48(3): 853-864. DOI:10.16383/j.aas.c190752. Chen Shan-Xiong, Zhu Shi-Yu, Xiong Hai-Ling,Zhao Fu-Jia, Wang Ding-Wang, Liu Yun. A method of inpainting ancient Yi characters based ondual discriminator generative adversarial networks.Acta Automatica Sinica, 2022, 48(3): 853−864 doi: 10.16383/j.aas.c190752.
[26] WENJUN Z, BENPENG S, RUIQI F, et al. EA-GAN: restoration of text in ancient Chinese books based on an example attention generative adversarial network[J/OL]. Heritage Science,2023,11(1): 42. DOI:10.1186/s40494-023-00882-y.
[27] 段荧, 龙华, 瞿于荃, 等. 基于部分卷积的文字图像不规则干扰修复算法研究[J]. 计算机工程与科学, 2021, 43(9): 1634-1644. DUAN Ying, LONG Hua, QU Yu-quan, SHAO Yu-bin, DU Qing-zhi, . An irregular interference repair algorithm of text images based on partial convolution[J]. Computer Engineering & Science, 2021, 43(09): 1634-1644.
[28] 李超, 李思樵, 张靖熙, 等. 基于深度学习算法的碑文提取与修复系统[J]. 信息技术与信息化, 2024(10): 193-196. Li, C., Li, S., Zhang, J., et al. (2024). Extraction and Restoration System of Epitaphs Based on Deep Learning Algorithms. Information Technology and Informatization, 10(10), 193-196.
[29] GULATI A, QIN J, CHIU C C, 等. Conformer: Convolution-augmented Transformer for Speech Recognition[A/OL].arXiv,2020[2025-12-25].http://arxiv.org/abs/2005.08100. DOI:10.48550/arXiv.2005.08100.
[30] 张兰云. 简牍文字提取与识别研究[D/OL]. 西北师范大学, 2018[2025-12-26]. https://kns.cnki.net/KCMS/detail/detail.aspx?dbcode=CMFD&dbname=CMFD201802&filename=1017199984.nh. Zhang, L. (2017). Research on the extraction and recognition of bamboo slip characters [Master’s thesis, Northwest Normal University].
[31] PENG X, ZHAO H, WANG X, 等. C3N: content-constrained convolutional network for mural image completion[J/OL]. Neural Computing and Applications, 2023, 35(2): 1959-1970. DOI:10.1007/s00521-022-07806-0.
[32] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778.
[33] Liu Z, Lin Y, Cao Y, et al. Swin transformer: Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2021: 10012-10022.
[34] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014.
[35] Zhang Y, Shi Y, Zhang P, et al. MegaHan97K: A large-scale dataset for mega-category Chinese character recognition with over 97K categories[J]. Pattern Recognition, 2025: 111757.

选择文件类型/文献管理软件名称

选择包含的内容