作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• •    

基于多粒度特征引导的简牍文字图像修复网络

  • 发布日期:2026-03-30

A Multi-Granularity Feature Fusion Network for the Restoration of Jiandu Text Images

  • Published:2026-03-30

摘要: 简牍文字图像中存在的结构和纹理语义混淆、退化类型复杂、文字像素与背景噪音对比度低等问题,现有图像修复方法在处理具有复杂退化场景的简牍文字图像时普遍存在结构与纹理语义耦合、难以区分建模不同退化程度像素以及掩膜感知能力不足等问题,导致文字结构破坏、修复不稳定及伪影现象频发。本文提出了一种基于多粒度特征引导的简牍文字图像修复——AmdmaNet。首先,在纹理修复网络和结构修复网络中分别重建受结构边缘约束的纹理特征和基于相对全变分量(RTV)的结构特征,避免结构和纹理语义混淆的问题;随后,在图像细化阶段引入多尺度动态范围分布图自注意力机制(Mdma),对不同退化程度的像素进行分类处理,有效缓解修复过度或修复不充分的问题;进一步,采用自适应掩膜感知像素洗牌下采样方法(Ampd),通过受损像素对周围完整区域自适应地分配权重,增强模型对破损区域的置信度,再根据破损区域的位置信息引导图像下采样,确保掩码位置不发生偏移,显著减少了伪影、模糊及马赛克等现象。最后,在自建的简牍文字图像数据集上进行实验验证,实验结果表明,所提出方法在主观视觉感受和客观评价指标上均优于当前主流图像修复算法,尤其在处理文字笔画断裂、背景噪声干扰等复杂场景时表现出更强的鲁棒性。

Abstract: Existing methods for inpainting bamboo slip text images struggle with structural-texture confusion, complex degradation, and low text-background contrast, often causing structural damage, instability, and artifacts. This paper proposes AmdmaNet, a multi-granularity feature-guided inpainting network. It separately reconstructs texture and structural features to avoid semantic confusion. A Multi-scale Dynamic-range Map Attention (Mdma) mechanism classifies pixels by degradation level, preventing over/under-inpainting. An Adaptive Mask-aware Pixel-shuffle Downsampling (Ampd) method weights damaged pixels using surrounding information and guides downsampling to prevent mask shift, reducing artifacts, blur, and mosaics. Experiments on a custom dataset show our method outperforms state-of-the-art approaches in both visual quality and metrics, demonstrating superior robustness for complex cases like broken strokes and background noise.