Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 176-186. doi: 10.19678/j.issn.1000-3428.0070162

• Computer Vision and Image Processing • Previous Articles     Next Articles

Domain Adaptive Remote Sensing Image Segmentation Based on Hierarchical Attention

WANG Shasha1, LI Weitao2, LIU Xingyu2, GAO Hui2,3,*()   

  1. 1. School of Data Science, Hebi Polytechnic, Hebi 458030, Henan, China
    2. School of Computer Science and Engineering, University of Electronic Science and Technology, Chengdu 611731, Sichuan, China
    3. Information Industry Technology Research Institute of Kashi Region, Kashi 844099, Xinjiang, China
  • Received:2024-07-22 Revised:2024-09-14 Online:2026-04-15 Published:2024-12-10
  • Contact: GAO Hui

基于层级注意力的域自适应遥感图像分割

王沙沙1, 李帷韬2, 刘星宇2, 高辉2,3,*()   

  1. 1. 鹤壁职业技术学院大数据学院, 河南 鹤壁 458030
    2. 电子科技大学计算机科学与工程学院, 四川 成都 611731
    3. 喀什地区电子信息产业技术研究院, 新疆 喀什 844099
  • 通讯作者: 高辉
  • 作者简介:

    王沙沙, 女, 讲师、硕士, 主研方向为人工智能、计算机视觉

    李帷韬, 硕士

    刘星宇, 硕士

    高辉(通信作者), 教授

  • 基金资助:
    四川省科技计划项目(2023YFG0021); 河南省科技攻关计划项目(212102310550)

Abstract:

Remote sensing semantic image segmentation technology has significant applications in resource management, natural disaster management, and environmental monitoring and protection. However, different remote sensing image datasets often exhibit issues such as spectral confusion between different objects and spectral variations within the same object. These issues significantly reduce the generalization performance of deep learning models, and cross-domain performance degradation in remote sensing semantic image segmentation algorithms poses a significant challenge. To address these issues, optimizations are performed from two perspectives: neural network architecture and domain adaptation strategies. First, a TransConv network based on a hierarchical multihead self-attention mechanism and multiscale feature fusion is proposed. This network effectively enhances feature extraction and fusion capabilities through sliding window patching, multilayer self-attention modules, and a lightweight feedforward neural network, thereby improving the model's generalization performance. Second, a self-training-based domain adaptation technique is introduced, which optimizes the image input, model parameters, and learning process. As a result, labeled source domain knowledge is successfully transferred to the unlabeled target domain, significantly improving the segmentation performance in the target domain. Experimental results demonstrate that the improved TransConv network significantly outperforms other algorithms in terms of generalization performance. In addition, it excels in domain adaptation tasks with the self-training-based domain adaptation technique. The proposed approach thus enhances the accuracy and generalization capability of remote sensing image semantic segmentation, reduces the impact of erroneous pseudo-labels, and addresses the class imbalance problem, providing more reliable technical support for practical applications.

Key words: remote sensing image, Convolutional Neural Network (CNN), Transformer network, hierarchical attention, domain adaptive

摘要:

遥感图像语义分割技术在资源管理、自然灾害管理、环境监测和保护等领域具有重要应用价值, 然而不同的遥感图像数据集往往存在大量的异物同谱和同物异谱等现象, 极大地降低了深度学习模型的泛化性能, 同时遥感图像语义分割算法中存在跨域预测性能下降的问题。为了解决上述问题, 从神经网络模型架构和域自适应策略两个方面进行优化。首先, 提出了基于层级多头自注意力机制与多尺度特征融合的TransConv网络, 通过滑动窗口切块、多层自注意力模块和轻量前馈神经网络, 有效提升特征提取和融合的能力, 从而增强模型的泛化性能。其次, 提出一种基于自训练的域自适应技术, 该技术通过优化图像输入、模型参数和学习过程, 将带标注的源域知识成功迁移至未标注的目标域, 大幅提高了目标域的分割性能。实验结果表明, 改进后的TransConv网络不仅在泛化性能上显著优于其他算法, 基于自训练的域自适应技术也在域自适应任务中表现出色, 提升了遥感图像语义分割的准确性和泛化能力, 减少了错误伪标签的影响和解决了类不平衡问题, 为实际应用提供了更为可靠的技术支持。

关键词: 遥感图像, 卷积神经网络, Transformer网络, 层级注意力, 域自适应