Multi-level Loss-assisted Siamese Network for Remote Sensing Image Change Detection

doi:10.19678/j.issn.1000-3428.0260291

Abstract

Abstract: Remote sensing image change detection aims to precisely localize land cover changes by comparatively analyzing the spatiotemporal evolution information contained in bi-temporal imagery, and has become a core task in fields such as dynamic monitoring of land resources, urban expansion assessment, and disaster emergency response. However, influenced by multiple factors including complex terrain interference, variations in illumination conditions, seasonal vegetation succession, and sensor imaging noise, change regions often exhibit characteristics such as substantial scale variations, discrete spatial distribution, and ambiguous boundary delineation. Existing change detection models suffer from insufficient exploitation of multi-scale information and inadequate extraction of deep global semantic correlations, rendering it challenging for these models to effectively discriminate genuine land surface changes from pseudo-changes, thereby constraining their discrimination accuracy in open-scene scenarios. To address the aforementioned limitations, a Multi-level Loss-assisted Siamese Network (MLLA_SiaNet) for remote sensing image change detection is proposed. The model adopts a weight-sharing Siamese architecture to extract multi-dimensional features from bi-temporal images separately, and generates hierarchical feature maps through a multi-level differential encoder. To overcome the linear limitations inherent in conventional differencing methods, we introduce a multi-angle difference representation strategy coupled with a channel-spatial hybrid attention mechanism, and design a Differential Fusion Module (DFM) to acquire high-quality difference features, thereby achieving adaptive suppression of background interference and precise focusing on genuine change characteristics. To compensate for the deficiency in global semantic representation, we integrate a spatial pooling pyramid with a Gaussian pyramid and propose a Deep Semantic Pyramid (DSP) module to construct multi-level semantic aggregation features, effectively expanding the receptive field and strengthening long-range contextual dependency modeling. During the decoding stage, the model employs a progressive upsampling strategy combined with a feature fusion mechanism to hierarchically restore spatial details, thereby enabling the reconstruction of high-resolution prediction maps. Furthermore, we introduce a deeply supervised Multi-level Loss-assisted (MLA) strategy to optimize the training process; by imposing auxiliary constraints on the outputs of each decoder layer, this strategy ensures consistency between local edge information and global contextual semantics, thereby constructing an end-to-end feature learning framework. To systematically validate the effectiveness of the proposed model, comparative experiments are conducted and results are comprehensively analyzed on two publicly available benchmark datasets, namely SYSU-CD and LEVIR-CD. On the SYSU-CD dataset, MLLA_SiaNet achieves an F1-score of 82.13%, outperforming seven other comparative methods and surpassing the second-best method, SFEARNet, by 1.3 percentage points; its precision and recall attain optimal values of 83.42% and 80.88%, respectively, achieving simultaneous improvement in both precision and recall metrics. On the LEVIR-CD dataset, MLLA_SiaNet achieves a precision of 89.48%, fully demonstrating the effectiveness of the proposed method in suppressing pseudo-change factors such as illumination variations, shadow effects, and seasonal vegetation changes; the F1-score of our model on the LEVIR-CD dataset reaches 85.87%, outperforming other state-of-the-art methods including SFEARNet (precision 84.89%), BIT (precision 82.80%), and IFN (precision 82.29%).Both quantitative and qualitative analyses of the experimental results demonstrate that the model exhibits superior robustness under varying spatial resolutions and complex land cover conditions. Ablation studies further corroborate the advantages of the DFM, DSP, and MLA modules in enhancing overall model performance, and the effectiveness of each architectural stage is empirically verified through analysis of the visualized response feature maps. In summary, this study mitigates the impacts of several critical challenges in remote sensing image change detection tasks, including insufficient multi-scale feature interaction, weak correlation modeling of global semantic information, and difficulties in suppressing pseudo-change interference. Future work will focus on lightweight model deployment, multi-temporal sequence modeling, and self-supervised pre-training techniques, as well as expanding systematic evaluations of model robustness across diverse application scenarios.

摘要： 遥感图像变化检测旨在通过对比分析双时相影像包含的时空演变信息，精准定位地表覆盖的变化情况，已成为国土资源动态监测、城市扩张评估及灾害应急响应等领域的核心任务。然而，受复杂地形干扰、光照条件差异、季节植被更替以及传感器成像噪声等多重因素影响，变化区域常常呈现尺度跨度大、空间分布离散以及边界模糊等特性。现有变化检测模型存在对多尺度信息利用不充分以及深层全局语义关联提取不充分的问题，模型难以有效区分真实地表演变与伪变化，制约了其在开放场景下的判别精度。针对上述局限，提出一种面向遥感图像变化检测的多级损失辅助孪生网络（Multi-level loss-assisted Siamese-Network，MLLA_SiaNet）。该模型采用权值共享孪生架构分别提取双时相图像的多维特征，通过多级差分编码器生成层次化特征图。为了突破传统差分方法的线性局限，引入多角度差异表示策略并耦合通道-空间混合注意力机制，设计差分融合模块（Differential Fusion Module，DFM）获取高质量差异特征，实现背景干扰的自适应抑制与真实变化特征的精准聚焦。为了弥补全局语义缺失，将空间池化金字塔与高斯金字塔结合，提出深度语义提取模块（Deep Semantic Pyramid，DSP）构建多层级语义聚合特征，有效扩大感受野并强化长程上下文依赖建模。模型的解码阶段采用渐进式上采样与特征融合机制逐级恢复空间细节，实现高分辨率预测图像的重建。并引入深度监督的多级辅助损失（Multi-level Loss-assisted，MLA）优化训练过程，通过对解码器各层输出进行辅助约束，确保局部边缘信息与全局信息一致性，构建端到端特征学习模型。为系统验证模型有效性，在SYSU-CD与LEVIR-CD公开数据集上开展对比实验并分析结果。在SYSU-CD数据集上，MLLA_SiaNet以82.13%的F1分数优于其他七种对比方法，较次优方法SFEARNet提升1.3个百分点；其精确度与召回率分别达到最优值83.42%和80.88%，实现了查准率与查全率的同步提升。在LEVIR-CD数据集上，MLLA_SiaNet的精确度达到了89.48%，充分说明所提出的方法在抑制光照、阴影及植被季节性变化等伪变化因素方面的有效性；本模型在LEVIR-CD数据集上的F1分数为85.87%，优于SFEARNet（精确度84.89%）、BIT（精确度82.80%）与IFN(精确度82.29%)等其他方法。对实验结果的定量分析与定性分析说明，模型在不同分辨率与复杂地物条件下均展现出较好的鲁棒性。消融实验进一步证实了DFM、DSP与MLA模块在提升模型性能方面的优势，并通过分析模型的可视化响应特征图，验证了模型各个阶段的有效性。综上，本研究缓解了遥感图像变化检测任务中多尺度特征交互不足、全局语义信息关联性较弱以及对伪变化抑制困难等关键问题的影响。未来工作将聚焦于轻量化部署、多时相序列建模及自监督预训练技术，拓展模型鲁棒性的系统性评测。

ZHAO Yijing, QIN Na, LIU Yuan, SONG Menghao. Multi-level Loss-assisted Siamese Network for Remote Sensing Image Change Detection[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260291.

赵一静, 秦娜, 刘远, 宋梦浩. 面向遥感图像变化检测的多级损失辅助孪生网络 Sensing Image Change Detection[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0260291.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260291

References

[1] HEGAZY R, KALOOP M R. Monitoring urban growth and land use change detection with GIS and remote sensing techniques in Daqahlia governorate Egypt[J]. International Journal of Sustainable Built Environment, 2015, 4(1): 117–124.
[2] SAMANTA S, PAL D K. Change detection of land use and land cover over a period of 20 years in Papua New Guinea[J]. Natural Science, 2016, 8(3): 138.
[3] ZHIYONG L, et al. Diagnostic analysis on change vector analysis methods for LCCD using remote sensing images[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14: 10199–10212.
[4] LEI T, ZHANG Y, LV Z, et al. Landslide inventory mapping from bitemporal images using deep convolutional neural networks[J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(6): 982–986.
[5] 陈曦,梁方,王威.基于对偶树复小波变换与PCA方法结合的图像变化检测算法研究[J].计算机工程与科学, 2014, 36(8): 560-1565. CHEN X, FANG L, WANG W. Image change detection based on dual-tree complex wavelet transform and principal component analysis[J]. Computer Engineering & Science, 2014, 36(8): 1560–1565.
[6] J. F. Mas, “Monitoring land-cover changes: A comparison of change detection techniques,” Int. J. Remote Sens., 1999, 20(1): 139–152.
[7] F. Bovolo and L. Bruzzone, “A theoretical framework for unsu-pervised change detection based on change vector analysis in the polar domain,” IEEE Trans. Geosci. Remote Sens., 2007, 45(1): 218–236.
[8] NIELSEN A A, CONRADSEN K, SIMPSON J J. Multivariate alteration detection (MAD) and MAF postprocessing in multispectral, bitemporal image data: New approaches to change detection studies[J]. Remote Sensing of Environment, 1998, 64(1): 1–19.
[9] LIU S, DU P, GAMBA P, et al. Fusion of difference images for change detection in urban areas[C]//2011 Joint Urban Remote Sensing Event. Munich, Germany: IEEE, 2011: 165-168. DOI: 10.1109/JURSE.2011.5764745.
[10] DAUDT R C, SAUX B L, BOULCH A. Fully convolutional siamese networks for change detection[C]// Proceedings of the 2018 IEEE International Conference on Image Processing (ICIP). Athens: IEEE, 2018: 4063–4067.
[11] WANG D, CHEN X, JIANG M, et al. ADS-Net: An attention-based deeply supervised network for remote sensing image change detection[J/OL]. International Journal of Applied Earth Observation and Geoinformation, 2021, 101: 102348.
[12] 田青林,陆冬华,李瑶,等.基于密集混合注意力网络的遥感影像建筑物变化检测[J].光学学报, 2025 ,45(06) :306-316. TIAN Q, LU D, LI Y, et al. Building change detection in remote sensing images based on a dense hybrid attention network[J/OL]. Acta Optica Sinica, 2025, 45(6): 0628008.
[13] LIU S, ZHAO D, TANG L. A Siamese network-based large-size remote sensing change detection network based on differential enhancement[J/OL]. Pattern Recognition Letters, 2025, 197: 319–324.
[14] XU Y, LEI T, NING H, et al. From macro to micro: A lightweight interleaved network for remote sensing image change detection[J/OL]. IEEE Transactions on Geoscience and Remote Sensing, 2025, 63: 1–14.
[15] 姜有泽,刘向阳.基于多时相-ChangeFormer的遥感图像建筑物变化检测方法[J/OL].计算机工程,1-12[2026-03-05].https://doi.org/10.19678/j.issn.1000-3428.0070443. JIANG Y Z, LIU X Y. Building change detection method in remote sensing images based on multi-temporal Change Former[J/OL]. Computer Engineering, 2025: 1-12[2026-03-05]. https://doi.org/10.19678/j.issn.1000-3428.0070443.
[16] 吴潮宇,杨斌.基于大核重参U-Net的遥感影像变化检测[J].计算机工程, 2025, 51(03): 261-273. DOI: 10.19678/j.issn.1000-3428.0068459. WU C Y, YANG B. Large-kernel reparametrized U-Net for remote sensing image change detection[J]. Computer Engineering, 2025, 51(3): 261-273. https://doi.org/10.19678/j.issn.1000-3428.0068459.
[17] 陈海永,吕承杰,杜春,等.孪生注意力门控融合的遥感图像变化检测编解码网络[J].计算机工程与科学, 2023, 45(09): 1593-1601. CHEN H, LÜ C, DU C, et al. A twin attention gated fusion encoder-decoder network for remote sensing image change detection[J]. Computer Engineering & Science, 2023, 45(9): 1593–1601.
[18] KAUR G, AQAF Y. Developments in deep learning for change detection in remote sensing: A review[J]. Transactions in GIS, 2024, 28(2): 223–257.
[19] QI Q, WANG Y. Application and optimization of deep learning in change detection for high-resolution remote sensing imagery[J]. Academic Journal of Science and Technology, 2025, 15(3): 40–42.
[20] ZHANG X, LIU J, ZHANG W, et al. Spectral-spatial attentions and deep supervision for change detection in remote sensing images[C]// IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium. Piscataway: IEEE, 2024: 10306–10310.
[21] SAGAR A S M S, CHEN Y, XIE Y, et al. MSA R-CNN: A comprehensive approach to remote sensing object detection and scene understanding[J]. Expert Systems with Applications, 2024, 241: 122788.
[22] ZHANG C, LIU J, ZHANG W, et al. A deeply supervised image fusion network for change detection in high resolution bi-temporal remote sensing images[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 166: 183–200.
[23] WU X, YANG L, MA Y, et al. An end-to-end multiple side-outputs fusion deep supervision network based remote sensing image change detection algorithm[J]. Signal Processing, 2023, 213: 109203.
[24] CAI Y, LIAO S, HE W, et al. CSANet: A channel-spatial attention network for remote sensing image change detection[J]. International Journal of Remote Sensing, 2023, 44(19): 5936–5959.
[25] WU J, XIE C, ZHANG Z, et al. A deeply supervised attentive high-resolution network for change detection in remote sensing images[J]. Remote Sensing, 2022, 15(1): 45.
[26] CHEN J, LI J Y, XU J, et al. Remote sensing image change detection based on similarity sensing network[C]// 2024 5th International Conference on Computer, Big Data and Artificial Intelligence (ICCBD+AI). Piscataway: IEEE, 2024: 386–390.
[27] HOU X, LIN J, SHANG C, et al. Cross-scale heterogeneous convolution change detection based on spatial-spectral information fusion for remote sensing imagery[C]// ZHENG H, GLASS D, MULVENNA M, et al. Advances in computational intelligence systems: UKCI 2024. Advances in Intelligent Systems and Computing, vol 1462. Cham: Springer, 2024: 3–14.
[28] LI S, SONG Y, WU X, et al. MFMENet: Multi-scale features mutual enhancement network for change detection in remote sensing images[J]. International Journal of Remote Sensing, 2024, 45(10): 3248–3273.
[29] REN W, WANG Z, XIA M, et al. MFINet: Multi-scale feature interaction network for change detection of high-resolution remote sensing images[J]. Remote Sensing, 2024, 16(7): 1269.
[30] HUANG Z, YOU H. MFSFNet: Multi-scale feature subtraction fusion network for remote sensing image change detection[J]. Remote Sensing, 2023, 15(15): 3740.
[31] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional block attention module[C]// Proceedings of the European Conference on Computer Vision (ECCV). Munich: Springer, 2018: 3–19.
[32] CHEN H, QI Z, SHI Z. Remote sensing image change detection with transformers[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 5607514.
[33] LI J, et al. A large-scale dataset for change detection in high-resolution remote sensing imagery[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(8): 1264–1268. X.
[34] CHEN X, et al. LEVIR-CD: A change detection dataset for remote sensing images[EB/OL]. arXiv, 2020.
[35] FANG S, LI K, SHAO J, et al. SNUNet-CD: A densely connected Siamese network for change detection of VHR images[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 1–5.
[36] LI M, MING D, XU L, et al. SFEARNet: A Network Combining Semantic Flow and Edge-Aware Refinement for Highly Efficient Remote Sensing Image Change Detection[J/OL]. IEEE Transactions on Geoscience and Remote Sensing, 2025, 63: 1-18. DOI:10.1109/TGRS.2025.3545906.
[37] Zeiler M D, Fergus R. Visualizing and Understanding Convolutional Networks[C]// Fleet D, Pajdla T, Schiele B, et al. Computer Vision — ECCV 2014. Cham: Springer International Publishing, 2014: 818-833.
[38] ZAGORUYKO S, KOMODAKIS N. Paying more attention to attention: improving the performance of convolutional neural networks via attention transfer[C]// Proceedings of the 5th International Conference on Learning Representations. Toulon, France: ICLR, 2017.
[39] SHAW P, USZKOREIT J, VASWANI A. Self-attention with relative position representations[C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. New Orleans, Louisiana: Association for Computational Linguistics, 2018: 464-468.

Please choose a citation manager

Content to export