Thyroid Ultrasound Nodule Segmentation Based on MAD-UNet

doi:10.19678/j.issn.1000-3428.0260134

Abstract

Thyroid nodule segmentation in ultrasound images is hampered by blurred boundaries, low contrast, and small lesions of highly variable size. To address these challenges, this paper proposes an improved model named MAD-UNet, which improves contour delineation by strengthening the consistency of cross-layer feature transfer and by modeling deformable context. First, a Multi-Directional Separable Attention Module (MDSAM) is embedded in the skip connections between the encoder and the decoder. MDSAM applies direction-aware joint channel-spatial attention to reweight key edge responses, which improves the consistency between shallow spatial details and deep semantic information, strengthens boundary localization, and alleviates gradient attenuation when training the deep network. Second, the Transformer encoder is deepened to 24 layers to better model long-range dependencies and global context. Third, a Deformable Adaptive Multi-Scale Context Module (DAMCM) is introduced. DAMCM combines deformable modeling with multi-scale context aggregation, adaptively fusing local structure alignment with supplementary global context to better represent irregular contours and small targets. On the TN3K, DDTI, and Shanghai Sixth People's Hospital THN-L datasets, the model reaches Dice scores of 89.10%, 90.53%, and 91.17%, respectively, and its overall performance exceeds the TransUNet baseline on all three datasets. Complexity evaluation shows 215.27M parameters, 65.96G floating-point operations (FLOPs), and an inference speed of 111 frames per second (FPS). Visualization analysis shows that the model delineates nodule contours more robustly under complex ultrasound conditions. The experimental results verify the effectiveness of the model in fine boundary delineation and small-lesion recognition, and the method provides a basis for subsequent deployment and optimization in clinical application scenarios.
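The Dice score reported in the abstract measures the overlap between a predicted mask P and a ground-truth mask G as 2|P ∩ G| / (|P| + |G|). The following is a minimal sketch of that computation, assuming binary masks given as nested lists of 0/1 values. The function name `dice_score` and the empty-mask convention are illustrative assumptions, not taken from the paper, and the paper's exact evaluation protocol (thresholding, per-image versus per-dataset averaging) is not stated in the abstract.

```python
from typing import Sequence


def dice_score(pred: Sequence[Sequence[int]],
               target: Sequence[Sequence[int]]) -> float:
    """Dice = 2|P ∩ G| / (|P| + |G|) for two binary masks of equal shape."""
    if len(pred) != len(target) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, target)
    ):
        raise ValueError("masks must have the same shape")

    intersection = 0   # pixels that are foreground in both masks
    pred_area = 0      # foreground pixels in the prediction
    target_area = 0    # foreground pixels in the ground truth
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p_bit, t_bit = int(bool(p)), int(bool(t))
            intersection += p_bit & t_bit
            pred_area += p_bit
            target_area += t_bit

    # Assumed convention: two empty masks count as perfect agreement.
    if pred_area + target_area == 0:
        return 1.0
    return 2.0 * intersection / (pred_area + target_area)


# Toy 2x3 masks: 2 overlapping pixels, 3 foreground pixels in each mask.
pred_mask = [[0, 1, 1],
             [0, 1, 0]]
true_mask = [[0, 1, 0],
             [0, 1, 1]]
print(f"Dice = {dice_score(pred_mask, true_mask):.4f}")
```

On the toy masks the overlap is 2 pixels and each mask has 3 foreground pixels, so the score is 2·2 / (3 + 3) ≈ 0.667. A reported value such as 89.10% corresponds to 0.8910 on this scale.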


Fu Su, Wang Shuaiqun. Thyroid Ultrasound Nodule Segmentation Based on MAD-UNet[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260134.



URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260134

