基于多尺度区域特征融合的多器官语义分割模型

doi:10.19678/j.issn.1000-3428.0069269

摘要/Abstract

摘要：

深度学习逐渐被广泛应用于医学图像分割领域，基于注意力机制的分割算法是目前研究的主要方法。现有大多数基于注意力机制的2D图像分割模型在多器官分割任务中往往关注切片的整体分割效果，而忽略了切片中小目标特征信息的丢失或欠分割问题，使模型分割性能受到限制。针对这一问题，提出一种基于多尺度特征融合和改进注意力机制的多器官语义分割模型DASC-Net。DASC-Net的整体框架基于编码器-解码器架构，编码器采用ResNet 50，与解码器之间设置跳跃连接。注意力机制由1个双重注意力模块(DAM)和1个小目标提取(SOC)模块的并联结构实现，从而进行多尺度区域特征融合。DASC-Net不仅可以感知到较大目标的特征信息，还可以通过注意力权重重建的方式保留小目标的特征信息，提高了模型的分割性能。在CHAOS数据集上的实验结果表明，DASC-Net在灵敏度、Jaccard相似系数、正类预测值(PPV)、Dice相似系数和平均交并比(mIoU)上分别可以达到83.72%、75.79%、87.75%、85.63%和77.60%，在Synapse数据集上的Dice相似系数和95%豪斯多夫距离(HD95)指标数值分别为82.44%和21.25 mm。DASC-Net在2个数据集上的表现均优于其他分割网络，具有可靠、准确的分割性能。

关键词: 深度学习, 医学图像分割, 注意力机制, 多器官, 小目标提取模块

Abstract:

Deep learning has been widely applied to medical imaging. A medical image segmentation model based on an attention mechanism is one of the main methods used in current research. For the multi-organ segmentation task, most existing 2D segmentation models mainly focus on the overall segmentation effect of slices, while ignoring the loss or under-segmentation of small object feature information in slices, which limits the model′s segmentation performance. To solve this problem, this study proposes a multi-organ semantic segmentation model, DASC-Net, based on multi-scale feature fusion and an improved attention mechanism. The overall framework of the DASC-Net is based on an encoder-decoder architecture. The encoder uses the ResNet 50 network and sets a skip connection with the decoder. The attention mechanism is realized using the parallel structure of a Dual Attention Module (DAM) and a Small Object Capture (SOC) module to perform multi-scale regional feature fusion. DASC-Net not only perceives the feature information of larger objects but also retains the feature information of small objects through attention weight reconstruction, which effectively addresses the limitations of the attention module and further improves the segmentation performance of the model. The experimental results on the CHAOS dataset show that DASC-Net can obtain 83.72%, 75.79%, 87.75%, 85.63% and 77.60% on the Sensitivity, Jaccard similarity coefficient, Positivity Predictive Value (PPV), Dice similarity coefficient, and mean Intersection over Union (mIoU) indicators, respectively; the Dice similarity coefficient and 95% Hausdorff Distance (HD95) values on the Synapse dataset are 82.44% and 21.25 mm, respectively. DASC-Net performs better than the other segmentation networks on both datasets, which demonstrates its reliable and accurate segmentation performance.

Key words: deep learning, medical image segmentation, attention mechanism, multi-organ, Small Object Capture(SOC) module

郝宏达, 罗健旭. 基于多尺度区域特征融合的多器官语义分割模型[J]. 计算机工程, 2025, 51(8): 270-280.

HAO Hongda, LUO Jianxu. Multi-Organ Semantic Segmentation Model Based on Multi-Scale Region Feature Fusion[J]. Computer Engineering, 2025, 51(8): 270-280.

https://www.ecice06.com/CN/Y2025/V51/I8/270

图/表 12

图1 DASC-Net网络架构

Fig.1 Network architecture of DASC-Net

图2 DAM结构

Fig.2 DAM structure

图3 小目标提取模块结构

Fig.3 Structure of small object capture module

图4 DASC-Net解码器

Fig.4 The decoder of DASC-Net

图5 消融实验可视化结果

Fig.5 Visualization results of ablation experiment

图6 对比实验的可视化结果

Fig.6 Visualization results of comparative experiments

参考文献 35

1	EVERINGHAM M , ESLAMI S M A , VAN GOOL L , et al. The pascal visual object classes challenge: a retrospective. International Journal of Computer Vision, 2015, 111, 98- 136. doi: 10.1007/s11263-014-0733-5
2	褚张晴晴, 钟志强, 颜子夜, 等. 基于特征融合与注意力机制的脑肿瘤分割算法. 计算机工程, 2023, 49 (10): 154- 161. doi: 10.19678/j.issn.1000-3428.0066311
	CHU Z Q Q , ZHONG Z Q , YAN Z Y , et al. Brain tumor segmentation algorithm based on feature fusion and attention mechanism. Computer Engineering, 2023, 49 (10): 154- 161. doi: 10.19678/j.issn.1000-3428.0066311
3	徐蓬泉, 梁宇翔, 李英. 融合多尺度语义和剩余瓶颈注意力的医学图像分割. 计算机工程, 2023, 49 (10): 162- 170. doi: 10.19678/j.issn.1000-3428.0065985
	XU P Q , LIANG Y X , LI Y . Medical image segmentation fusing multi-scale semantic and residual bottleneck attention. Computer Engineering, 2023, 49 (10): 162- 170. doi: 10.19678/j.issn.1000-3428.0065985
4	杨本臣, 贾宇航, 金海波. 融合多分支特征的肝脏和肝脏肿瘤的体积分割. 计算机工程, 2023, 49 (10): 194- 201. doi: 10.19678/j.issn.1000-3428.0066125
	YANG B C , JIA Y H , JIN H B . Volume segmentation of liver and liver tumor with fusion of multibranch features. Computer Engineering, 2023, 49 (10): 194- 201. doi: 10.19678/j.issn.1000-3428.0066125
5	BATENBURG K J , SIJBERS J . Optimal threshold selection for tomogram segmentation by projection distance minimization. IEEE Transactions on Medical Imaging, 2009, 28 (5): 676- 686. doi: 10.1109/TMI.2008.2010437
6	HARIHARAN B, ARBELÁEZ P, GIRSHICK R, et al. Simultaneous detection and segmentation[C]//Proceedings of the 13th European Conference on Computer Vision. Berlin, Germany: Springer, 2014: 297-312.
7	SHRIVAKSHAN G T , CHANDRASEKAR C . A comparison of various edge detection techniques used in image processing. International Journal of Computer Science Issues, 2012, 9 (5): 269- 276.
8	LITJENS G , KOOI T , BEJNORDI B E , et al. A survey on deep learning in medical image analysis. Medical Image Analysis, 2017, 42, 60- 88. doi: 10.1016/j.media.2017.07.005
9	RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[C]//Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention. Berlin, Germany: Springer, 2015: 234-241.
10	ZHOU Z W, SIDDIQUEE M M R, NIMA T, et al. UNet++: a nested U-Net architecture for medical image segmentation[C]//Proceedings of Workshop on Deep Learning in Medical Image Analysis. Berlin, Germany: Springer, 2018: 3-11.
11	HUANG H M, LIN L F, TONG R F, et al. UNet 3+: a full-scale connected UNet for medical image segmentation[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020). Washington D. C., USA: IEEE Press, 2020: 1055-1059.
12	HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Washington D. C., USA: IEEE Press, 2016: 770-778.
13	PENG D L , XIONG S Y , PENG W J , et al. LCP-Net: a local context-perception deep neural network for medical image segmentation. Expert Systems with Applications, 2021, 168, 114234. doi: 10.1016/j.eswa.2020.114234
14	CHEN R, WANG X, JIN B, et al. CLD-Net: complement local detail for medical small-object segmentation[C]//Proceedings of 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Washington D. C., USA: IEEE Press, 2022: 942-947.
15	HUANG Q , SU J , PRZYSTUPA K , et al. BSANet: highperformance 3D medical image segmentation. IEEE Access, 2023, 11, 79213- 79223. doi: 10.1109/ACCESS.2023.3299491
16	张艳, 马春明, 刘树东, 等. 基于多尺度特征增强的高效Transformer语义分割网络. 光电工程, 2024, 51 (12): 240237. doi: 10.12086/oee.2024.240237
	ZHANG Y , MA C M , LIU S D , et al. Multi-scale feature enhanced Transformer network for efficient semantic segmentation. Opto-Electronic Engineering, 2024, 51 (12): 240237. doi: 10.12086/oee.2024.240237
17	WANG J , ZHAO H Y , LIANG W , et al. Cross-convolutional transformer for automated multi-organs segmentation in a variety of medical images. Physics in Medicine & Biology, 2023, 68 (3): 035008.
18	KANG S , YANG M , QI X S , et al. Bridging feature gaps to improve multi-organ segmentation on abdominal magnetic resonance image. IEEE Journal of Biomedical and Health Informatics, 2023, 27 (3): 1477- 1487. doi: 10.1109/JBHI.2022.3229315
19	SHEN N , WANG Z , LI J , et al. Multi-organ segmentation network for abdominal CT images based on spatial attention and deformable convolution. Expert Systems with Applications, 2023, 211, 118625.
20	徐辉, 祝玉华, 甄彤, 等. 深度神经网络图像语义分割方法综述. 计算机科学与探索, 2021, 15 (1): 47- 59.
	XU H , ZHU Y H , ZHEN T , et al. Survey of image semantic segmentation methods based on deep neural network. Journal of Frontiers of Computer Science and Technology, 2021, 15 (1): 47- 59.
21	田萱, 王亮, 丁琪. 基于深度学习的图像语义分割方法综述. 软件学报, 2019, 30 (2): 440- 468.
	TIAN X , WANG L , DING Q . Review of image semantic segmentation based on deep learning. Journal of Software, 2019, 30 (2): 440- 468.
22	HU J , SHEN L , ALBANIE S , et al. Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42, 2011- 2023.
23	ROY A G, NAVAB N, WACHINGER C. Concurrent spatial and channel 'squeeze & excitation'in fully convolutional networks[C]//Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention. Berlin, Germany: Springer, 2018: 421-429.
24	FU J, LIU J, TIAN H J, et al. Dual attention network for scene segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D. C., USA: IEEE Press, 2019: 3146-3154.
25	CHEN J N, LU Y Y, YU Q H, et al. TransUNet: Transformers make strong encoders for medical image segmentation[EB/OL]. [2023-12-15]. https://arxiv.org/abs/2102.04306?context=cs.
26	SCHLEMPER J , OKTAY O , SCHAAP M , et al. Attention gated networks: learning to leverage salient regions in medical images. Medical Image Analysis, 2019, 53, 197- 207.
27	GU Z W , CHENG J , FU H Z , et al. CE-Net: context encoder network for 2D medical image segmentation. IEEE Transactions on Medical Imaging, 2019, 38 (10): 2281- 2292.
28	SINHA A , DOLZ J . Multi-scale self-guided attention for medical image segmentation. IEEE Journal of Biomedical and Health Informatics, 2021, 25 (1): 121- 130.
29	HUANG Z L , WANG X G , WEI Y C , et al. CCNet: criss-cross attention for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45 (6): 6896- 6908.
30	SONG J H , CHEN X J , ZHU Q L , et al. Global and local feature reconstruction for medical image segmentation. IEEE Transactions on Medical Imaging, 2022, 41 (9): 2273- 2284.
31	CAO H, WANG Y Y, CHEN J, et al. Swin-Unet: Unet-like pure transformer for medical image segmentation[EB/OL]. [2023-12-15]. https://arxiv.org/abs/2105.05537?context=eess.IV.
32	WANG H Y, XIE S, LIN L F, et al. Mixed transformer U-Net for medical image segmentation[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Washington D.C., USA: IEEE Press, 2022: 2390-2394.
33	LIN G, CHEN L. A multi-scale fusion network with transformer for medical image segmentation[C]//Proceedings of the 3rd International Conference on Neural Networks, Information and Communication Engineering. Washington D.C., USA: IEEE Press, 2023: 224-228.
34	YU J, HE X, QIN J, et al. Trans-UNeter: a new decoder of Trans-UNet for medical image segmentation[C]//Proceedings of IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Washington D.C., USA: IEEE Press, 2023: 2338-2341.
35	ZHAO L, TIAN X, LIU Y. Transformer based position information enhancement for medical image segmentation[C]//Proceedings of the 4th Asia Conference on Information Engineering (ACIE). Washington D.C., USA: IEEE Press, 2024: 92-96.

[1]	林帆, 李建华. 基于多阶门控聚合网络的光学化学结构识别[J]. 计算机工程, 2025, 51(8): 364-372.
[2]	武东辉, 王金凤, 仇森, 刘国志. 基于EWBiLSTM-ATT的数据手套手语识别[J]. 计算机工程, 2025, 51(8): 107-119.
[3]	张昭理, 李家豪, 刘海, 石佛波, 何嘉文. 基于个性化遗忘建模的知识追踪方法[J]. 计算机工程, 2025, 51(8): 120-130.
[4]	闫建红, 刘芝妍, 王震. 融合时空注意力机制的多尺度卷积车辆轨迹预测[J]. 计算机工程, 2025, 51(8): 406-414.
[5]	倪源松, 韩军, 邹小燕, 胡广怡, 王文帅. 两阶段自适应分块输电线路螺栓缺陷检测方法[J]. 计算机工程, 2025, 51(8): 281-291.
[6]	武东辉, 王金凤, 仇森, 刘国志. 基于EWBiLSTM-ATT的数据手套手语识别[J]. 计算机工程, 2025, 51(8): 107-119.
[7]	彭菊红, 张弛, 高谦, 张光明, 谈栋华, 赵明俊. 基于改进的YOLOv8算法的钢材缺陷检测[J]. 计算机工程, 2025, 51(7): 152-160.
[8]	李姜辛, 王鹏, 汪卫. 多机理指导的深度学习工业时序预测框架[J]. 计算机工程, 2025, 51(7): 47-58.
[9]	周哲臣, 胡冀苏, 钱旭升, 郑毅, 戴亚康, 周志勇. 基于查询自适应双层自注意力机制的MRI脑组织分割[J]. 计算机工程, 2025, 51(7): 294-304.
[10]	宋杰, 徐慧英, 朱信忠, 黄晓, 陈晨, 王泽宇. 基于YOLOv8改进的跌倒检测算法: OEF-YOLO[J]. 计算机工程, 2025, 51(7): 127-139.
[11]	欧阳昱中, 韩锐, 刘驰. 边缘侧领域自适应中长尾视觉识别技术研究[J]. 计算机工程, 2025, 51(7): 171-179.
[12]	孟波, 史旭华, 张彬. 基于双分支卷积和深度插值的点云表面重建[J]. 计算机工程, 2025, 51(7): 119-126.
[13]	周莎, 车生兵, 考友琛, 张旭, 郭甚驿. 基于特征选择和时空特征的网络入侵检测[J]. 计算机工程, 2025, 51(7): 223-231.
[14]	沙宇洋, 陆京涛, 杜浩凡, 翟小兵, 孟维宇, 廉旭, 罗刚, 李克峰. 适用于导盲场景的多尺度特征融合轻量化道路图像分割算法[J]. 计算机工程, 2025, 51(7): 314-325.
[15]	余鹏, 杨佳琦, 陈欣然, 贺超波. 基于二部图对比学习的特征增强推荐算法[J]. 计算机工程, 2025, 51(7): 100-110.

选择文件类型/文献管理软件名称

选择包含的内容