基于多注意力机制的视网膜血管分割模型

doi:10.19678/j.issn.1000-3428.0069579

摘要/Abstract

摘要：

针对视网膜血管结构细微复杂、边界模糊、计算成本高等问题, 提出一种基于多注意力机制的视网膜血管分割模型(GAC-UNet)。首先, 在跳跃连接中嵌入用于提取通道间关系与空间位置信息的CASP(Channel Attention Spatial Pooling)注意力模块, 将其与残差连接相结合, 构成注意力残差单元(ARU), 以优化编解码器之间的特征处理, 突出重要特征; 然后, 在编码器结构中加入用于合理分配注意力的新型图注意力网络(NGAT), 将其与CASP注意力模块相结合以构建GACA综合注意力模块, 从多个角度对血管细节和边缘进行关注; 接着, 在编码器中堆叠多个GACA模块, 实现NGAT模块内部图注意力信息的累积, 以加强模型建模全局信息的能力, 弥补并丰富边缘特征信息; 最后, 在解码器结构相应层级中聚合不同注意力模块所提取的特征信息, 通过上采样操作得到最终的分割结果。在3个视网膜公共数据集DRIVE、CHASE_DB1、STARE上进行实验, 结果表明, GAC-UNet的特异性分别为97.76%、99.16%和98.66%, 准确率分别为96.80%、96.81%和96.34%, 表明GAC-UNet能够很好地识别出细微复杂、边界模糊的血管结构, 且具有较小的模型参数量。

关键词: 视网膜血管分割, CASP注意力模块, 新型图注意力网络, GACA综合注意力模块, 注意力机制

Abstract:

To address the challenges of subtle and complex structures, blurred boundaries, and high computational costs associated with retinal vasculature, this study proposes a retinal vessel segmentation model named GAC-UNet, based on a multi-attention mechanism. First, a Channel Attention Spatial Pooling (CASP) attention module, designed to extract interchannel relationships and spatial position information, is embedded into skip connections. By integrating this module with residual connections, an Attention Residual Unit (ARU) is formed to optimize feature processing between the encoder and decoder, thereby highlighting important features. Subsequently, a New Graph Attention Network (NGAT) is introduced into the encoder architecture for rationally allocating attention. This NGAT is combined with the CASP attention module to construct the GACA integrated attention module, which enables multi-faceted attention to vessel details and edges. Multiple GACA modules are stacked within the encoder to internally accumulate graph attention information within the NGAT modules, thereby enhancing the ability of GAC-UNet to model global information and enrich the edge feature information. Finally, the feature information extracted by the different attention modules is aggregated at the corresponding levels in the decoder architecture and the final segmentation result is obtained using upsampling operations. Experimental evaluations conducted on three public retinal datasets—DRIVE, CHASE_DB1, and STARE—demonstrate that the proposed model achieves specificities of 97.76%, 99.16%, and 98.66%, and accuracies of 96.80%, 96.81%, and 96.34%, respectively. These results indicate that GAC-UNet effectively identifies subtle and complex vessel structures with blurred boundaries while maintaining a relatively small model parameter size.

Key words: retinal blood vessel segmentation, CASP attention module, the New Graph Attention Networks (NGAT), GACA composite attention module, attention mechanism

周峥, 张笃振. 基于多注意力机制的视网膜血管分割模型[J]. 计算机工程, 2025, 51(11): 268-282.

ZHOU Zheng, ZHANG Duzhen. Retinal Blood Vessel Segmentation Model Based on Multi-Attention Mechanism[J]. Computer Engineering, 2025, 51(11): 268-282.

https://www.ecice06.com/CN/Y2025/V51/I11/268

图/表 20

图1 GAC-UNet网络结构

Fig.1 GAC-UNet network structure

图2 CASP模块结构

Fig.2 CASP module structure

图3 NGAT模块结构

Fig.3 NGAT module structure

图4 GACA模块结构

Fig.4 GACA module structure

图5 不同算法在DRIVE数据集上的曲线对比

Fig.5 Comparison of curves of different algorithms on the DRIVE dataset

图6 不同算法在CHASE_DB1数据集上的曲线对比

Fig.6 Comparison of curves of different algorithms on the CHASE_DB1 dataset

图7 不同算法在STARE数据集上的曲线对比

Fig.7 Comparison of curves of different algorithms on the STARE dataset

图8 不同算法的视网膜血管分割效果

Fig.8 The segmentation effect of retinal blood vessels using different algorithms

参考文献 46

1	张欢, 仇大伟, 冯毅博, 等. U-Net模型改进及其在医学图像分割上的研究综述. 激光与光电子学进展, 2022, 59 (2): 1- 17.
	ZHANG H , QIU D W , FENG Y B , et al. Improved U-Net models and its applications in medical image segmentation: a review. Laser & Optoelectronics Progress, 2022, 59 (2): 1- 17.
2	梁礼明, 余洁, 周珑颂, 等. 多尺度密集注意力网络用于视网膜血管分割. 激光与光电子学进展, 2023, 60 (6): 122- 131.
	LIANG L M , YU J , ZHOU L S , et al. Multiscale dense attention network for retinal vessel segmentation. Laser & Optoelectronics Progress, 2023, 60 (6): 122- 131.
3	WANG C L , ODA M , HAYASHI Y , et al. Tensor-cut: a tensor-based graph-cut blood vessel segmentation method and its application to renal artery segmentation. Medical Image Analysis, 2020, 60, 101623. doi: 10.1016/j.media.2019.101623
4	KALAIE S , GOOYA A . Vascular tree tracking and bifurcation points detection in retinal images using a hierarchical probabilistic model. Computer Methods and Programs in Biomedicine, 2017, 151, 139- 149. doi: 10.1016/j.cmpb.2017.08.018
5	JIA D Q , ZHUANG X H . Learning-based algorithms for vessel tracking: a review. Computerized Medical Imaging and Graphics, 2021, 89, 101840. doi: 10.1016/j.compmedimag.2020.101840
6	SUN J D , PENG Y J , GUO Y F , et al. Segmentation of the multimodal brain tumor image used the multi-pathway architecture method based on 3D FCN. Neurocomputing, 2021, 423, 34- 45. doi: 10.1016/j.neucom.2020.10.031
7	RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1505.04597.
8	胡帅, 李华玲, 郝德琛. 改进U-Net的多级边缘增强医学图像分割网络. 计算机工程, 2024, 50 (4): 286- 293. doi: 10.19678/j.issn.1000-3428.0067779
	HU S , LI H L , HAO D C . Improved multistage edge-enhanced medical image segmentation network of U-Net. Computer Engineering, 2024, 50 (4): 286- 293. doi: 10.19678/j.issn.1000-3428.0067779
9	徐晓峰, 黄韫栀, 徐军. 基于各向异性注意力的双分支血管分割模型. 计算机工程, 2024, 50 (1): 348- 356. doi: 10.19678/j.issn.1000-3428.0067078
	XU X F , HUANG Y Z , XU J . Dual-branch vascular segmentation model based on anisotropic attention. Computer Engineering, 2024, 50 (1): 348- 356. doi: 10.19678/j.issn.1000-3428.0067078
10	LISKOWSKI P , KRAWIEC K . Segmenting retinal blood vessels with deep neural networks. IEEE Transactions on Medical Imaging, 2016, 35 (11): 2369- 2380. doi: 10.1109/TMI.2016.2546227
11	LI L Z, VERMA M, NAKASHIMA Y, et al. IterNet: retinal image segmentation utilizing structural redundancy in vessel networks[C]//Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV). Washington D.C., USA: IEEE Press, 2020: 3645-3654.
12	OKTAY O, SCHLEMPER J, FOLGOC L L, et al. Attention UNet: Learning where to look for the pancreas[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1804.03999.
13	LAN Y C , ZHANG X M . Real-time ultrasound image despeckling using mixed-attention mechanism based residual UNet. IEEE Access, 2020, 8, 195327- 195340. doi: 10.1109/ACCESS.2020.3034230
14	GU R , WANG G T , SONG T , et al. CA-Net: comprehensive attention convolutional neural networks for explainable medical image segmentation. IEEE Transactions on Medical Imaging, 2021, 40 (2): 699- 711. doi: 10.1109/TMI.2020.3035253
15	WANG Z K , ZOU Y N , LIU P X . Hybrid dilation and attention residual U-Net for medical image segmentation. Computers in Biology and Medicine, 2021, 134, 104449. doi: 10.1016/j.compbiomed.2021.104449
16	HUANG X, DENG Z, LI D, et al. MISSFormer: an effective medical image segmentation transformer[EB/OL]. [2023-10-05]. https://arxiv.org/abs/2109.07162.
17	HENDRYCKS D, GIMPEL K. Gaussian Error Linear Units (GELUs)[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1606.08415.
18	SAMUEL P M , VEERAMALAI T . VSSC-Net: vessel specific skip chain convolutional network for blood vessel segmentation. Computer Methods and Programs in Biomedicine, 2021, 198, 105769. doi: 10.1016/j.cmpb.2020.105769
19	GUO S . CSGNet: cascade semantic guided net for retinal vessel segmentation. Biomedical Signal Processing and Control, 2022, 78, 103930. doi: 10.1016/j.bspc.2022.103930
20	KIPF T N, WELLING M. Semi-supervised classification with graph convolutional networks[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1609.02907.
21	MENG Y D, WEI M, GAO D X, et al. CNN-GCN aggregation enabled boundary regression for biomedical image segmentation[EB/OL]. [2023-10-05]. https://link.springer.com/chapter/10.1007/978-3-030-59719-1_35.
22	WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[EB/OL]. [2023-10-05]. https://link.springer.com/chapter/10.1007/978-3-030-01234-2_1.
23	VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1706.03762.
24	HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2018: 7132-7141.
25	HENAFF M, BRUNA J, LECUN Y. Deep convolutional networks on graph-structured data[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1506.05163.
26	LI X, YANG Y B, ZHAO Q J, et al. Spatial pyramid based graph reasoning for semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Washington D.C., USA: IEEE Press, 2020: 8947-8956.
27	宋杰, 肖亮, 练智超, 等. 基于深度学习的数字病理图像分割综述与展望. 软件学报, 2021, 32 (5): 1427- 1460.
	SONG J , XIAO L , LIAN Z C , et al. Overview and prospect of deep learning for image segmentation in digital pathology. Journal of Software, 2021, 32 (5): 1427- 1460.
28	ZHOU Z W , SIDDIQUEE M M R , TAJBAKHSH N , et al. UNet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Transactions on Medical Imaging, 2020, 39 (6): 1856- 1867. doi: 10.1109/TMI.2019.2959609
29	LADDERNET J T Z J. Multi-path networks based on U-Net for medical image segmentation[EB/OL]. [2023-10-05]. https://arxiv.org/abs/1810.07810.
30	WANG W, ZHONG J F, WU H S, et al. RVSegNet: an efficient feature pyramid cascade network for retinal vessel segmentation[EB/OL]. [2023-10-05]. https://link.springer.com/chapter/10.1007/978-3-030-59722-1_77.
31	MARDANI K , MAGHOOLI K . Enhancing retinal blood vessel segmentation in medical images using combined segmentation modes extracted by DBSCAN and morphological reconstruction. Biomedical Signal Processing and Control, 2021, 69, 102837. doi: 10.1016/j.bspc.2021.102837
32	YUAN Y C , ZHANG L , WANG L T , et al. Multi-level attention network for retinal vessel segmentation. IEEE Journal of Biomedical and Health Informatics, 2022, 26 (1): 312- 323. doi: 10.1109/JBHI.2021.3089201
33	YOU Z Y , YU H P , XIAO Z H , et al. CAS-UNet: a retinal segmentation method based on attention. Electronics, 2023, 12 (15): 3359. doi: 10.3390/electronics12153359
34	孙颖, 丁卫平, 黄嘉爽, 等. RCARUNet: 基于粗糙通道注意力机制的视网膜血管分割网络. 计算机研究与发展, 2023, 60 (4): 947- 961.
	SUN Y , DING W P , HUANG J S , et al. RCARUNet: retinal vessels segmentation network based on rough channel attention mechanism. Journal of Computer Research and Development, 2023, 60 (4): 947- 961.
35	LIU Y H , SHEN J , YANG L , et al. ResDOUNet: a deep residual network for accurate retinal vessel segmentation from fundus images. Biomedical Signal Processing and Control, 2023, 79, 104087. doi: 10.1016/j.bspc.2022.104087
36	吕佳, 王泽宇, 梁浩城. 边界注意力辅助的动态图卷积视网膜血管分割. 光电工程, 2023, 50 (1): 33- 45.
	LV J , WANG Z Y , LIANG H C . Boundary attention assisted dynamic graph convolution for retinal vascular segmentation. Opto-Electronic Engineering, 2023, 50 (1): 33- 45.
37	LI X J , DING J Q , TANG J J , et al. Res2UNet: a multi-scale channel attention network for retinal vessel segmentation. Neural Computing and Applications, 2022, 34 (14): 12001- 12015. doi: 10.1007/s00521-022-07086-8
38	PALANIVEL D A , NATARAJAN S , GOPALAKRISHNAN S . Retinal vessel segmentation using multifractal characterization. Applied Soft Computing, 2020, 94, 106439. doi: 10.1016/j.asoc.2020.106439
39	WU H S , WANG W , ZHONG J F , et al. SCS-net: a scale and context sensitive network for retinal vessel segmentation. Medical Image Analysis, 2021, 70, 102025. doi: 10.1016/j.media.2021.102025
40	ZHOU Y Q, YU H C, SHI H. Study group learning: improving retinal vessel segmentation trained with noisy labels[EB/OL]. [2023-10-05]. https://arxiv.org/abs/2103.03451.
41	SHI T Y , BOUTRY N , XU Y C , et al. Local intensity order transformation for robust curvilinear object segmentation. IEEE Transactions on Image Processing, 2022, 31, 2557- 2569. doi: 10.1109/TIP.2022.3155954
42	MOU L, ZHAO Y T, CHEN L, et al. CS-Net: channel and spatial attention network for curvilinear structure segmentation[EB/OL]. [2023-10-05]. https://link.springer.com/chapter/10.1007/978-3-030-32239-7_80.
43	LIU W T , YANG H H , TIAN T , et al. Full-resolution network and dual-threshold iteration for retinal vessel and coronary angiograph segmentation. IEEE Journal of Biomedical and Health Informatics, 2022, 26 (9): 4623- 4634. doi: 10.1109/JBHI.2022.3188710
44	LI Y , ZHANG Y , LIU J Y , et al. Global transformer and dual local attention network via deep-shallow hierarchical feature fusion for retinal vessel segmentation. IEEE Transactions on Cybernetics, 2023, 53 (9): 5826- 5839. doi: 10.1109/TCYB.2022.3194099
45	SHI Z D , LI Y , ZOU H , et al. TCU-Net: transformer embedded in convolutional U-shaped network for retinal vessel segmentation. Sensors, 2023, 23 (10): 4897. doi: 10.3390/s23104897
46	JIANG M S , ZHU Y F , ZHANG X D . CoVi-Net: a hybrid convolutional and vision transformer neural network for retinal vessel segmentation. Computers in Biology and Medicine, 2024, 170, 108047. doi: 10.1016/j.compbiomed.2024.108047

[1]	马淦, 谷雨, 彭冬亮. 结合改进YOLOv5s和动态数据增强的海面舰船检测[J]. 计算机工程, 2025, 51(9): 294-305.
[2]	陈彦如, 刘珂良, 冉茂亮. 基于深度强化学习的外卖即时配送实时优化[J]. 计算机工程, 2025, 51(9): 328-339.
[3]	翟志鹏, 曹阳, 沈琴琴, 施佺. 基于多时空图融合与动态注意力的交通流预测[J]. 计算机工程, 2025, 51(9): 139-148.
[4]	黄金贵, 刘朋, 唐文胜. MMD-YOLOv7:黑暗条件下车辆检测方法[J]. 计算机工程, 2025, 51(9): 340-349.
[5]	符家成, 田瑾, 张玉金, 方志军. 结合前置三元组集的知识图谱推荐[J]. 计算机工程, 2025, 51(9): 101-109.
[6]	倪源松, 韩军, 邹小燕, 胡广怡, 王文帅. 两阶段自适应分块输电线路螺栓缺陷检测方法[J]. 计算机工程, 2025, 51(8): 281-291.
[7]	郝宏达, 罗健旭. 基于多尺度区域特征融合的多器官语义分割模型[J]. 计算机工程, 2025, 51(8): 270-280.
[8]	张昭理, 李家豪, 刘海, 石佛波, 何嘉文. 基于个性化遗忘建模的知识追踪方法[J]. 计算机工程, 2025, 51(8): 120-130.
[9]	闫建红, 刘芝妍, 王震. 融合时空注意力机制的多尺度卷积车辆轨迹预测[J]. 计算机工程, 2025, 51(8): 406-414.
[10]	宋杰, 徐慧英, 朱信忠, 黄晓, 陈晨, 王泽宇. 基于YOLOv8改进的跌倒检测算法: OEF-YOLO[J]. 计算机工程, 2025, 51(7): 127-139.
[11]	彭菊红, 张弛, 高谦, 张光明, 谈栋华, 赵明俊. 基于改进的YOLOv8算法的钢材缺陷检测[J]. 计算机工程, 2025, 51(7): 152-160.
[12]	刘春霞, 孟吉星, 潘理虎, 龚大立. 融合RGB与IR图像的遥感小目标检测方法[J]. 计算机工程, 2025, 51(7): 326-338.
[13]	栾孟娜, 郑秋梅, 王风华. 基于DMC-YOLO的交通标志实时检测算法[J]. 计算机工程, 2025, 51(7): 90-99.
[14]	单鹏畅, 高利剑, 董文龙, 毛启容. 基于显著目标追踪的行为检测方法[J]. 计算机工程, 2025, 51(6): 93-101.
[15]	赵小虎, 谢礼逊, 慕灯聪, 张悦. 基于TCM-YOLO网络的金属表面缺陷检测方法[J]. 计算机工程, 2025, 51(6): 338-348.

选择文件类型/文献管理软件名称

选择包含的内容