基于增量学习的结直肠息肉分割方法

doi:10.19678/j.issn.1000-3428.0069231

摘要/Abstract

摘要：

结直肠内窥镜图像在设备之间的特征分布不同，导致训练模型在新设备上的分割性能降低。为缓解模型对新设备的适应性问题，提出一种基于增量学习的微调方法，以及一种改进的结直肠息肉分割网络CPSegNet。增量学习方法包含预训练和新设备微调2个阶段，预训练使用旧设备的数据对息肉分割网络进行充分训练，微调阶段同时使用新旧设备样本进行训练，并通过采样率和正则化损失函数防止出现灾难性遗忘现象。CPSegNet采用MiT的预训练模型作为骨干网络，多层感知机(MLP)作为解码模块，不确定区域注意力(URA)作为细化模块，对边界模糊区域进行优化。为了验证学习策略对新设备的适应能力，采用Kvasir-SEG、CVC-ClinicDB、CVC-300、CVC-ColonDB、Kvasir-Sessile和ETIS-LaribPolypDB共6个数据集进行实验，其中前2个数据集为训练集，其余4个为新设备的模拟数据。以Dice相似系数和交并比(IoU)作为评价指标。实验结果表明，在无增量学习情况下CPSegNet在新设备上的性能优于主流的算法，以Kvasir-SEG作为源域数据集，将较难分割的ETIS-LaribPolypDB作为目标域数据集时，与ColonFormer算法相比的Dice相似系数提升3百分点，以CVC-ClinicDB作为源域数据集时，提升了6百分点，使用增量学习后CPSegNet和主流算法都能在新设备上获得性能提升，同时保持在旧设备上的分割精度。

关键词: 息肉分割, 增量学习, 迁移学习, 少样本学习, 灾难性遗忘

Abstract:

The feature distributions of colorectal endoscopic images differ among devices, reducing the trained model′s segmentation performance on new devices. To alleviate the model′s adaptability to new devices, a fine-tuning method based on incremental learning and an improved colorectal polyp segmentation network called CPSegNet are proposed. The incremental learning method consists of two stages: pre-training and fine-tuning on new devices. Pre-training uses data from an old device to train the polyp segmentation network adequately, and the fine-tuning stage is trained with samples from both old and new devices. This also includes a sampling rate adjustment and a regularization loss function to prevent catastrophic forgetting. CPSegNet adopts a pre-trained MiT model as the backbone network, a Multi-Layer Perceptron (MLP) as the decoding module, and an Uncertainty Region Attention (URA) mechanism as the refinement module to optimize the ambiguous boundary regions. To validate the adaptability of the learning strategy to new devices, experiments are conducted using six datasets: Kvasir-SEG, CVC-ClinicDB, CVC-300, CVC-ColonDB, Kvasir-Sessile, and ETIS-LaribPolypDB; the first two datasets are used as the training set, and the other four are simulated data for new devices. The experimental results, using the Dice similarity coefficient and Intersection over Union (IoU) metrics as evaluation indicators, demonstrate that the performance of CPSegNet on new devices is superior to that of mainstream algorithms without incremental learning, particularly on the challenging ETIS-LaribPolypDB dataset, showing increases of 3 percentage points in the Dice similarity coefficient compared with the ColonFormer algorithm when Kvasir-SEG is used as the source domain dataset. When CVC-ClinicDB is used as the source domain dataset, the Dice similarity coefficient is improved by 6 percentage points. Furthermore, both CPSegNet and mainstream algorithms exhibit performance improvements on new devices after using incremental learning, while maintaining segmentation accuracy on old devices.

Key words: polyp segmentation, incremental learning, transfer learning, few-shot learning, catastrophic forgetting

逯暄, 景路琪, 彭甫镕. 基于增量学习的结直肠息肉分割方法[J]. 计算机工程, 2025, 51(7): 284-293.

LU Xuan, JING Luqi, PENG Furong. Colorectal Polyp Segmentation Method Based on Incremental Learning[J]. Computer Engineering, 2025, 51(7): 284-293.

https://www.ecice06.com/CN/Y2025/V51/I7/284

图/表 15

图1 息肉分割的增量学习过程

Fig.1 Incremental learning process for polyp segmentation

图2 数据采样策略

Fig.2 Strategy of data sampling

图3 改进的CPSegNet模型结构

Fig.3 Structure of improved CPSegNet model

图4 感受野块结构

Fig.4 Structure of receptive field block

图5 多层感知机解码器

Fig.5 Multi-layer perceptron decoder

图6 不同算法在目标域数据集上的视觉效果对比

Fig.6 Visual effect comparison of different algorithms on target domain dataset

图7 4组消融实验在目标域数据集上的定量结果

Fig.7 Quantitative results of four ablation experiments on target domain datasets

图8 不同算法在源域数据集Kvasir-SEG上的mDice值

Fig.8 The mDice values of different algorithms on source domain dataset Kvasir-SEG

图9 不同算法在源域数据集CVC-ClinicDB上的mDice值

Fig.9 The mDice values of different algorithms on source domain dataset CVC-ClinicDB

图10 不同λ值下的实验结果

Fig.10 Experimental results at different λ value

参考文献 27

1	FUJITA H . AI-based computer-aided diagnosis (AI-CAD): the latest review to read first. Radiological Physics and Technology, 2020, 13 (1): 6- 19.
2	XU A, LI W Q, GUO P F, et al. Closing the generalization gap of cross-silo federated medical image segmentation[C]//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D. C., USA: IEEE Press, 2022: 20834-20843.
3	YANG J Y, AN W Z, WANG S, et al. Label-driven reconstruction for domain adaptation in semantic segmentation[C]//Proceedings of Europe Conference on Computer Vision. Berlin, Germany: Springer, 2020: 480-498.
4	ZHOU K Y , LIU Z W , QIAO Y , et al. Domain generalization: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45 (4): 4396- 4415.
5	张驰名, 王庆凤, 刘志勤, 等. 基于深度迁移学习的肺结节辅助诊断方法. 计算机工程, 2020, 46 (1): 271- 278. doi: 10.19678/j.issn.1000-3428.0053340
	ZHANG C M , WANG Q F , LIU Z Q , et al. Pulmonary nodule auxiliary diagnosis method based on deep transfer learning. Computer Engineering, 2020, 46 (1): 271- 278. doi: 10.19678/j.issn.1000-3428.0053340
6	ZHUANG F Z , QI Z Y , DUAN K Y , et al. A comprehensive survey on transfer learning. Proceedings of the IEEE, 2021, 109 (1): 43- 76.
7	叶世杰, 王永雄. 基于图神经网络的WSI癌症生存预测方法. 光电工程, 2024, 51 (4): 240011.
	Ye S J , Wang Y X . Graph neural network-based WSI cancer survival prediction method. Opto-Electronic Engineering, 2024, 51 (4): 240011.
8	DE LANGE M, ALJUNDI R, MASANA M, et al. A continual learning survey: defying forgetting in classification tasks[EB/OL]. [2023-12-10]. https://arxiv.org/abs/1909.08383v2.
9	HE J P, MAO R Y, SHAO Z M, et al. Incremental learning in online scenario[C]//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D. C., USA: IEEE Press, 2020: 13923-13932.
10	MASANA M , LIU X L , TWARDOWSKI B , et al. Class-incremental learning: survey and performance evaluation on image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45 (5): 5513- 5533.
11	REBUFFI S A, KOLESNIKOV A, SPERL G, et al. iCaRL: incremental classifier and representation learning[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2017: 5533-5542.
12	CASTRO F M, MARIN-JIMENEZ M J, GUIL N, et al. End-to-end incremental learning[C]//Proceedings of Europe Conference on Computer Vision. Berlin, Germany: Springer, 2018: 241-257.
13	WU Y, CHEN Y P, WANG L J, et al. Large scale incremental learning[C]//Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D. C., USA: IEEE Press, 2019: 374-382.
14	FAN D P, JI G P, ZHOU T, et al. PraNet: parallel reverse attention network for polyp segmentation[C]//Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention. Berlin, Germany: Springer, 2020: 263-273.
15	KIM T, LEE H, KIM D. UACANet: uncertainty augmented context attention for polyp segmentation[C]//Proceedings of the 29th ACM International Conference on Multimedia. New York, USA: ACM Press, 2021: 2167-2175.
16	CAI L H, WU M J, CHEN L J, et al. Using guided self-attention with local information for polyp segmentation[C]//Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention. Berlin, Germany: Springer, 2022: 629-638.
17	ZHANG Y D, LIU H Y, HU Q. TransFuse: fusing transformers and CNNs for medical image segmentation[C]//Proceedings of International Conference on Medical Image Computing and Computer Assisted Intervention. Berlin, Germany: Springer, 2021: 14-24.
18	DUC N T , OANH N T , THUY N T , et al. ColonFormer: an efficient transformer based method for colon polyp segmentation. IEEE Access, 2022, 10, 80575- 80586.
19	LI Z Z , HOIEM D . Learning without forgetting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40 (12): 2935- 2947.
20	RANNEN A, ALJUNDI R, BLASCHKO M B, et al. Encoder based lifelong learning[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Washington D. C., USA: IEEE Press, 2017: 1329-1337.
21	KIRKPATRICK J , PASCANU R , RABINOWITZ N , et al. Overcoming catastrophic forgetting in neural networks. Proceedings of the National Academy of Sciences of the United States of America, 2017, 114 (13): 3521- 3526.
22	HOSPEDALES T , ANTONIOU A , MICAELLI P , et al. Meta-learning in neural networks: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44 (9): 5149- 5169.
23	TING C, SIMON K, MOHAMMAD N, et al. A simple framework for contrastive learning of visual representations[C]//Proceedings of the 37th International Conference on Machine Learning. New York, USA: ACM Press, 2020: 1597-1607.
24	任莉莉, 边璇, 王光磊, 等. 基于深度学习的息肉分割网络GLIA-Net. 计算机工程, 2022, 48 (12): 248- 254. doi: 10.19678/j.issn.1000-3428.0062860
	REN L L , BIAN X , WANG G L , et al. Polyp segmentation network GLIA-Net based on deep learning. Computer Engineering, 2022, 48 (12): 248- 254. doi: 10.19678/j.issn.1000-3428.0062860
25	WANG W H, XIE E Z, LI X, et al. Pyramid vision transformer: a versatile backbone for dense prediction without convolutions[C]//Proceedings of 2021 IEEE/CVF International Conference on Computer Vision. Washington D. C., USA: IEEE Press, 2021: 568-578.
26	WANG W H , XIE E Z , LI X , et al. PVTv2: improved baselines with pyramid vision transformer. Computational Visual Media, 2022, 8 (3): 415- 424.
27	XIE E Z, WANG W H, YU Z D, et al. SegFormer: simple and efficient design for semantic segmentation with transformers[EB/OL].[2023-12-10]. https://arxiv.org/abs/2105.15203.

[1]	廖丁丁, 刘俊峰, 曾君, 邱晓欢. 一种基于块平均正交权重修正的连续学习算法[J]. 计算机工程, 2025, 51(6): 57-64.
[2]	张欢, 王晨, 单景东, 仇润鹤. 基于领域自适应与注意力机制的电梯安全风险预测[J]. 计算机工程, 2025, 51(2): 86-93.
[3]	易鹏, 杨晔, 严仕嘉. 基于MPCNN模型的sEMG快速迁移学习的手势识别应用研究[J]. 计算机工程, 2025, 51(1): 304-311.
[4]	丁伟杰, 顾斌杰, 潘丰. 增量式稀疏密度加权孪生支持向量回归机[J]. 计算机工程, 2024, 50(7): 123-132.
[5]	李致金, 汤佳辉, 闫金凤. 基于边缘计算的轻量化识别方法[J]. 计算机工程, 2024, 50(6): 287-295.
[6]	包善书, 车波, 邓林红. 基于双源域迁移学习的肺音信号识别[J]. 计算机工程, 2023, 49(9): 295-302, 312.
[7]	王爱玲, 马文臻, 邹自明, 钟佳. 基于领域自适应的卫星工程参数异常检测[J]. 计算机工程, 2023, 49(5): 29-37,47.
[8]	白俊卿, 韩柏迅, 张丰侠. 基于深度学习的无人机图像语义分割算法研究[J]. 计算机工程, 2023, 49(4): 233-239.
[9]	郑云涛, 叶家炜. 基于茫然传输协议的FATE联邦迁移学习方案[J]. 计算机工程, 2023, 49(2): 24-30.
[10]	刘强, 张颖, 周卫祥, 蒋先涛, 周薇娜, 周谋国. 自适应类增量学习的物联网入侵检测系统[J]. 计算机工程, 2023, 49(2): 169-174.
[11]	王雨燕, 赵佳鹏, 时金桥, 申立艳, 刘洪梦, 杨燕燕. 暗网网页用户身份信息聚合方法[J]. 计算机工程, 2023, 49(11): 187-194, 210.
[12]	伍洲, 杨寒石, 邬俊俊, 张海军, 宋晴. 进化迁移优化算法综述[J]. 计算机工程, 2023, 49(1): 1-14.
[13]	邱鸿辉, 刘海林, 陈磊. 基于协方差矩阵调整的多目标多任务优化算法[J]. 计算机工程, 2022, 48(8): 306-312.
[14]	吴正江, 张亚宁, 张真, 梅秋雨, 杨天. 拟单层覆盖粗糙集中近似集的增量更新算法[J]. 计算机工程, 2022, 48(6): 200-206,212.
[15]	任莉莉, 边璇, 王光磊, 王洪瑞. 基于深度学习的息肉分割网络GLIA-Net[J]. 计算机工程, 2022, 48(12): 248-254.

选择文件类型/文献管理软件名称

选择包含的内容