结合多尺度特征融合和改进ViT的细胞计数方法

doi:10.19678/j.issn.1000-3428.0070281

计算机工程 ›› 2026, Vol. 52 ›› Issue (5): 203-215. doi: 10.19678/j.issn.1000-3428.0070281

• 计算机视觉与图形图像处理 • 上一篇下一篇

结合多尺度特征融合和改进ViT的细胞计数方法

田辉, 段鑫龙, 郝琪雅, 隋文灏, 马裕莹, 虞祖华, 徐杨, 曹仰杰*()

郑州大学网络空间安全学院, 河南郑州 450000

收稿日期:2024-08-22 修回日期:2024-10-25 出版日期:2026-05-15 发布日期:2024-12-18
通讯作者: 曹仰杰
作者简介:
田辉, 男, 讲师、博士, 主研方向为计算机视觉、医学图像分割、数字图像处理
段鑫龙, 硕士研究生
郝琪雅, 硕士研究生
隋文灏, 硕士研究生
马裕莹, 硕士研究生
虞祖华, 硕士研究生
徐杨, 硕士研究生
曹仰杰(通信作者), 教授、博士
基金资助:
河南省自然科学基金(242300421474); 河南省科技攻关项目(222102310547)

Cell Counting Method Combining Multi-Scale Feature Fusion and Improved ViT

TIAN Hui, DUAN Xinlong, HAO Qiya, SUI Wenhao, MA Yuying, YU Zuhua, XU Yang, CAO Yangjie*()

School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou 450000, Henan, China

Received:2024-08-22 Revised:2024-10-25 Online:2026-05-15 Published:2024-12-18
Contact: CAO Yangjie

摘要/Abstract

摘要：

细胞计数是临床医学研究中的常见任务之一, 在生物学和临床医学领域发挥着十分重要的作用。细胞计数任务中存在细胞重叠等情况, 导致很多计数方法将多个细胞统计为单个, 从而造成细胞计数精度降低的问题。为此, 引入U-Net医学图像分割模型并对其进行改进, 提出一种结合改进ViT(Vision Transformer)模块和多尺度特征融合的细胞计数方法, 该计数方法包括4个部分, 分别为提取深层特征的编码器、用于拼接编码器特征和解码器特征的多尺度特征融合模块、捕获全局上下文信息的改进ViT模块和用于恢复特征尺寸并输出分割结果的解码器。其中, 改进ViT模块利用新颖的空间注意力模块和通道注意力模块, 解决了传统ViT在提取特定空间和通道维度信息时能力不足的问题。多尺度特征融合模块将不同尺度的特征图进行融合, 提高模型分割不同尺寸细胞的边界的能力, 降低细胞重叠对计数精度造成的影响。此外, 为了进一步提高模型分割重叠细胞的能力, 提出一种数据增强策略, 通过将原始细胞标注转换为一定半径的圆形标注, 调整细胞标注之间的距离, 从而指导模型更好地将存在重叠的细胞进行分离。在LiveCell、MBM cells和DCC数据集上进行实验, 结果表明, 所提计数方法取得了较好的结果, 有效解决了由细胞重叠引起的计数精度降低的问题。

关键词: 细胞计数, 医学图像分割, 多尺度特征融合, Vision Transformer, 通道注意力模块, 空间注意力模块

Abstract:

Cell counting is common in clinical medical research and plays a crucial role in biology and clinical medicine. In situations where cells overlap, multiple cells may be counted as a single one, causing counting accuracy to decrease. To address this issue, this paper introduces an improved U-Net medical image segmentation model. The paper proposes a cell counting method combining an improved Vision Transformer (ViT) module and multi-scale feature fusion. This counting method comprises four parts: an encoder for extracting deep features, a multi-scale feature fusion module for concatenating encoder and decoder features, an improved ViT module for capturing global context information, and a decoder for restoring feature dimensions and outputting the segmentation results. The improved ViT module utilizes novel spatial and channel attention modules to address the insufficiency of traditional ViT in extracting specific spatial and channel dimensional information. The multi-scale feature fusion module integrates feature maps of different scales, enhancing the ability of the model to segment the boundaries of cells of different sizes and reducing the impact of cell overlap on counting accuracy. To further improve the ability of the model to segment overlapping cells, the paper proposes a data augmentation strategy. By converting the original cell annotations into circular annotations with a specific radius and adjusting the distance between the cell annotations, this strategy guides the model to separate overlapping cells more effectively. Experiments on the LiveCell, MBM cells, and DCC datasets demonstrate that the proposed counting method achieves good results, effectively addressing the issue of decreased counting accuracy caused by cell overlap.

Key words: cell counting, medical image segmentation, multi-scale feature fusion, Vision Transformer (ViT), channel attention module, spatial attention module

田辉, 段鑫龙, 郝琪雅, 隋文灏, 马裕莹, 虞祖华, 徐杨, 曹仰杰. 结合多尺度特征融合和改进ViT的细胞计数方法[J]. 计算机工程, 2026, 52(5): 203-215.

TIAN Hui, DUAN Xinlong, HAO Qiya, SUI Wenhao, MA Yuying, YU Zuhua, XU Yang, CAO Yangjie. Cell Counting Method Combining Multi-Scale Feature Fusion and Improved ViT[J]. Computer Engineering, 2026, 52(5): 203-215.

https://www.ecice06.com/CN/Y2026/V52/I5/203

图/表 20

图1 本文计数方法的整体工作流程

Fig.1 The overall working flow of the counting method in this paper

图2 本文模型结构

Fig.2 The structure of the model in this paper

图3 空间注意力模块和通道注意力模块的结构

Fig.3 The structure of spatial attention module and channel attention module

图4 多尺度特征融合模块结构

Fig.4 Multi-scale feature fusion module structure

图5 本文模型在LiveCell数据集上的分割结果

Fig.5 Segmentation results of the model in this paper on the LiveCell dataset

图6 本文模型在MBM cells数据集上的分割结果

Fig.6 Segmentation results of the model in this paper on the MBM cells dataset

图7 本文模型在DCC数据集上的分割结果

Fig.7 Segmentation results of the model in this paper on the DCC dataset

图8 LiveCell数据集上的分割结果对比

Fig.8 Comparison of segmentation results on the LiveCell dataset

图9 MBM cells数据集上的分割结果对比

Fig.9 Comparison of segmentation results on the MBM cells dataset

图10 DCC数据集上的分割结果对比

Fig.10 Comparison of segmentation results on the DCC dataset

图11 LiveCell数据集上的计数结果视觉对比

Fig.11 Visual comparison of counting results on the LiveCell dataset

图12 MBM cells数据集上的计数结果视觉对比

Fig.12 Visual comparison of counting results on the MBM cells dataset

图13 DCC数据集上的计数结果视觉对比

Fig.13 Visual comparison of counting results on the DCC dataset

图14 融合4层特征的多尺度特征融合模块结构

Fig.14 Multi-scale feature fusion module structure integrating four-layer features

参考文献 33

1	MOHAMED M, FAR B. An enhanced threshold based technique for white blood cells nuclei automatic segmentation[C]//Proceedings of the 14th IEEE International Conference on e-Health Networking, Applications and Services (Healthcom). Washington D.C., USA: IEEE Press, 2012: 202-207.
2	HARI J, PRASAD A S, RAO S K. Separation and counting of blood cells using geometrical features and distance transformed watershed[C]//Proceedings of the 2nd International Conference on Devices, Circuits and Systems (ICDCS). Washington D.C., USA: IEEE Press, 2014: 1-5.
3	LU W H , CHEN X J , ZHU W F , et al. Automatic choroid cells segmentation and counting based on approximate convexity and concavity of chain code in fluorescence microscopic image. Medical Imaging 2015: Digital Pathology, 2015, 9420, 942010.
4	ALBARADEI S A , NAPOLITANO F , ULUDAG M , et al. Automated counting of colony forming units using deep transfer learning from a model for congested scenes analysis. IEEE Access, 2020, 8, 164340- 164346. doi: 10.1109/ACCESS.2020.3021656
5	DAVE P , ALAHMARI S , GOLDGOF D , et al. An adaptive digital stain separation method for deep learning-based automatic cell profile counts. Journal of Neuroscience Methods, 2021, 354, 109102. doi: 10.1016/j.jneumeth.2021.109102
6	LAVITT F , RIJLAARSDAM D J , VAN DER LINDEN D , et al. Deep learning and transfer learning for automatic cell counting in microscope images of human cancer cell lines. Applied Sciences, 2021, 11 (11): 4912. doi: 10.3390/app11114912
7	RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[EB/OL]. [2024-07-05]. https://arxiv.org/abs/1505.04597.
8	DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[EB/OL]. [2024-07-05]. https://arxiv.org/abs/2010.11929.
9	JI X Q, LI Y, CHENG J Z, et al. Cell image segmentation based on an improved watershed algorithm[C]//Proceedings of the 8th International Congress on Image and Signal Processing (CISP). Washington D.C., USA: IEEE Press, 2016: 433-437.
10	OYEBODE K O , TAPAMO J R . Automatic segmentation of cell images by improved graph cut-based approach. Journal of Biomimetics, Biomaterials and Biomedical Engineering, 2016, 29, 74- 80. doi: 10.4028/www.scientific.net/JBBBE.29.74
11	AL-KOFAHI Y , ZALTSMAN A , GRAVES R , et al. A deep learning-based algorithm for 2-D cell segmentation in microscopy images. BMC Bioinformatics, 2018, 19 (1): 365. doi: 10.1186/s12859-018-2375-z
12	LU Y , QIN X J , FAN H Y , et al. WBC-Net: a white blood cell segmentation network based on UNet++ and ResNet. Applied Soft Computing, 2021, 101, 107006. doi: 10.1016/j.asoc.2020.107006
13	ZHOU Z W, RAHMAN S M M, TAJBAKHSH N, et al. UNet++: a nested U-Net architecture for medical image segmentation[EB/OL]. [2024-07-05]. https://arxiv.org/abs/1807.10165.
14	HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Washington D.C., USA: IEEE Press, 2016: 770-778.
15	XIAO Z , DU M , LIU J J , et al. EA-UNet based segmentation method for OCT image of uterine cavity. Photonics, 2023, 10 (1): 73. doi: 10.3390/photonics10010073
16	KOTHARI S, CHAUDRY Q, WANG M D. Automated cell counting and cluster segmentation using concavity detection and ellipse fitting techniques[C]//Proceedings of the IEEE International Symposium on Biomedical Imaging: From Nano to Macro. Washington D.C., USA: IEEE Press, 2009: 795-798.
17	林跃飞. 基于显微图像的细胞计数方法研究及分析系统的构建[D]. 泉州: 华侨大学, 2022.
	LIN Y F. Research on cell counting method and construction of analysis system based on microscopic image[D]. Quanzhou: Huaqiao University, 2022. (in Chinese)
18	刘钊. 基于U-Net++神经网络模型的细胞计数方法研究与应用[D]. 杭州: 浙江理工大学, 2023.
	LIU Z. Research and application of cell counting method based on U-Net++ neural network model[D]. Hangzhou: Zhejiang Sci-Tech University, 2023. (in Chinese)
19	MORELLI R , CLISSA L , AMICI R , et al. Automating cell counting in fluorescent microscopy through deep learning with C-ResUnet. Scientific Reports, 2021, 11, 22920. doi: 10.1038/s41598-021-01929-5
20	VIZCAÍNO A , SÁNCHEZ-CRUZ H , SOSSA H , et al. Neuron cell count with deep learning in highly dense hippocampus images. Expert Systems with Applications, 2022, 208, 118090. doi: 10.1016/j.eswa.2022.118090
21	LEMPITSKY V, ZISSERMAN A. Learning to count objects in images[EB/OL]. [2024-07-05]. https://proceedings.neurips.cc/paper_files/paper/2010/file/fe73f687e5bc5280214e0486b273a5f9-Paper.pdf.
22	丁媛媛. 基于密度回归估计的细胞计数方法研究[D]. 济南: 山东师范大学, 2023.
	DING Y Y. Research on cell counting method based on density regression estimation[D]. Jinan: Shandong Normal University, 2023. (in Chinese)
23	HE S H , MINN K T , SOLNICA-KREZEL L , et al. Deeply-supervised density regression for automatic cell counting in microscopy images. Medical Image Analysis, 2021, 68, 101892. doi: 10.1016/j.media.2020.101892
24	GUO Y , KRUPA O , STEIN J , et al. SAU-Net: a unified network for cell counting in 2D and 3D microscopy images. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19 (4): 1920- 1932. doi: 10.1109/TCBB.2021.3089608
25	LIU R , ZHU Y D , WU C , et al. Interactive dual network with adaptive density map for automatic cell counting. IEEE Transactions on Automation Science and Engineering, 2024, 21 (4): 6731- 6743. doi: 10.1109/TASE.2023.3329973
26	SUN G Q , PAN Y Z , KONG W K , et al. DA-TransUNet: integrating spatial and channel dual attention with transformer U-Net for medical image segmentation. Frontiers in Bioengineering and Biotechnology, 2024, 12, 1398237.
27	张杨, 江松林, 朱松豪. 多尺度特征融合和上下文混合注意力机制的单图像去模糊. 小型微型计算机系统, 2025, 46 (11): 2633- 2642.
	ZHANG Y , JIANG S L , ZHU S H . Single image deblurring with multi-scale feature fusion and context hybrid attention mechanism. Journal of Chinese Computer Systems, 2025, 46 (11): 2633- 2642.
28	XU Z H , TIAN B , LIU S J , et al. Collaborative attention guided multi-scale feature fusion network for medical image segmentation. IEEE Transactions on Network Science and Engineering, 2024, 11 (2): 1857- 1871. doi: 10.1109/TNSE.2023.3332810
29	栾方军, 龚琪, 袁帅. 基于注意力机制和多尺度融合的人群计数网络. 计算机工程, 2025, 51 (3): 352- 361. doi: 10.19678/j.issn.1000-3428.0069071
	LUAN F J , GONG Q , YUAN S . Crowd counting network based on attention mechanism and multiscale fusion. Computer Engineering, 2025, 51 (3): 352- 361. doi: 10.19678/j.issn.1000-3428.0069071
30	ZHAO R J, QIAN B Y, ZHANG X L, et al. Rethinking dice loss for medical image segmentation[C]//Proceedings of the IEEE International Conference on Data Mining (ICDM). Washington D.C., USA: IEEE Press, 2021: 851-860.
31	EDLUND C , JACKSON T R , KHALID N , et al. LiveCell—a large-scale dataset for label-free live cell segmentation. Nature Methods, 2021, 18 (9): 1038- 1045.
32	COHEN J P, BOUCHER G, GLASTONBURY C A, et al. Count-ception: counting by fully convolutional redundant counting[C]//Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW). Washington D.C., USA: IEEE Press, 2018: 18-26.
33	MARSDEN M, MCGUINNESS K, LITTLE S, et al. People, penguins and Petri dishes: adapting object counting models to new visual domains and object types without forgetting[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2018: 8070-8079.

[1]	李辉, 刘佳煜, 徐雅萍. 面向医学图像分割的深度学习模型架构与性能评估方法综述[J]. 计算机工程, 2026, 52(5): 81-94.
[2]	吴雪松, 陈媛媛, 周涛. 基于多尺度金字塔池化的自适应无参考图像质量评价[J]. 计算机工程, 2026, 52(3): 107-118.
[3]	闫建红, 刘芝妍, 王震. 融合时空注意力机制的多尺度卷积车辆轨迹预测[J]. 计算机工程, 2025, 51(8): 406-414.
[4]	郝宏达, 罗健旭. 基于多尺度区域特征融合的多器官语义分割模型[J]. 计算机工程, 2025, 51(8): 270-280.
[5]	栾孟娜, 郑秋梅, 王风华. 基于DMC-YOLO的交通标志实时检测算法[J]. 计算机工程, 2025, 51(7): 90-99.
[6]	李白芽. 基于CNN-Transformer的电子喉镜病灶及器官分割网络[J]. 计算机工程, 2025, 51(6): 327-337.
[7]	栾方军, 龚琪, 袁帅. 基于注意力机制和多尺度融合的人群计数网络[J]. 计算机工程, 2025, 51(3): 352-361.
[8]	许明, 屈泰澎, 姜彦吉. 改进YOLOv7在复杂场景下的交通标志检测算法[J]. 计算机工程, 2025, 51(2): 335-343.
[9]	朱子斌, 李千林, 张小燕, 韩双双. 基于双向时空特征学习的人体深度图像估计[J]. 计算机工程, 2025, 51(10): 258-269.
[10]	蒋宏贵, 胡冀苏, 钱旭升, 郑毅, 周志勇, 戴亚康. 基于联合学习和多级小波特征金字塔的MRI-TRUS图像配准方法[J]. 计算机工程, 2025, 51(10): 270-283.
[11]	刘兆伟, 方艳红, 郑明宇, 锁斌. 基于注意力机制与多任务的肺部疾病诊断方法[J]. 计算机工程, 2025, 51(1): 332-342.
[12]	郭敏, 张熙涵, 李阳. 融合注意力的教师互一致性半监督医学图像分割[J]. 计算机工程, 2024, 50(9): 313-323.
[13]	李俊仪, 李向阳, 龙朝勋, 李海燕, 李红松, 余鹏飞. 基于多级区域选择与跨层特征融合的野生菌分类[J]. 计算机工程, 2024, 50(9): 179-188.
[14]	高爽, 史轶伦, 徐巧枝, 于磊. 基于对比学习的非对称编解码结构的心脏MRI分割研究[J]. 计算机工程, 2024, 50(8): 290-300.
[15]	杨硕, 王一丁. 基于改进薄板样条运动模型的人脸动画算法[J]. 计算机工程, 2024, 50(6): 255-265.

选择文件类型/文献管理软件名称

选择包含的内容

结合多尺度特征融合和改进ViT的细胞计数方法

Cell Counting Method Combining Multi-Scale Feature Fusion and Improved ViT

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 20

参考文献 33

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

结合多尺度特征融合和改进ViT的细胞计数方法

Cell Counting Method Combining Multi-Scale Feature Fusion and Improved ViT

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 20

参考文献 33

相关文章 15

编辑推荐

Metrics

本文评价