Electrode Microscopic Image Segmentation Method by Fusing Multi-layer Perceptual Attention

doi:10.19678/j.issn.1000-3428.0067208

Abstract:

To address blurred material edges, artifacts, and uneven grayscale in electrode microscopic images of NO_x sensors, an electrode microscopic image semantic segmentation method that fuses multi-layer perceptual attention is proposed, with U-Net as the base model. First, a 3×3 convolution is applied to the output feature maps at different scales of the U-Net encoder to reduce their dimensionality, and bilinear interpolation is used to unify the feature scales, achieving multi-scale feature fusion that enhances feature extraction and compensates for the feature loss caused by encoder downsampling. Second, spatial pyramid pooling is added to extract multi-scale information, with a 1×1 convolution to reduce computation, and a multi-layer perceptual attention module is proposed to capture the spatial-position and channel dependencies of the backbone feature map and the feature map with enhanced semantic information. Finally, the similarity relationship between feature maps carrying different semantic information is computed and combined with the cross-entropy loss to give a loss function capable of capturing spatial similarity; it supervises key information during training, helps the backbone feature map learn spatial position information, and enhances segmentation performance. The experimental results show that the proposed method achieves a Mean Pixel Accuracy (MPA) of 96.75%, a Mean Intersection over Union (MIoU) of 94.04%, and a Micro-F1 score of 96.92%, with 7.78×10⁹ floating-point operations (FLOPs) and 8.08×10⁶ network parameters. Compared with models such as U-Net and SegNet, the proposed method effectively alleviates edge blurring and material artifacts at the cost of a small increase in model complexity, while capturing spatial position and channel information, preserving image detail, and improving segmentation accuracy.
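The three accuracy figures quoted in the abstract (MPA, MIoU, Micro-F1) can all be derived from a pixel-level confusion matrix. The following is a minimal sketch using the standard textbook definitions; the function name `segmentation_metrics` and the matrix layout are assumptions made for illustration, and the paper's own evaluation code may differ in detail.

```python
def segmentation_metrics(confusion):
    """Compute MPA, MIoU and micro-F1 from a confusion matrix.

    Assumed layout: confusion[i][j] is the number of pixels whose true
    class is i and whose predicted class is j.
    """
    n = len(confusion)
    per_class_acc = []
    per_class_iou = []
    tp_sum = fp_sum = fn_sum = 0
    for k in range(n):
        tp = confusion[k][k]
        true_k = sum(confusion[k])                       # pixels labelled k
        pred_k = sum(confusion[i][k] for i in range(n))  # pixels predicted k
        fn = true_k - tp
        fp = pred_k - tp
        union = tp + fp + fn
        per_class_acc.append(tp / true_k if true_k else 0.0)
        per_class_iou.append(tp / union if union else 0.0)
        tp_sum += tp
        fp_sum += fp
        fn_sum += fn
    denom = 2 * tp_sum + fp_sum + fn_sum
    return {
        "mpa": sum(per_class_acc) / n,    # mean of per-class pixel accuracy
        "miou": sum(per_class_iou) / n,   # mean of per-class IoU
        "micro_f1": 2 * tp_sum / denom if denom else 0.0,
    }


# Hypothetical two-class example with unbalanced classes.
metrics = segmentation_metrics([[90, 10], [2, 8]])
print(metrics)
```

When every pixel carries exactly one label and all classes are counted, micro-F1 equals overall pixel accuracy, which is why it can differ from MPA on unbalanced data.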

Key words: electrode, microscopic image, NO_x sensor, semantic segmentation, perceptual attention
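The scale-unification step described in the abstract (bilinear interpolation bringing encoder outputs of different sizes to one size before fusion) can be illustrated with a small pure-Python sketch. It is not the authors' implementation: the 3×3 convolution for dimensionality reduction is omitted, each feature map is treated as a single channel, the corner-aligned sampling convention is an assumption, and the names `bilinear_resize` and `fuse_scales` are hypothetical.

```python
def bilinear_resize(fmap, out_h, out_w):
    """Resize a 2-D map (list of rows) with bilinear interpolation.

    Assumption: corner-aligned sampling, i.e. the input corners map
    exactly onto the output corners.
    """
    in_h, in_w = len(fmap), len(fmap[0])
    out = []
    for i in range(out_h):
        y = i * (in_h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, in_h - 1)
        wy = y - y0
        row = []
        for j in range(out_w):
            x = j * (in_w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, in_w - 1)
            wx = x - x0
            # Interpolate along x on the two neighbouring rows, then along y.
            top = fmap[y0][x0] * (1 - wx) + fmap[y0][x1] * wx
            bottom = fmap[y1][x0] * (1 - wx) + fmap[y1][x1] * wx
            row.append(top * (1 - wy) + bottom * wy)
        out.append(row)
    return out


def fuse_scales(feature_maps, out_h, out_w):
    """Resize every single-channel map to (out_h, out_w) and stack them as channels."""
    return [bilinear_resize(f, out_h, out_w) for f in feature_maps]


# Hypothetical encoder outputs at two scales, fused at 3x3 resolution.
fused = fuse_scales([[[0, 2], [4, 6]], [[7]]], 3, 3)
print(fused[0])
```

Stacking the resized maps as channels is only one way to fuse them; element-wise addition after matching channel counts would be another, and the abstract does not say which the paper uses.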

Wei XU, Xiaowei FU, Xi LI, Yaokun WANG. Electrode Microscopic Image Segmentation Method by Fusing Multi-layer Perceptual Attention[J]. Computer Engineering (计算机工程), 2024, 50(1): 329-338.

https://www.ecice06.com/CN/Y2024/V50/I1/329

Figures/Tables: 13

Fig.1 Overall structure of the proposed segmentation network

Fig.2 Overall structure of MLPA

Fig.3 Structure of PPA

Fig.4 Structure of PCA

Fig.5 Positions at which each loss function is applied

Fig.6 Procedure of the semantic segmentation experiment

Fig.7 Segmentation results of each network model

Fig.8 Segmentation results of each module

Fig.9 Training loss of each module combination


