嵌入SENet结构的改进YOLOV3目标识别算法

doi:10.19678/j.issn.1000-3428.0052861

计算机工程 ›› 2019, Vol. 45 ›› Issue (11): 243-248. doi: 10.19678/j.issn.1000-3428.0052861

嵌入SENet结构的改进YOLOV3目标识别算法

刘学平¹, 李玙乾^1,2, 刘励^1,2, 王哲^1,2, 刘宇³

1. 清华大学深圳研究生院, 广东深圳 518055;
2. 清华大学机械工程系, 北京 100084;
3. 长虹智能制造技术有限公司, 成都 621000

收稿日期:2018-10-12 修回日期:2018-11-23 发布日期:2018-12-04
作者简介:刘学平(1965-),男,副教授、博士,主研方向为目标识别、智能控制;李玙乾、刘励、王哲,硕士研究生;刘宇,工程师。
基金资助:
国家自然科学基金（51475263）。

Improved YOLOV3 Target Recognition Algorithm with Embedded SENet Structure

LIU Xueping¹, LI Yuqian^1,2, LIU Li^1,2, WANG Zhe^1,2, LIU Yu³

1. Graduate School at Shenzhen, Tsinghua University, Shenzhen, Guangdong 518055, China;
2. Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China;
3. Changhong Intelligent Manufacturing Co., Ltd., Chengdu 621000, China

Received:2018-10-12 Revised:2018-11-23 Published:2018-12-04

摘要/Abstract

摘要： 为准确识别工业图像中的目标零件，提出一种改进的YOLOV3目标识别算法。结合K-means聚类与粒子群优化算法进行锚框计算，以降低初始点对聚类结果的影响，加快算法收敛速度。同时在YOLOV3网络shortcut层嵌入SENet结构，得到SE-YOLOV3网络。对零件图像进行数据增强并加入零件标注，制作包含10 816张图片的样本集，用于算法训练和测试。实验结果表明，该算法能够获得平均交并比为83.01%的锚框，当样本图像存在较多残缺零件干扰时，YOLOV3存在将背景识别为零件的情况，其查准率与查全率分别为72.11%和97.51%，而SE-YOLOV3能有效减少假正例数量，其查准率与查全率分别为90.39%和93.25%。

关键词: 目标识别, 卷积神经网络, SENet结构, YOLOV3网络, 粒子群优化算法

Abstract: In order to accurately identify the target parts in the industrial image,an improved YOLOV3 target recognition algorithm is proposed.The K-means clustering and particle swarm optimization algorithm are combined to calculate the anchor box to reduce the influence of the initial point on the clustering result and speed up the convergence of the algorithm.The SE-YOLOV3 network is obtained by embedding the SENet structure after the shortcut layers.A sample set containing 10 816 images is created by collecting the part images and enhancing the data while labeling the part in the image,which is used to train and test the network.Experimental results show that the proposed algorithm can obtain an anchor box with an average IoU of 83.01%.When there are more defective parts in the sample image,YOLOV3 might identify the background as a part,and the precision and recall rate are 72.11% and 97.51% respectively,while the SE-YOLOV3 can accurately identify target parts,whose precision and recall rate are 90.39% and 93.25% respectively.

Key words: target recognition, Convolutional Neural Network(CNN), SENet structure, YOLOV3 network, Particle Swarm Optimization(PSO) algorithm

中图分类号:

TP753

刘学平, 李玙乾, 刘励, 王哲, 刘宇. 嵌入SENet结构的改进YOLOV3目标识别算法[J]. 计算机工程, 2019, 45(11): 243-248.

LIU Xueping, LI Yuqian, LIU Li, WANG Zhe, LIU Yu. Improved YOLOV3 Target Recognition Algorithm with Embedded SENet Structure[J]. Computer Engineering, 2019, 45(11): 243-248.

https://www.ecice06.com/CN/Y2019/V45/I11/243

图/表 11

20191118171431

20191118171434

20191118171438

20191118171440

20191118171443

20191118171446

20191118171449

20191118171452

20191118171455

20191118171459

20191118171501

参考文献

[1] 翟俊海,张素芳,郝璞.卷积神经网络及其研究进展[J].河北大学学报(自然科学版),2017,37(6):640-651.
[2] 周飞燕,金林鹏,董军.卷积神经网络研究综述[J].计算机学报,2017,40(6):1229-1251.
[3] 袁安富,曹金燕,余莉.一种基于SURF特征的零件识别算法[J].计算机应用与软件,2015,32(1):186-189.
[4] GIRSHICK R.Fast R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2015:1440-1448.
[5] REN Shaoqing,HE Kaiming,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[6] LIU Wei,ANGUELOV D,ERHAN D,et al.SSD:single shot multibox detector[C]//Proceedings of European Conference on Computer Vision.Berlin,Germany:Springer,2016:21-37.
[7] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unified,real time object detection[C]//Proceedings of Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:779-788.
[8] REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2017:6517-6525.
[9] REDMON J.YOLOV3:an incremental improvement[EB/OL].[2018-09-30].https://pjreddie.com/darknet/yolo/.
[10] TAO Jing,WANG Hongbo,ZHANG Xinyu,et al.An object detection system based on YOLO in traffic scene[C]//Proceedings of the 6th International Conference on Computer Science and Network Technology.Washington D.C.,USA:IEEE Press,2017:315-319.
[11] 郑志强,刘妍妍,潘长城,等.改进YOLOV3遥感图像飞机识别应用[J].电光与控制,2018,6(9):15-19.
[12] HU Jie,SHEN Li,ALBANIE S,et al.Squeeze and excitation networks[EB/OL].[2018-09-30].https://arxiv.org/abs/1709.01507.
[13] 杨淑莹.模式识别与智能计算[M].北京:电子工业出版社,2008.
[14] 彭刚,杨诗琪,黄心汉,等.改进的基于区域卷积神经网络的微操作系统目标检测方法[J].模式识别与人工智能,2018,31(2):142-149.
[15] 周志华.机器学习[M].北京:清华大学出版社,2015.

选择文件类型/文献管理软件名称

选择包含的内容

嵌入SENet结构的改进YOLOV3目标识别算法

Improved YOLOV3 Target Recognition Algorithm with Embedded SENet Structure

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	王志浩, 钱沄涛. 基于Swin Transformer的双流遥感图像时空融合超分辨率重建[J]. 计算机工程, 2024, 50(9): 33-45.
[2]	李俊俊, 董建刚, 李坤. 基于Kubernetes的集群节能策略研究[J]. 计算机工程, 2024, 50(9): 82-91.
[3]	张鲁, 田春伟, 宋焕生, 刘侍刚. 用于低剂量CT图像去噪的多级双树复小波网络[J]. 计算机工程, 2024, 50(9): 266-275.
[4]	高煜宝, 文志诚. 基于注意力机制的双路解码器图像去噪方法[J]. 计算机工程, 2024, 50(9): 324-332.
[5]	王蕾, 党时鹏, 潘丰. 基于卷积神经网络的隐匿性旁路预测模型[J]. 计算机工程, 2024, 50(8): 40-49.
[6]	耿丽丽, 牛保宁. 基于通道相似度熵的卷积神经网络裁剪[J]. 计算机工程, 2024, 50(7): 133-143.
[7]	张洋, 刘畅, 李少青. 基于可控制性度量的图神经网络门级硬件木马检测方法[J]. 计算机工程, 2024, 50(7): 164-173.
[8]	牛瑞婷, 严天峰, 高锐, 王映植. 低信噪比下基于深度学习TCNN-MobileNet的调制识别[J]. 计算机工程, 2024, 50(7): 204-215.
[9]	张溢文, 蔡满春, 陈咏豪, 朱懿, 姚利峰. 融合空间特征的多尺度深度伪造检测方法[J]. 计算机工程, 2024, 50(7): 240-250.
[10]	逯焕宇, 张永宏, 马光义, 谢东林, 田伟. 基于半监督对抗学习的遥感图像水体提取[J]. 计算机工程, 2024, 50(7): 251-263.
[11]	火久元, 王虹阳, 巨涛, 胡军. 多场景下基于AHP-EWM的人体健康状态评估模型研究[J]. 计算机工程, 2024, 50(7): 372-380.
[12]	于洋, 孙芳芳, 吕华, 李扬, 王晓民. 基于多尺度时空注意力网络的微表情检测方法[J]. 计算机工程, 2024, 50(6): 228-235.
[13]	周春雷, 宋继勐, 沈子奇, 余晗, 雷杰, 林兵. 数联网标识解析系统中的标识数据布局策略[J]. 计算机工程, 2024, 50(6): 311-320.
[14]	高家豪, 胡创业, 丁男, 刘战东. 智能网联汽车中联合驾驶风格的交通流数据有效性分析[J]. 计算机工程, 2024, 50(6): 367-376.
[15]	黄君泽, 吴文渊, 李轶, 石明全, 王正江. 面向动态公交的离散分层记忆粒子群优化算法[J]. 计算机工程, 2024, 50(4): 20-30.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

嵌入SENet结构的改进YOLOV3目标识别算法

Improved YOLOV3 Target Recognition Algorithm with Embedded SENet Structure

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价