脱离预训练的多尺度目标检测网络模型

doi:10.19678/j.issn.1000-3428.0056417

计算机工程 ›› 2020, Vol. 46 ›› Issue (6): 248-255. doi: 10.19678/j.issn.1000-3428.0056417

脱离预训练的多尺度目标检测网络模型

包壮壮¹, 赵学军¹, 王明芳², 董玉浩¹, 庞梦洋¹, 黄林¹, 贺刚³

1. 空军工程大学基础部, 西安 710051;
2. 中国人民解放军 93861部队, 陕西咸阳 713800;
3. 中国人民解放军 32055部队, 南京 210046

收稿日期:2019-10-28 修回日期:2019-12-10 发布日期:2019-12-20
作者简介:包壮壮(1996-),男,硕士研究生,主研方向为深度学习、目标检测;赵学军、王明芳,副教授、博士;董玉浩、庞梦洋、黄林,硕士研究生;贺刚,博士。
基金资助:
国家自然科学基金（61472443）。

Multi-Scale Target Detection Network Model Trained from Scratch

BAO Zhuangzhuang¹, ZHAO Xuejun¹, WANG Mingfang², DONG Yuhao¹, PANG Mengyang¹, HUANG Lin¹, HE Gang³

1. Department of Basic Science, Air Force Engineering University, Xi'an 710051, China;
2. Unit 93861 of Chinese People's Liberation Army, Xiangyang, Shaanxi 713800, China;
3. Unit 32055 of Chinese People's Liberation Army, Nanjing 210046, China

Received:2019-10-28 Revised:2019-12-10 Published:2019-12-20

摘要/Abstract

摘要： 为提高卷积神经网络目标检测模型精度并增强检测器对小目标的检测能力，提出一种脱离预训练的多尺度目标检测网络模型。采用脱离预训练检测网络使其达到甚至超过预训练模型的精度，针对小目标特点设计新的Deformable-ScratchNet网络模型，调整网络结构并融合浅层信息以提高对小目标的检测性能。实验结果表明，与Faster-RCNN等经典网络模型相比，该模型在PASCAL VOC数据集和自制遥感军事目标数据集上的检测精度更高。

关键词: 脱离预训练, 可变卷积, 小目标检测, 多尺度目标, 遥感图像

Abstract: In order to improve the accuracy of the target detection model using convolutional neural network and enhance the detection ability of the detector for small targets,this paper proposes a multi-scale target detection network model trained from scratch.The detection network is trained from scratch to increase its accuracy to the level of pre-trained models or even higher.Then a new Deformable-ScratchNet network model is designed according to the characteristics of small targets.Its network structure is adjusted,and shallow information is integrated with the model to improve the detection performance of small targets.Experimental results show that compared with Faster-RCNN and other classic network models,the proposed model has higher detection accuracy on the PASCAL VOC data set and self-made remote sensing image of military target data set.

Key words: trained from scratch, variable convolution, small target detection, multi-scale target, remote sensing image

中图分类号:

TP183

包壮壮, 赵学军, 王明芳, 董玉浩, 庞梦洋, 黄林, 贺刚. 脱离预训练的多尺度目标检测网络模型[J]. 计算机工程, 2020, 46(6): 248-255.

BAO Zhuangzhuang, ZHAO Xuejun, WANG Mingfang, DONG Yuhao, PANG Mengyang, HUANG Lin, HE Gang. Multi-Scale Target Detection Network Model Trained from Scratch[J]. Computer Engineering, 2020, 46(6): 248-255.

https://www.ecice06.com/CN/Y2020/V46/I6/248

图/表 11

20200617091329

20200617091332

20200617091334

20200617091338

20200617091342

20200617091346

20200617091349

20200617091352

20200617091355

20200617091359

20200617091402

参考文献

[1] RUSSAKOVSKY O,DENG J,SU H,et al.ImageNet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115(3):211-252.
[2] ZHU Rui,ZHANG Shifeng,WANG Xiaobo,et al.ScratchDet:exploring to train single-shot object detectors from scratch[EB/OL].(2018-10-19)[2019-09-01].https://arxiv.org/abs/1810.08425v3.
[3] SHEN Zhiqiang,LIU Zhuang,LI Jianguo,et al.Dsod:Learning deeply supervised object detectors from scratch[C]//Proceedings of International Conference on Computer Vision.Venice,Italy:IEEE Press,2017:1937-1945.
[4] SANTURKAR S,TSIPRAS D,ILYAS A,et al.How does batch normalization help optimization?[EB/OL].(2018-05-29)[2019-09-01].https://arxiv.org/abs/1805.11604.
[5] LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(2):318-327.
[6] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].(2014-09-04)[2019-09-01].https://arxiv.org/abs/1409.1556.
[7] HE Kaiming,ZHANG Xiangyu,REN Shaoqing,et al.Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision & Pattern Recognition. Las Vegas,USA:IEEE Press,2016:2-8.
[8] FU C Y,LIU W,RANGA A,et al.DSSD:deconvolutional single shot detector[EB/OL].(2017-01-23)[2019-09-01].https://arxiv.org/abs/1701.06659.
[9] EVERINGHAM M,GOOL L V,WILLIAMS C,et al.Pascal visual object classes challenge results[J].International Journal of Computer Vision,2010,88:303-307.
[10] EVERINGHAM M,VAN GOOL L,WILLIAMS C K I,et al.The Pascal Visual Object Classes (VOC) challenge[J].International Journal of Computer Vision,2010,88(2):303-338.
[11] SHEN Z Q,SHI H H,ROGERIO F,et al.Learning object detectors from scratch with gated recurrent feature pyramids[EB/OL].(2017-12-04)[2019-09-01].https://arxiv.org/abs/1712.00886v1.
[12] IOFFE S,SZEGEDY C.Batch normalization:accelerating deep network training by reducing internal covariate shift[C]//Proceedings of International Conference on International Conference on Machine Learning.Lille,France:[s.n.],2015:21-29.
[13] DAI J F,QI H Z,XIONG Y W,et al.Deformable convolutional networks[C]//Proceedings of 2017 IEEE International Conference on Computer Vision.Venice,Italy:IEEE Press,2017:764-773.
[14] SHELHAMER E,LONG J,DARRELL T.Fully convolutional networks for semantic segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(4):640-651.
[15] LI Hongyan,LI Chungeng,AN Jubai,et al.Attention mechanism improves CNN remote sensing image object detection[J].Journal of Image and Graphics,2019,24(8):1400-1408. 李红艳,李春庚,安居白,等.注意力机制改进卷积神经网络的遥感图像目标检测[J].中国图象图形学报,2019,24(8):1400-1408.
[16] LIU W,ANGUELOV D,ERHAN D,et al.SSD:single shot multibox detector[C]//Proceedings of ECCV'16.Amsterdam,Holland:Springer International Publishing,2016:21-37.
[17] REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu,USA:IEEE Press,2017.
[18] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[19] REN Yun,ZHU Changren,XIAO Shunping.Deformable faster R-CNN with aggregating multi-layer features for partially occluded object detection in optical remote sensing images[J].Remote Sensing,2018,10(9):1470-1478.

选择文件类型/文献管理软件名称

选择包含的内容

脱离预训练的多尺度目标检测网络模型

Multi-Scale Target Detection Network Model Trained from Scratch

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	代尹翘, 肖武龙, 李柏林, 李立. 基于改进YOLOv5s的莴笋芯部检测算法[J]. 计算机工程, 2026, 52(6): 352-364.
[2]	魏文泉, 莫宏伟. 基于改进YOLOv5s的PCB缺陷检测算法[J]. 计算机工程, 2026, 52(5): 226-238.
[3]	杨路, 刘俊杰, 余翔. 多尺度信息增强的遥感图像目标检测算法[J]. 计算机工程, 2026, 52(4): 200-213.
[4]	汤伟博, 方强, 李沛根, 艾龙金, 熊金红, 夏海廷. 基于RSD-YOLO的无人机航拍图像小目标检测[J]. 计算机工程, 2026, 52(4): 214-228.
[5]	王沙沙, 李帷韬, 刘星宇, 高辉. 基于层级注意力的域自适应遥感图像分割[J]. 计算机工程, 2026, 52(4): 176-186.
[6]	唐克, 魏飞鸣, 李东瀛, 郁文贤. 基于改进YOLOv8的轻量化无人机图像目标检测算法[J]. 计算机工程, 2026, 52(3): 97-106.
[7]	曹继卫, 罗飞, 丁炜超. BS-YOLO: 基于BSAM注意力机制和SCConv的小目标检测算法[J]. 计算机工程, 2026, 52(3): 119-127.
[8]	张信佳, 王芳. 基于多层次特征融合和注意力机制的无人机图像小目标检测算法[J]. 计算机工程, 2026, 52(2): 148-157.
[9]	王熠, 李智, 张丽, 石雪丽, 刘登波, 卢妤. 基于遥感图像场景分类的频域量化对抗攻击[J]. 计算机工程, 2026, 52(1): 266-281.
[10]	欧寒芝, 黄睿. 混合噪声下的遥感图像多标签分类[J]. 计算机工程, 2026, 52(1): 188-195.
[11]	马淦, 谷雨, 彭冬亮. 结合改进YOLOv5s和动态数据增强的海面舰船检测[J]. 计算机工程, 2025, 51(9): 294-305.
[12]	王舒梦, 徐慧英, 朱信忠, 黄晓, 宋杰, 李毅. 基于改进YOLOv8n的航拍轻量化小目标检测算法: PECS-YOLO[J]. 计算机工程, 2025, 51(9): 280-293.
[13]	倪源松, 韩军, 邹小燕, 胡广怡, 王文帅. 两阶段自适应分块输电线路螺栓缺陷检测方法[J]. 计算机工程, 2025, 51(8): 281-291.
[14]	苗茹, 李祎, 周珂, 张俨娜, 常然然, 孟更. 一种改进的Faster R-CNN遥感图像多目标检测模型研究[J]. 计算机工程, 2025, 51(8): 292-304.
[15]	栾孟娜, 郑秋梅, 王风华. 基于DMC-YOLO的交通标志实时检测算法[J]. 计算机工程, 2025, 51(7): 90-99.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

脱离预训练的多尺度目标检测网络模型

Multi-Scale Target Detection Network Model Trained from Scratch

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价