1 |
ZHANG H, WANG K F, WANG F Y. Advances and perspectives on applications of deep learning in visual object detection. Acta Automatica Sinica, 2017, 43(8): 1289-1305. (in Chinese)
|
2 |
ZHANG S, GONG Y H, WANG J J. The development of deep convolution neural network and its applications on computer vision. Chinese Journal of Computers, 2019, 42(3): 453-482. (in Chinese)
|
3 |
ZHOU F Y, JIN L P, DONG J. Review of convolutional neural network. Chinese Journal of Computers, 2017, 40(6): 1229-1251. (in Chinese)
|
4 |
ZHANG D M, JIN G Q, DAI F, et al. Salient object detection based on deep fusion of hand-crafted features. Chinese Journal of Computers, 2019, 42(9): 2076-2086. (in Chinese)
|
5 |
JIANG H Y, WANG Y J, KANG J Y. A survey of object detection models and its optimization methods. Acta Automatica Sinica, 2021, 47(6): 1232-1255. (in Chinese)
|
6 |
TU Z, GUO Z, XIE W, et al. Fusing disparate object signatures for salient object detection in video. Pattern Recognition, 2017, 72: 285-299.
doi: 10.1016/j.patcog.2017.07.028
|
7 |
LIU L, OUYANG W L, WANG X G, et al. Deep learning for generic object detection: a survey. International Journal of Computer Vision, 2020, 128(2): 261-318.
|
8 |
HOY M, TU Z G, DANG K, et al. Learning to predict pedestrian intention via variational tracking networks[C]//Proceedings of the 21st International Conference on Intelligent Transportation Systems. Washington D.C., USA: IEEE Press, 2018: 3132-3137.
|
9 |
MHALLA A, CHATEAU T, GAZZAH S, et al. An embedded computer-vision system for multi-object detection in traffic surveillance. IEEE Transactions on Intelligent Transportation Systems, 2019, 20(11): 4006-4018.
doi: 10.1109/tits.2018.2876614
|
10 |
LIU Y, MA Z, LIU X M, et al. Privacy-preserving object detection for medical images with Faster R-CNN. IEEE Transactions on Information Forensics and Security, 2022, 17: 69-84.
doi: 10.1109/TIFS.2019.2946476
|
11 |
HUANG K Q, CHEN X T, KANG Y F, et al. Intelligent visual surveillance: a review. Chinese Journal of Computers, 2015, 38(6): 1093-1118. (in Chinese)
|
12 |
DAI K X, LI G H, TU D, et al. Prospects and current studies on background subtraction techniques for moving objects detection from surveillance video. Journal of Image and Graphics, 2006, 11(7): 919-927. (in Chinese)
|
13 |
AHMED F, TARLOW D, BATRA D. Optimizing expected intersection-over-union with candidate-constrained CRFs[C]//Proceedings of IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2016: 1850-1858.
|
14 |
ZHANG Z, SABUNCU M R. Generalized cross entropy loss for training deep neural networks with noisy labels[EB/OL]. [2022-06-05]. https://arxiv.org/abs/1805.07836.
|
15 |
BAE S H. Object detection based on region decomposition and assembly. Proceedings of the AAAI Conference on Artificial Intelligence, 2019, 33(1): 8094-8101.
|
16 |
|
17 |
WU B, NEVATIA R. Cluster boosted tree classifier for multi-view, multi-pose object detection[C]//Proceedings of the 11th IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2007: 1-8.
|
18 |
NOWOZIN S. Optimal decisions from probabilistic models: the intersection-over-union case[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2014: 548-555.
|
19 |
SAXENA E, GOSWAMI M N. Automatic object detection in image processing: a survey. International Journal on Recent and Innovation Trends in Computing and Communication, 2014, 2(12): 4239-4242.
|
20 |
NEUBECK A, VAN GOOL L. Efficient non-maximum suppression[C]//Proceedings of the 18th International Conference on Pattern Recognition. Washington D.C., USA: IEEE Press, 2006: 850-855.
|
21 |
|
22 |
DEVIN C, ABBEEL P, DARRELL T, et al. Deep object-centric representations for generalizable robot learning[C]//Proceedings of IEEE International Conference on Robotics and Automation. Washington D.C., USA: IEEE Press, 2018: 7111-7118.
|
23 |
ZHANG D J, ZHANG Z, ZOU L, et al. Part-based visual tracking with spatially regularized correlation filters. The Visual Computer, 2020, 36(3): 509-527.
|
24 |
AKBAS E, ECKSTEIN M P. Object detection through search with a foveated visual system. PLoS Computational Biology, 2017, 13(10): e1005743.
|
25 |
WANG A T, SUN Y H, KORTYLEWSKI A, et al. Robust object detection under occlusion with context-aware CompositionalNets[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2020: 12642-12651.
|
26 |
|
27 |
FORT A, DELPUECH C, PERNIER J, et al. Dynamics of cortico-subcortical cross-modal operations involved in audio-visual object detection in humans. Cerebral Cortex, 2002, 12(10): 1031-1039.
doi: 10.1093/cercor/12.10.1031
|
28 |
GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2014: 580-587.
|
29 |
GIRSHICK R. Fast R-CNN[C]//Proceedings of 2015 IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2015: 1440-1448.
|
30 |
REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
|
31 |
HE K M, GKIOXARI G, DOLLÁR P, et al. Mask R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2017: 2980-2988.
|
32 |
REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2016: 779-788.
|
33 |
REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2017: 6517-6525.
|
34 |
|
35 |
|
36 |
|
37 |
LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 318-327.
doi: 10.1109/TPAMI.2018.2858826
|
38 |
DUAN K W, BAI S, XIE L X, et al. CenterNet: keypoint triplets for object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2020: 6568-6577.
|
39 |
|
40 |
RANJAN R, PATEL V M, CHELLAPPA R. HyperFace: a deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, 41(1): 121-135.
doi: 10.1109/TPAMI.2017.2781233
|
41 |
|
42 |
REZATOFIGHI H, TSOI N, GWAK J, et al. Generalized intersection over union: a metric and a loss for bounding box regression[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2020: 658-666.
|
43 |
WANG X L, GUPTA A. Unsupervised learning of visual representations using videos[C]//Proceedings of IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2016: 2794-2802.
|
44 |
ZHENG Z H, WANG P, LIU W, et al. Distance-IoU loss: faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 12993-13000.
doi: 10.1609/aaai.v34i07.6999
|
45 |
|
46 |
LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2015: 3431-3440.
|
47 |
PETERSEN S E, POSNER M I. The attention system of the human brain: 20 years after. Annual Review of Neuroscience, 2012, 35: 73-89.
doi: 10.1146/annurev-neuro-062111-150525
|
48 |
LAW H, DENG J. CornerNet: detecting objects as paired keypoints. International Journal of Computer Vision, 2020, 128(3): 642-656.
|
49 |
JIAO L C, ZHANG F, LIU F, et al. A survey of deep learning-based object detection. IEEE Access, 2019, 7: 128837-128868.
doi: 10.1109/ACCESS.2019.2939201
|