作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2020, Vol. 46 ›› Issue (11): 261-266. doi: 10.19678/j.issn.1000-3428.0056527

• 图形图像处理 • 上一篇    下一篇

基于多视图架构深度神经网络的图像威胁识别

叶晴昊1, 涂岱键1, 毕奇2, 秦飞巍1, 葛瑞泉1, 白静3   

  1. 1. 杭州电子科技大学 计算机学院, 杭州 310018;
    2. 武汉大学 遥感信息工程学院, 武汉 430079;
    3. 北方民族大学 计算机科学与工程学院, 银川 750021
  • 收稿日期:2019-11-07 修回日期:2019-12-10 发布日期:2019-12-13
  • 作者简介:叶晴昊(1998-),男,本科生,主研方向为机器视觉、机器学习;涂岱键,本科生;毕奇,硕士研究生;秦飞巍(通信作者),副教授、博士;葛瑞泉,讲师、博士;白静,副教授、博士。
  • 基金资助:
    国家自然科学基金(61702146,61762003,61972121);国家级大学生创新创业训练计划项目(201810336023);浙江省认知医疗工程技术研究中心开放课题(2018KFJJ05)。

Image Threat Recognition Based on Multiple View Architecture Deep Neural Network

YE Qinghao1, TU Daijian1, BI Qi2, QIN Feiwei1, GE Ruiquan1, BAI Jing3   

  1. 1. School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018, China;
    2. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China;
    3. School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
  • Received:2019-11-07 Revised:2019-12-10 Published:2019-12-13

摘要: 由于藏匿物体的大小、形状和位置未知,且样本类别不均衡,常用的深度学习方法存在误报率较高的问题。为此,构建一种基于多视图架构的深度卷积神经网络模型。通过残差连接卷积神经网络对特征进行提取,使用基于稠密连接的长短期记忆注意力模型模拟人类多角度观察,以强化威胁信息表达,并基于焦点损失函数优化网络,从而构成端到端的架构。在HD-AIT毫米波人体威胁扫描数据集上的测试结果表明,相比其他基线模型,该模型的准确率和召回率分别可达到0.997、0.999。

关键词: 威胁识别, 毫米波图像, 深度学习, 图像识别, 注意力机制

Abstract: When applied to security checks,common deep learning methods have to address the high false alarm rate caused by the unknown size,shape and location of hidden objects as well as unbalanced sample categories.To deal with the problem,this paper proposes a deep convolutional neural network model based on multiple view architecture.The model uses convolutional neural networks with residual connections to extract features.Then a Long Short Term Memory(LSTM) attention model based on dense connections is used to simulate the process of human observations from multiple perspectives to enhance the expression of threat-related information.At the same time,the network is optimized based on the focus loss function to form an end-to-end framework.The experimental results on HD-AIT millimeter-wave scaned human body threating dataset show that the proposed model increases the accuracy to 0.997 and recall rate to 0.999 compared with other baseline models.

Key words: threat recognition, millimeter wave image, deep learning, image recognition, attention mechanism

中图分类号: