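Malware Family Classification Model Based on MobileNet

Computer Engineering, 2020, Vol. 46, Issue (4): 162-168. doi: 10.19678/j.issn.1000-3428.0054313

ZENG Yaqin^a, ZHANG Linlin^b,c, ZHANG Ruonan^a, YANG Bo^a

a. School of Software; b. College of Cyberspace Security; c. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China

Received: 2019-03-20 Revised: 2019-05-17 Published: 2020-04-15 Released: 2019-05-24
About the authors: ZENG Yaqin (born 1991), female, master's student, whose main research interests are malicious code detection and classification and deep learning; ZHANG Linlin (corresponding author), associate professor, Ph.D.; ZHANG Ruonan and YANG Bo, master's students.
Funding:
National Natural Science Foundation of China, "Research on Context-Aware Models for Mobile Learning" (61867006); Innovation Environment Construction Special Project of the Department of Science and Technology of Xinjiang Uygur Autonomous Region, "Campus Network Security Audit Data Sharing and Threat Intelligence Analysis Platform" (PT1811); Joint Fund of the Innovation Environment Construction Special Project (Natural Science Foundation) of Xinjiang Uygur Autonomous Region, "Research on Android Malware Detection Methods Based on the Fusion of Multiple Techniques" (2019D01C062); Scientific Research Program of Higher Education Institutions of Xinjiang Uygur Autonomous Region, Natural Science Foundation General Program, "Research on Runtime Behavior Detection Methods for Mobile Applications Based on Anomaly Models" (XJEDU2017M005).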

Abstract

Abstract: Existing malicious code classification methods based on Convolutional Neural Networks (CNN) consume substantial computational resources. To reduce the computation and the number of parameters required for classification, this paper constructs a malware family classification model based on malicious code visualization and a lightweight CNN. Each malware sample is visualized as a grayscale image, so that the similarity between grayscale images reflects the similarity in code structure among samples of the same family. The grayscale images are then used to train MobileNet v2, a neural network model built on depthwise separable convolutions, which automatically extracts texture features, and a Softmax classifier assigns each sample to a malware family. Experimental results show that the average classification accuracy of the proposed model is 99.32%, which is 2.14 percentage points higher than that of the classic malicious code visualization model.

Key words: Convolutional Neural Network (CNN), malware classification, texture feature, MobileNet v2 model, Softmax model
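The two ideas named in the abstract, visualizing a binary as a grayscale image and replacing standard convolutions with depthwise separable ones, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fixed image width of 16, the zero-padding of the last row, and the example layer sizes are all assumptions chosen for illustration, and the parameter count covers only the basic depthwise separable factorization, not the full MobileNet v2 block.

```python
from typing import List


def bytes_to_grayscale(data: bytes, width: int = 16) -> List[List[int]]:
    """Arrange raw bytes into rows of `width` pixels, one byte per pixel.

    Each byte (0-255) is used directly as a grayscale intensity. The last
    row is zero-padded so that every row has the same length (an assumption;
    the abstract does not say how partial rows are handled).
    """
    if width <= 0:
        raise ValueError("width must be positive")
    rows = []
    for start in range(0, len(data), width):
        row = list(data[start:start + width])
        row.extend([0] * (width - len(row)))  # pad the final partial row
        rows.append(row)
    return rows


def standard_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias terms ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise k x k convolution followed by a 1 x 1
    pointwise convolution (bias terms ignored)."""
    depthwise = k * k * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


# Hypothetical stand-in for the contents of a malware binary.
sample = bytes(range(40))
image = bytes_to_grayscale(sample, width=16)
print(len(image), len(image[0]))  # number of rows, number of columns

# Example layer sizes (assumed): 3 x 3 kernel, 32 input and 64 output channels.
std = standard_conv_params(3, 32, 64)
sep = separable_conv_params(3, 32, 64)
print(std, sep)
```

For the assumed layer sizes, the separable form needs 2336 weights against 18432 for the standard convolution, which is the kind of saving that motivates a lightweight model for this task.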

CLC Number: TP309

ZENG Yaqin, ZHANG Linlin, ZHANG Ruonan, YANG Bo. Malware Family Classification Model Based on MobileNet[J]. Computer Engineering, 2020, 46(4): 162-168.

http://www.ecice06.com/CN/Y2020/V46/I4/162

Figures/Tables: 14
