Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2020, Vol. 46 ›› Issue (3): 129-137,143. doi: 10.19678/j.issn.1000-3428.0053360

• Cyberspace Security • Previous Articles     Next Articles

Intelligent Detection for Big Data Scripting Attack Based on Image Processing Inspired Method and Vectorization

ZHANG Haijuna, CHEN Yinghuib   

  1. a. College of Computer Science;b. School of Mathematics, Jiaying University, Meizhou, Guangdong 514015, China
  • Received:2018-12-10 Revised:2019-03-26 Published:2019-05-28

基于类图像处理与向量化的大数据脚本攻击智能检测

张海军a, 陈映辉b   

  1. 嘉应学院 a. 计算机学院;b. 数学学院, 广东 梅州 514015
  • 作者简介:张海军(1978-),男,讲师、博士,主研方向为智能计算、自然语言处理、模式识别;陈映辉(通信作者),讲师。
  • 基金资助:
    国家自然科学基金(61171141,61573145);广东省自然科学基金重点项目(2014B010104001,2015A030308018);广东省普通高等学校人文社会科学省市共建重点研究基地课题(18KYKT11);广东省嘉应学院自然科学基金重点项目(2017KJZ02)。

Abstract: In this paper,the methods similar to image processing and vectorization are used for the verctorization of access traffic corpus big data,and the intelligent detection for big data cross-site scripting attack is achieved.Besides,this paper uses methods similar to image processing for data acquisition,data cleaning,data sampling and feature extraction.Then,a word vectorization algorithm based on neural network is designed to obtain the big data of word vectorization.On this basis,the DCNNs intelligent detection algorithm with different depth is proposed.Finally,experiments with different hyper-parameter are conducted,and the obtained average recognition rate,variance and standard deviation show that the proposed algorithm has high recognition rate and stability.

Key words: Web intrusion detection, Cross-Site Scripting(XSS) attack, natural language processing, big data, cyberspace security

摘要: 通过类图像处理与向量化方法对访问流量语料库大数据进行词向量化处理,实现面向大数据跨站脚本攻击的智能检测。利用类图像处理方法进行数据获取、数据清洗、数据抽样和特征提取,设计一种基于神经网络的词向量化算法,得到词向量化大数据。在此基础上,提出多种不同深度的DCNNs智能检测算法。设置不同的超参数进行实验得到算法的识别率均值、方差和标准差,结果表明,该算法具有较高的识别率和稳定性。

关键词: Web入侵检测, 跨站脚本攻击, 自然语言处理, 大数据, 网络空间安全

CLC Number: