作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (4): 46-48. doi: 10.3969/j.issn.1000-3428.2011.04.017

• 软件技术与数据库 • 上一篇    下一篇

基于k-gram频数的静态软件胎记

陈 林,刘粉林,芦 斌,谢 鑫   

  1. CHEN Lin, LIU Fen-lin, LU Bin, XIE Xin
  • 出版日期:2011-02-20 发布日期:2011-02-17
  • 作者简介:陈 林(1984-),男,硕士研究生,主研方向:软件版权保护,软件胎记;刘粉林,教授、博士生导师;芦 斌,讲师、硕士;谢 鑫,硕士研究生
  • 基金资助:
    国家自然科学基金资助项目(60970141, 60902102)

Static Software Birthmark Based on k-gram Frequency

CHEN Lin, LIU Fen-lin, LU Bin, XIE Xin   

  1. (Institute of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China)
  • Online:2011-02-20 Published:2011-02-17

摘要: 基于k-gram的静态软件胎记无法较好区分2个代码量差距较大的程序,且其胎记的鲁棒性一般。为此,将程序k-gram碎片的频数向量作为软件胎记,以其中一个程序的k-gram碎片集为基准对另一个程序的频数向量进行预变换,计算变换后的2个频数向量之间夹角的余弦,作为胎记的相似度。实验结果表明,对于Java类文件的检测,该胎记的可信性和鲁棒性均有一定提高。

关键词: 软件盗版检测, 静态软件胎记, k-gram频数

Abstract: Static software birthmark based on k-gram can not appropriate to distinguish the two programs with much different amount of codes and its robustness is generally. Aiming at these problems, this paper presents a static software birthmark based on k-gram frequencies. For reaching a better balance between its credibility and resilience, it performs the fore-transform on the frequency vectors, and then uses vector cosine to calculate the birthmark similarity. Experimental results show that both the credibility and resilience of the birthmark are improved for Java class files.

Key words: software theft detection, static software birthmark, k-gram frequency

中图分类号: