Abstract: This paper presents a method for extracting Ajax information items from forms. The method embeds a JavaScript engine in a standalone program that runs independently of the browser, and constructs the DOM objects and Ajax application objects locally. The JavaScript engine traces and executes the script code, simulating the operations a user would perform in a browser, so that the data for the form's Ajax information items is obtained automatically. Experimental results show that the method obtains the complete form information of Deep Web query interfaces and improves search accuracy.
Key words:
query interface,
form feature extraction,
Ajax technology,
JavaScript engine
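The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: there is no real JavaScript engine here, and the page's script is modeled as a plain Python callable. The names `SelectElement`, `FakeXHR`, `extract_dependent_options`, the URLs, and the province/city data are all hypothetical, introduced only to show how locally built DOM and Ajax stand-ins let a program fire a form's event handlers and collect the values an Ajax call would have filled in.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class SelectElement:
    """Local stand-in for a DOM <select> element (hypothetical, not a real DOM API)."""
    name: str
    options: List[str] = field(default_factory=list)
    value: Optional[str] = None
    onchange: Optional[Callable[["SelectElement"], None]] = None


class FakeXHR:
    """Local stand-in for an Ajax object; answers from a table instead of the network."""

    def __init__(self, responses: Dict[str, List[str]]) -> None:
        self.responses = responses
        self.requested: List[str] = []  # log of every URL the "script" asked for

    def get(self, url: str) -> List[str]:
        self.requested.append(url)
        return self.responses.get(url, [])


def extract_dependent_options(
    parent: SelectElement, child: SelectElement
) -> Dict[str, List[str]]:
    """Simulate a user choosing each parent option and record how the child is filled."""
    collected: Dict[str, List[str]] = {}
    for option in parent.options:
        parent.value = option
        if parent.onchange is not None:
            parent.onchange(parent)  # fire the handler, as a browser would on selection
        collected[option] = list(child.options)
    return collected


# Hypothetical form: choosing a province loads its cities through an Ajax request.
xhr = FakeXHR({
    "/cities?province=Hebei": ["Shijiazhuang", "Tangshan"],
    "/cities?province=Shandong": ["Jinan", "Qingdao"],
})
province = SelectElement("province", options=["Hebei", "Shandong"])
city = SelectElement("city")


def on_province_change(element: SelectElement) -> None:
    # Plays the role of the page's script; a real system would run this in a JS engine.
    city.options = xhr.get(f"/cities?province={element.value}")


province.onchange = on_province_change
result = extract_dependent_options(province, city)
print(result)
```

In this sketch the extractor never renders a page: it only needs objects that look enough like the DOM and the Ajax transport for the handler to run, which is the property the abstract attributes to the method.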
DUAN Qing-Ling, YANG Ren-Gang, ZHU Yang. Extraction Method of Form Ajax Information Item[J]. Computer Engineering, 2011, 37(3): 44-46.