Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2009, Vol. 35 ›› Issue (9): 204-207. doi: 10.3969/j.issn.1000-3428.2009.09.072

• Artificial Intelligence and Recognition Technology • Previous Articles     Next Articles

Quality Estimation Model of Deep Web Data Source

HU Peng-yu1, ZHAO Peng-peng1, FANG Wei1, CUI Zhi-ming1,2   

  1. (1. Institute of Intelligent Information Processing & Application, Soochow University, Suzhou 215006; 2. Key Lab of Computer Information Processing Technology of Jiangsu Province, Suzhou 215006)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-05-05 Published:2009-05-05

深网数据源质量估计模型

胡鹏昱1,赵朋朋1,方 巍1,崔志明1,2   

  1. (1. 苏州大学智能信息处理及应用研究所,苏州 215006;2. 江苏省计算机信息处理技术重点实验室,苏州 215006)

Abstract: In order to get valuable information from the mass Deep Web, this paper proposes a quality estimation model of Deep Web data sources, considering the query capability of interface, the quality of interface pages and the Quality of Services(QoS), using the SVM and Ranking SVM machine learning approach to obtain the quality estimation function. Experimental results show the Kendall’s distance between data sources quality sort sequences made by this quality estimation function and the artificial one is more than 0.5, and achieves higher accuaracy.

Key words: Deep Web, query capability, query interface, Quality of Services(QoS)

摘要: 为从海量深网中获得有价值的信息,提出一种深网数据源质量估计模型,综合考虑接口查询能力、接口页面质量和服务质量3方面因素,采用SVM和Ranking SVM机器学习方法得到质量估计函数。实验结果表明,该估计函数得到的数据源质量排序序列和人工排序序列的Kendall’s 距离超过0.5,且获得较高的精度。

关键词: 深网, 查询能力, 查询接口, 服务质量

CLC Number: