Rules-based Deep Web Information Retrieval

doi:10.3969/j.issn.1000-3428.2008.13.019

Computer Engineering ›› 2008, Vol. 34 ›› Issue (13): 51-53. doi: 10.3969/j.issn.1000-3428.2008.13.019

• Software Technology and Database • Previous Articles Next Articles

Rules-based Deep Web Information Retrieval

YANG Ju-feng1, SHI Guang-shun1, ZHAO Yu-juan1,2, WANG Qing-ren1

(1. Institute of Machine Intelligence, Nankai University, Tianjin 300071; 2. Tianjin Meteorological Information Center, Tianjin 300074)

Received:1900-01-01 Revised:1900-01-01 Online:2008-07-05 Published:2008-07-05

基于规则集的Deep Web信息检索

杨巨峰1，史广顺1，赵玉娟1,2，王庆人1

(1. 南开大学机器智能研究所，天津 300071；2. 天津市气象信息中心，天津 300074)

Abstract

Abstract: This paper proposes a novel rules-based model to extract data from Deep Web pages. The model comprises four layers, main processing parts as task allocation, information extraction, data cleaning which work based on the rules of structure, logic and application. It applies the new model to three intelligent system, scientific paper retrieval, electronic ticket ordering and resume searching. Experimental results show that the proposed method is robust and feasible.

Key words: information retrieval, Deep Web, rules set, data extraction

摘要： 提出一种基于规则集的新型Deep Web信息检索模型。该模型包含4个层次，主要处理环节如任务分派、信息提取、数据清洗等引入了Deep Web特有的结构规则、逻辑规则和应用规则协助工作。把该模型应用于科技文献检索、电子机票定购和工作简历搜索3个领域，实验结果证明该模型灵活、可信，有效信息查全率达到96%以上。

关键词: 信息检索, 深层网络, 规则集, 数据提取

CLC Number:

TP311

YANG Ju-feng; SHI Guang-shun; ZHAO Yu-juan; WANG Qing-ren. Rules-based Deep Web Information Retrieval[J]. Computer Engineering, 2008, 34(13): 51-53.

杨巨峰;史广顺;赵玉娟;王庆人. 基于规则集的Deep Web信息检索[J]. 计算机工程, 2008, 34(13): 51-53.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2008.13.019

http://www.ecice06.com/EN/Y2008/V34/I13/51

[1]	LI Pei, CHEN Qiaosong, CHEN Pengchang, DENG Xin, WANG Jin, PIAO Changhao. Multi-Modal Fine-Grained Retrieval Based on Modal Specific and Modal Shared Feature Information [J]. Computer Engineering, 2022, 48(11): 62-68,76.
[2]	GAO Jun,HUANG Xiance. Design and Implementation of Correlation Weight Algorithm Based on Hadoop Platform [J]. Computer Engineering, 2019, 45(3): 26-31.
[3]	ZHANG Qianqian,TIAN Xuedong,YANG Fang,LI Xinfu. Integration Retrieval Model Based on Transformation of Mathematical Text and Expression [J]. Computer Engineering, 2019, 45(3): 175-181,187.
[4]	SAIMAITI Maimaitimin, ESMAEL Abdurehim. Research on Uyghur Stop Words Extraction Method [J]. Computer Engineering, 2019, 45(10): 288-292,300.
[5]	XIAN Xuefeng,CUI Zhiming,FANG Ligang,GU Caidong,SUN Xun. Data Source Two-layer Selection Model for Deep Web Localized Data Integration [J]. Computer Engineering, 2017, 43(3): 32-39.
[6]	WANG Ying,LUO Zhunchen,YU Yang. Research on Microblog Diversification Retrieval Problem Based on Rank Learning Model [J]. Computer Engineering, 2017, 43(11): 152-160.
[7]	QIN Huazheng,HU Zhongshun,YANG Deqing,XIAO Yanghua. Encyclopedia Related Entity Construction Based on Category Template Mining [J]. Computer Engineering, 2016, 42(9): 180-185,191.
[8]	WU Guangxian,LIU Nianyi,LIU Boya. Design and Application of BGN-type CPA Secure Encryption Scheme Based on LWE [J]. Computer Engineering, 2016, 42(12): 118-123.
[9]	DENG Song. Deep Web Data Source Selection for Entity Information Integrated Retrieval [J]. Computer Engineering, 2016, 42(10): 75-79.
[10]	JI Pengfei,LI Yuangang,LU Shengqi,DAI Kaiyu. Personalized Customization System of Travel Route Based on Semantic Web [J]. Computer Engineering, 2016, 42(10): 308-317.
[11]	DENG Xiaojun,MAN Junfeng,OUYANG Min. Online Evaluation Algorithm of Sorting Device Based on K-armed Dueling Bandits Problem [J]. Computer Engineering, 2015, 41(9): 271-275.
[12]	LI Jinzhong,YANG Wei,XIA Jiewu,ZENG Xiaohui,SUN Lingyu. Learning to Rank Method Based on Hooke & Jeeves Pattern Search [J]. Computer Engineering, 2015, 41(7): 215-218.
[13]	XU Jia-ming, LI Xiao-dong, JIN Jian, MA Ying. An Efficient Multiple Patterns String Matching Algorithm [J]. Computer Engineering, 2014, 40(3): 315-320.
[14]	ZHANG Xu-dong, SUN Zhi-ming, LIU Ya-ning, SHAN Dong-dong, YAN Hong-fei. Inverted Index Compression Algorithms Based on 64-bit Architecture [J]. Computer Engineering, 2014, 40(2): 71-76.
[15]	ZHU Jing-hua,WANG Xiao-ling. XML Keyword Search Based on Extended Query Expression [J]. Computer Engineering, 2014, 40(10): 25-31.

Please choose a citation manager

Content to export

Rules-based Deep Web Information Retrieval

基于规则集的Deep Web信息检索

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Rules-based Deep Web Information Retrieval

基于规则集的Deep Web信息检索

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments