[1] Xi Wensi, Edward F, Roy T, et al. Machine Learning Approach for Homepage Finding Task[C]//Proc. of the 9th International Symposium on String Processing and Information Retrieval. London, UK: Springer-Verlag, 2002.
[2] Craswell N, Hawking D. Overview of the TREC-2002 Web Track[C]//Proc. of the 11th Text Retrieval Conference. Gaithersburg, USA: [s. n.], 2003.
[3] Huang Yuqing, Qi Guangzhi, Zhang Fuyan. Constructing Extractors for Semi-structured Information from Web Documents[J]. Journal of Software, 2000, 11(11): 73-78. (in Chinese)
[4] Zhai Yanhong, Liu Bing. Structured Data Extraction from the Web Based on Partial Tree Alignment[J]. IEEE Transactions on Knowledge and Data Engineering, 2006, 18(12): 1614-1628.
[5] Zhao Hong, Xiao Hong, Xue Dejun, et al. A Survey of Research on Web Table Information Extraction[J]. New Technology of Library and Information Service, 2008, (3): 24-31. (in Chinese)
[6] Craswell N, Hawking D, Robertson S. Effective Site Finding Using Link Anchor Information[C]//Proc. of SIGIR’01. New York, USA: ACM Press, 2001.
[7] Zhou Bo, Liu Yiqun, Zhang Min, et al. Analysis of the Effectiveness of Anchor Text Retrieval[J]. Journal of Software, 2011, 22(8): 1714-1724. (in Chinese)
[8] Wang Zhongfei, Wang Biao. Improved PageRank Algorithm Based on Anchor Text Similarity[J]. Computer Engineering, 2010, 36(24): 258-260. (in Chinese)
[9] Kraft R, Zien J. Mining Anchor Text for Query Refinement[C]//Proc. of the 13th International Conference on World Wide Web. New York, USA: ACM Press, 2004.
[10] Metzler D, Novak J, Cui Hang, et al. Building Enriched Document Representations Using Aggregated Anchor Text[C]//Proc. of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 2009.
[11] Weninger T, Fumarola F, Han Jiawei. Mapping Web Pages to Database Records via Link Paths[C]//Proc. of the 19th ACM International Conference on Information and Knowledge Management. New York, USA: ACM Press, 2010.
[12] Yen J. Finding the K Shortest Loopless Paths in a Network[J]. Management Science, 1971, 17(11): 712-716.