参考文献
[1]Srinivasan P,Menczer F,Pant G.A General Evaluation Framework for Topical Crawlers[J].Information Retrieval,2005,8(3):417-447.
[2]Hersovici M,Jacovi M,Maarek Y S,et al.The Shark-search Algorithm.An Application:Tailored Web Site Mapping[J].Computer Networks and ISDN Systems,1998,30(1):317-326.
[3]Menczer F,Pant G,Srinivasan P.Topical Web Crawlers:Evaluating Adaptive Algorithms[J].ACM Transactions on Internet Technology,2004,4(4):378-419.
[4]Cho J,Garcia-Molina H,Page L.Efficient Crawling Through URL Ordering[J].Computer Networks,2012,56(18):3849-3858.
[5]Kleinberg J M.Authoritative Sources in a Hyperlinked Environment[J].Journal of the ACM,1999,46(5):604-632.
[6]Haveliwala T H.Topic-sensitive Pagerank[C]//Pro-ceedings of the 11th International Conference on World Wide Web.New York,USA:ACM Press,2002:517-526.
[7]Richardson M,Domingos P.The Intelligent Surfer:Probabilistic Combination of Link and Content Info-rmation in PageRank[C]//Proceedings of the Neural Information Processing Systems Conference.[S.l.]:Neural Information Processing Systems Foundation,2001:1441-1448.
[8]Batsakis S,Petrakis E G M,Milios E.Improving the Performance of Focused Web Crawlers[J].Data & Knowledge Engineering,2009,68(10):1001-1013.
[9]贺晟,程家兴,蔡欣宝.基于模拟退火算法的主题爬虫[J].计算机技术与发展,2009,19(12):55-58.
[10]李璐,张国印,李正文.基于 SVM 的主题爬虫技术研究[J].计算机科学,2015,42(2):118-122.
[11]皮靖,邵雄凯,肖雅夫.基于朴素贝叶斯算法的主题爬虫的研究[J].计算机与数字工程,2012,40(6):76-78.
[12]Ehrig M,Maedche A.Ontology-focused Crawling of Web Documents[C]//Proceedings of 2003 ACM Symposium on Applied Computing.New York,USA:ACM Press,2003:1174-1178.
[13]Campos R,Rojas O,Marin M,et al.Distributed Ontology-driven Focused Crawling[C]//Proceedings of the 21st Euromicro International Conference on Parallel,Distri-buted and Network-based Processing.Washington D.C.,USA:IEEE Press,2013:108-115.
[14]Zheng Haitao,Kang B Y,Kim H G.An Ontology-based Approach to Learnable Focused Crawling[J].Infor-mation Sciences,2008,178(23):4512-4522.
[15]石静,吴云芳,邱立坤,等.基于大规模语料库的汉语词义相似度计算方法[J].中文信息学报,2013,27(1):1-7.
[16]朱嫣岚,闵锦,周雅倩,等.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20.
[17]胡哲,郑诚.改进的概念语义相似度计算[J].计算机工程与设计,2010,31(5):1121-1124.
[18]黄果,周竹荣.基于领域本体的概念语义相似度计算研究[J].计算机工程与设计,2007,28(10):2460-2463.
[19]崔其文,解福.改进的领域本体概念语义相似度计算方法[J].计算机应用与软件,2012,29(2):173-174.
[20]李文杰,赵岩.基于本体结构的概念间语义相似度算法[J].计算机工程,2010,36(23):4-6.
[21]Peng Tao,Zhang Changli,Zuo Wanli.Tunneling Enhanced by Web Page Content Block Partition for Focused Crawl-ing[J].Concurrency and Computation:Practice and Experience,2008,20(1):61-74.
[22]宋聚平,王永成.对网页PageRank 算法的改进[J].上海交通大学学报,2003,37(3):397-400.
[23]Menczer F,Pant G,Srinivasan P,et al.Evaluating Topic-driven Web Crawlers[C]//Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.New York,USA:ACM Press,2001:241-249.
编辑金胡考 |