Editor: Liu Bing