[1]丁兆云,贾焰,周斌.微博数据挖掘研究综述[J].计算机研究与发展,2014,51(4):691-706.
[2]LIU B.Web data mining:exploring hyperlinks,contents,and usage data [M].Berlin,Germany:Springer,2009.
[3]杨亮,许侃,林鸿飞,等.博客作者声誉度分析[J].计算机科学与探索,2013,7(9):838-847.
[4]杨风雷,黎建辉.用户生成内容中的垃圾意见研究综述[J].计算机应用研究,2011,28(10):3601-3605.
[5]JINDAL N,LIU B.Review spam detection[C]//Proceedings of IEEE International Conference on World Wide Web.Washington D.C.,USA:IEEE Press,2007:1189-1190.
[6]JINDAL N,LIU B.Opinion spam and analysis[C]//Proceedings of IEEE International Conference on Web Search and Data Mining.Washington D.C.,USA:IEEE Press,2008:219-230.
[7]邓冰娜,王煜,刘宇.一种应用于博客的垃圾评论识别方法[J].郑州大学学报(理学版),2011,43(1):65-69.
[8]黄铃,李学明.基于AdaBoost的微博垃圾评论识别方法[J].计算机应用,2013,33(12):3563-3566.
[9]LAI C L,XU K Q,LAU R Y K,et al.High-order concept associations mining and inferential language modeling for online review spam detection[C]//Proceedings of IEEE International Conference on Data Mining Workshops.Washington D.C.,USA:IEEE Press,2010:1120-1127.
[10]刁宇峰,杨亮,林鸿飞.基于LDA模型的博客垃圾评论发现[J].中文信息学报,2011,25(1):41-47.
[11]SURENDRA S,AIXIN S.Hspam14:a collection of 14 million tweets for hashtag-oriented spam research[C]//Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval.New York,USA:ACM Press,2015:223-232.
[12]姚子瑜,屠守中,黄民烈,等.一种半监督的中文垃圾微博过滤方法[J].中文信息学报,2016,30(5):176-186.
[13]FREUND Y,SCHAPIRE R E.A decision-theoretic generalization of on-line learning and an application to boosting[C]//Proceedings of European Conference on Computational Learning Theory.Berlin,Germany:Springer,1995:23-27.
[14]YAROWSKY D.Unsupervised word sense disambiguation rivaling supervised methods[C]//Proceedings of the 33rd Annual Meeting on Association for Computational Linguistics,Washington D.C.,USA:IEEE Press,1995:189-196.
[15]NIGAM K,MCCALLUM A K,THRUN S,et al.Text classification from labeled and unlabeled documents using EM[J].Machine Learning,2000,39(2):103-134.
[16]ZHOU Z H,LI M.Tri-training:exploiting unlabeled data using three classifiers[J].IEEE Transactions on Knowledge & Data Engineering,2005,17(11):1529-1541.
[17]BREIMAN L.Random forests[J].Machine Learning,2001,45(1):5-32.
[18]ZHOU Z H,ZHAN D C,YANG Q.Semi-supervised learning with very few labeled training examples[C]//Proceedings of AAAI Conference on Artificial Intelligence.[S.1.]:AAAI Press,2007:675-680.
[19]LI M,ZHOU Z H.Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples[J].IEEE Transactions on Systems,Man,and Cybernetics,Part A,2007,37(6):1088-1098.
[20]田久乐,赵蔚.基于同义词词林的词语相似度计算方法[J].吉林大学学报(信息科学版),2010,28(6):602-608.
[21]张剑峰,夏云庆,姚建民.微博文本处理研究综述[J].中文信息学报,2012,26(4):21-27.
[22]CHANG C C,LIN C J.LIBSVM:A library for support vector machines[J].ACM Transactions on Intelligent Systems & Technology,2007,2(3):27-33.
|