面向社交媒体评论的上下文语境讽刺检测模型

doi:10.19678/j.issn.1000-3428.0056606

计算机工程 ›› 2021, Vol. 47 ›› Issue (1): 66-71. doi: 10.19678/j.issn.1000-3428.0056606

面向社交媒体评论的上下文语境讽刺检测模型

韩虎^1,2, 赵启涛¹, 孙天岳¹, 刘国利¹

1. 兰州交通大学电子与信息工程学院, 兰州 730070;
2. 甘肃省人工智能与图形图像工程研究中心, 兰州 730070

收稿日期:2019-11-15 修回日期:2020-01-07 发布日期:2020-01-17
作者简介:韩虎(1977-),男,副教授、博士,主研方向为机器学习、数据挖掘;赵启涛、孙天岳、刘国利,硕士研究生。
基金资助:
国家自然科学基金（61562057）；国家社会科学基金（17BXW071）；甘肃省科技计划项目（18JR3RA104）。

Contextual Sarcasm Detection Model for Social Media Comments

HAN Hu^1,2, ZHAO Qitao¹, SUN Tianyue¹, LIU Guoli¹

1. School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China;
2. Gansu Provincial Engineering Research Center for Artificial Intelligence and Graphic and Image Processing, Lanzhou 730070, China

Received:2019-11-15 Revised:2020-01-07 Published:2020-01-17

摘要/Abstract

摘要： 讽刺是日常交际中一种常见的语用现象，能够丰富说话者的观点并间接地表达说话者的深层含义。讽刺检测任务的研究目标是挖掘目标语句的讽刺倾向。针对讽刺语境表达变化多样以及不同用户、不同主题下的讽刺含义各不相同等特征，构建融合用户嵌入与论坛主题嵌入的上下文语境讽刺检测模型。该模型借助ParagraphVector方法的序列学习能力对用户评论文档与论坛主题文档进行编码，从而获取目标分类句的用户讽刺特征与主题特征，并利用一个双向门控循环单元神经网络得到目标句的语句编码。在标准讽刺检测数据集上进行的实验结果表明，与传统Bag-of-Words、CNN等模型相比，该模型能够有效提取语句的上下文语境信息，具有较高的讽刺检测分类准确率。

关键词: 自然语言处理, 上下文语境讽刺检测, 深度学习, ParagraphVector模型, 双向门控循环单元模型

Abstract: Sarcasm is a common pragmatic phenomenon in daily communication that enriches the views of speakers and indirectly expresses the their deep meaning.The research goal of sarcasm detection task is to mine the sarcasm tendency of target sentences.As the contexts and expressions of sarcasm is diverse,and the meaning of sarcasm varies according to users and topics,this paper proposes a contextual sarcasm detection model fusing users' embedding and forum topic embedding.The model uses the sequence learning ability of ParagraphVector method to encode the documents of user comments and forum topics to obtain the satirical features of users and topic features of the target sentence.Then a Bi-directional-Gated Recurrent Unit(Bi-GRU) neural network is used to obtain the sentence code of the target sentence.Experimental results on the standard sarcasm detection dataset show that compared with traditional Bag-of-Words,CNN and other models,this model can effectively extract the contextual information of sentences,and has a higher accuracy of sarcasm detection and classification.

Key words: Natural Language Processing(NLP), contextual sarcasm detection, deep learning, ParagraphVector model, Bi-directional-Gated Recurrent Unit(Bi-GRU) model

中图分类号:

TP391

韩虎, 赵启涛, 孙天岳, 刘国利. 面向社交媒体评论的上下文语境讽刺检测模型[J]. 计算机工程, 2021, 47(1): 66-71.

HAN Hu, ZHAO Qitao, SUN Tianyue, LIU Guoli. Contextual Sarcasm Detection Model for Social Media Comments[J]. Computer Engineering, 2021, 47(1): 66-71.

http://www.ecice06.com/CN/Y2021/V47/I1/66

图/表 6

20210125163247

20210125163252

20210125163255

20210125163258

20210125163301

20210125163304

参考文献

[1] PANG B,LEE L.Opinion mining and sentiment analysis[J].Foundations and Trends in Information Retrieval,2008,2(1/2):1-13.
[2] CARVALHO P,SARMENTO L,SILVA M J,et al.Clues for detecting irony in user-generated contents:oh…!! it's "so easy"[C]//Proceedings of the 1st International CIKM Workshop on Topic-sentiment Analysis for Mass Opinion.New York,USA:ACM Press,2009:53-56.
[3] RILOFF E,QADIR A,SURVE P,et al.Sarcasm as contrast between a positive sentiment and negative situation[C]//Proceedings of 2013 IEEE Conference on Empirical Methods in Natural Language Processing.Washington D.C.,USA:IEEE Press,2013:704-714.
[4] BAMMAN D,SMITH N A.Contextualized sarcasm detection on twitter[C]//Proceedings of the 9th International AAAI Conference on Web and Social Media.[S.1.]:AAAI Press,2015:457-468.
[5] AMIR S,WALLACE B C,LYU H,et al.Modelling context with user embeddings for sarcasm detection in social media[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1607.00976.pdf.
[6] HAZARIKA D,PORIA S,GORANTLA S,et al.CASCADE:contextual sarcasm detection in online discussion forums[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1805.06413.
[7] HOTELLING H.Relations between two sets of variates[J].Biometrika,1936,28(3/4):321-377.
[8] KHODAK M,SAUNSHI N,VODRAHALLI K.A large self-annotated corpus for sarcasm[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1704.05579.pdf.
[9] LE Q,MIKOLOV T.Distributed representations of sentences and documents[C]//Proceedings of IEEE Inter-national Conference on Machine Learning.Washington D.C.,USA:IEEE Press,2014:1188-1196.
[10] CHO K,VAN MERRIENBOER B,GULCEHRE C,et al.Learning phrase representations using RNN encoder-decoder for statistical machine translation[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1406.1078.pdf.
[11] MIKOLOV T,SYTSKEVER I,CHEN K,et al.Distributed representations of words and phrases and their com-positionality[J].Neural Information Processing Systems,2013,26:3111-3119.
[12] LECUN Y,BEBGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436-444.
[13] PORIN S,CAMBRIA E,HAZARIKA D,et al.A deeper look into sarcastic tweets using deep convolutional neural networks[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1610.08815.pdf.
[14] ZHANG Meishan,ZHANG Yue,FU Guohong.Tweet sarcasm detection using deep neural network[C]//Pro-ceedings of the 26th International Conference on Com-putational Linguistics:Technical Papers.Washington D.C.,USA:IEEE Press,2016:2449-2460.
[15] GHOSH A,VEALE T.Fracking sarcasm using neural network[C]//Proceedings of the 7th IEEE Workshop on Computational Approaches to Subjectivity,Sentiment and Social Media Analysis.Washington D.C.,USA:IEEE Press,2016:161-169.
[16] TAY Y,TUAN L A,HUI S C,et al.Reasoning with sarcasm by reading in-between[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1805.02856.pdf.
[17] GOLDBERG Y.Neural network methods for natural language processing[J].Human Language Technologies,2017,10(1):288-309.
[18] BENGIO Y,DUCHARME R,VINCENT P,et al.A neural probabilistic language model[J].Journal of Machine Learning Research,2003,3:1137-1155.
[19] PENNINGTON J,SOCHER R,MANNING C.GloVe:global vectors for word representation[C]//Proceedings of 2014 IEEE Conference on Empirical Methods in Natural Language Processing.Washington D.C.,USA:IEEE Press,2014:1532-1543.
[20] KIM Y.Convolutional neural networks for sentence classification[EB/OL].[2019-10-10].https://arxiv.xilesou.top/pdf/1408.5882.pdf.

选择文件类型/文献管理软件名称

选择包含的内容

面向社交媒体评论的上下文语境讽刺检测模型

Contextual Sarcasm Detection Model for Social Media Comments

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 6

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	江雨燕, 陶承凤, 李平. 数据增强和自适应自步学习的深度子空间聚类算法[J]. 计算机工程, 2023, 49(8): 96-103, 110.
[2]	李泽水, 冀俊忠, 杨翠翠. 基于边权重信息深度网络嵌入的PPIN功能模块检测[J]. 计算机工程, 2023, 49(8): 69-76.
[3]	王可铮, 徐玉芬, 周尚波. 结合对比感知损失和融合注意力的图像去雾模型[J]. 计算机工程, 2023, 49(8): 207-214.
[4]	刘俊豪, 王美林, 谢兴, 宋烨兴, 许莉花. 基于改进YOLOv5的皮革瑕疵检测算法[J]. 计算机工程, 2023, 49(8): 240-249.
[5]	闫兴亚, 匡娅茜, 白光睿, 李月. 基于深度学习的学生课堂行为识别方法[J]. 计算机工程, 2023, 49(7): 251-258.
[6]	郭艳霞, 金勇, 唐宏, 彭金枝. 基于动态卷积与残差门控的多模态情感识别[J]. 计算机工程, 2023, 49(7): 94-101.
[7]	李军侠, 王星驰, 殷梓, 石德硕. 边缘深度挖掘的弱监督显著性目标检测[J]. 计算机工程, 2023, 49(7): 169-178.
[8]	吴珊, 周凤. 基于改进SSD算法的小目标检测[J]. 计算机工程, 2023, 49(7): 179-188.
[9]	席建锐, 唐红梅, 梁春阳, 刘鑫. 基于改进隐函数的点云物体重建[J]. 计算机工程, 2023, 49(7): 214-222.
[10]	齐咏生, 杜晓旭, 朱俊峰, 高胜利, 刘利强. 基于增强型轻量深度网络的牧区牲畜高效检测[J]. 计算机工程, 2023, 49(7): 278-287.
[11]	谌雨章, 黄逸姿, 张钧涵. 基于多速率空洞卷积的多尺度水下小目标检测[J]. 计算机工程, 2023, 49(6): 257-264.
[12]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[13]	于海洋, 景鹏, 张文涛, 谢赛飞, 滑志华, 宋草原. 基于残差与注意力机制的道路裂缝检测U-Net改进模型[J]. 计算机工程, 2023, 49(6): 265-273.
[14]	王爱玲, 马文臻, 邹自明, 钟佳. 基于领域自适应的卫星工程参数异常检测[J]. 计算机工程, 2023, 49(5): 29-37,47.
[15]	宋羽凯, 谢江. 基于多任务学习的轻量级语音情感识别模型[J]. 计算机工程, 2023, 49(5): 122-128.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

面向社交媒体评论的上下文语境讽刺检测模型

Contextual Sarcasm Detection Model for Social Media Comments

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 6

参考文献

相关文章 15

编辑推荐

Metrics

本文评价