False Comment Detection Model Based on Sentiment-Enhanced BERT and Multi-Task Generative Adversarial Networks

doi:10.19678/j.issn.1000-3428.0252154

Abstract

Abstract:

Current false comment detection models face several problems such as insufficient mining of deep emotional features, lack of semantic dependency relationships, and poor generalization performance. In response to these, a false comment recognition model, DEBR-GAN, based on emotion-weighted BERT and multi-task adversarial learning, is proposed. First, using an emotion dictionary to assist in pretraining BERT, the potential emotional information in the comment text is extracted through an emotion weighting mechanism, thereby enhancing the ability to capture subtle emotional changes in the comments. Subsequently, a Recurrent Neural Network (RNN) is used to process the semantic features output by BERT, fully exploring the temporal dependencies and contextual relationships between words in comments for improving sensitivity to text details. Furthermore, to enhance the robustness and generalization ability of the model in multi-domain scenarios, DEBR-GAN draws on the adversarial learning concept of the Generative Adversarial Networks (GAN), treating the fake comment detector as a feature generator for extracting effective features shared across domains. Simultaneously, by setting category discriminators and rating discriminators, gradient reversal techniques are used in the backpropagation process to engage in adversarial games with the generator. This effectively eliminates the interference of category information and user rating preferences in the feature extraction process, thereby ensuring that the detector is highly accurate in identifying fake comments. The experimental results show that, on the Dianping dataset, the F1 value of the DEBR-GAN model is as high as 0.926. Compared with those of the model without the multi-task adversarial learning module and the current best baseline model, the classification accuracy of DEBR-GAN is increased by 5.1 and 3.51 percentage points, respectively. In addition, DEBR-GAN exhibits high recognition accuracy in handling comments with different emotional tendencies and semantic structures, thereby verifying the effectiveness and superiority of combining emotional enhancement and adversarial learning in false comment detection.

Key words: sentiment enhancement, Generative Adversarial Networks (GAN), fake comment detection, social network comment, BERT

摘要：

针对当前虚假评论检测模型存在的深层情感特征挖掘不足、语义依赖关系缺失以及泛化性能不佳的问题, 提出一种基于情感加权BERT与多任务对抗学习的虚假评论识别模型DEBR-GAN。首先, 借助情感词典辅助预训练BERT, 通过情感加权机制对评论文本中的潜在情感信息进行提取, 从而增强对评论中细微情绪变化的捕捉能力; 随后, 采用循环神经网络(RNN)对BERT输出的语义特征进行处理, 充分挖掘评论中词语之间的时序依赖及上下文关系, 以提高对文本细节的敏感性; 接着, 为提升模型在多领域场景下的鲁棒性与泛化能力, DEBR-GAN借鉴了生成对抗网络(GAN)的对抗学习思想, 将虚假评论检测器视为特征生成器, 用于提取跨领域共享的有效特征, 同时, 通过设置类别鉴别器和评分鉴别器, 在反向传播过程中采用梯度反转技术, 与生成器进行对抗博弈, 有效消除类别信息和用户评分偏好对特征提取过程的干扰, 从而保证检测器在识别虚假评论时具有高准确性。实验结果表明, 在大众点评数据集上, DEBR-GAN模型的F1值高达0.926, 与未引入多任务对抗学习模块的模型相比, 其分类准确率提高了5.1百分点, 而相较于当前最佳基线模型则提升了3.51百分点。此外, 该模型在处理不同情感倾向和语义结构的评论时均表现出较高的识别准确率, 充分验证了情感增强与对抗学习相结合在虚假评论检测中的有效性与优越性。

关键词: 情感增强, 生成对抗网络, 虚假评论检测, 社交网络评论, BERT

LI Dan, XIE Yuhan, HAN Xiaoshuai, LÜ Chen. False Comment Detection Model Based on Sentiment-Enhanced BERT and Multi-Task Generative Adversarial Networks[J]. Computer Engineering, 2026, 52(4): 276-289.

李丹, 谢语涵, 韩潇帅, 吕晨. 基于情感增强BERT与多任务生成对抗网络的虚假评论检测模型[J]. 计算机工程, 2026, 52(4): 276-289.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0252154

https://www.ecice06.com/EN/Y2026/V52/I4/276

Figures/Tables 12

Fig.1 Framework of DEBR-GAN

Fig.2 Word cloud for different comment categories

Fig.3 False comment sentiment heatmap

Fig.4 Real comment sentiment heatmap

Fig.5 Results of ablation experiment

Fig.6 The variation of model efficiency with semantic and emotional weights

Fig.7 The variation of model efficiency with the weights of μ and θ

References 58

1	杨铭, 祁巍, 闫相斌, 等. 在线商品评论的效用分析研究. 管理科学学报, 2012 (5): 65- 75.
	YANG M , QI W , YAN X B , et al. Utility analysis for online product review. Journal of Management Sciences in China, 2012 (5): 65- 75.
2	钟科, 李佩镅, 马士伟. 游客在线评论中嗅觉文字线索的价值. 旅游科学, 2019, 33 (6): 17- 32.
	ZHONG K , LI P M , MA S W . The value of olfactory text cues in visitors' online reviews. Tourism Science, 2019, 33 (6): 17- 32.
3	施运梅, 袁博, 张乐, 等. IMTS: 融合图像与文本语义的虚假评论检测方法. 数据分析与知识发现, 2022, 6 (8): 84- 96.
	SHI Y M , YUAN B , ZHANG L , et al. IMTS: detecting fake reviews with image and text semantics. Data Analysis and Knowledge Discovery, 2022, 6 (8): 84- 96.
4	QU Z , LÜ C , CHI C H . MUSH: multi-stimuli hawkes process based sybil attacker detector for user-review social networks. IEEE Transactions on Network and Service Management, 2022, 19 (4): 4600- 4614. doi: 10.1109/TNSM.2022.3186513
5	李璐旸, 秦兵, 刘挺. 虚假评论检测研究综述. 计算机学报, 2018, 41 (4): 946- 968.
	LI L Y , QIN B , LIU T . Survey on fake review detection research. Chinese Journal of Computers, 2018, 41 (4): 946- 968.
6	LEE M , JEONG M , LEE J . Roles of negative emotions in customers' perceived helpfulness of hotel reviews on a user-generated review website: a text mining approach. International Journal of Contemporary Hospitality Management, 2017, 29 (2): 762- 783. doi: 10.1108/IJCHM-10-2015-0626
7	YE J T , KUMAR S , AKOGLU L . Temporal opinion spam detection by multivariate indicative signals. Proceedings of the International AAAI Conference on Web and Social Media, 2016, 10 (1): 743- 746.
8	LIU A X , XIE Y , ZHANG J R . It's not just what you say, but how you say it: the effect of language style matching on perceived quality of consumer reviews. Journal of Interactive Marketing, 2019, 46, 70- 86. doi: 10.1016/j.intmar.2018.11.001
9	KIM S M, PANTEL P, CHKLOVSKI T, et al. Automatically assessing review helpfulness[C]//Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. [S. l.]: ACL, 2006: 423.
10	ANDERSON E, SIMESTER D. Deceptive reviews: the influential tail[EB/OL]. [2025-01-05]. https://www.semanticscholar.org/paper/Deceptive-Reviews-:-The-Influential-Tail-Simester/9061fb46185cffde44168aff4eb17f25d520f93a/figure/0.
11	ZHU L , LIN Y , CHENG M M . Sentiment and guest satisfaction with peer-to-peer accommodation: when are online ratings more trustworthy?. International Journal of Hospitality Management, 2020, 86, 102369. doi: 10.1016/j.ijhm.2019.102369
12	鄢慧丽, 沈宸欣, 熊浩. 反讽对评论有用性的机制研究: 以酒店在线评论为情境[J/OL]. 旅游科学: 1-23. [2025-01-05]. https://link.cnki.net/doi/10.16323/j.cnki.lykx.20240913.001.
	YAN H L, SHEN C X, XIONG H. Research on mechanism of irony on review helpfulness: a scenario of hotel online reviews[J/OL]. Tourism Science: 1-23. [2025-01-05]. https://link.cnki.net/doi/10.16323/j.cnki.lykx.20240913.001. (in Chinese)
13	曹蓓, 赵奎. 基于双重情感和多特征融合的虚假新闻检测. 计算机工程, 2025, 51 (6): 193- 203. doi: 10.19678/j.issn.1000-3428.0070158
	CAO B , ZHAO K . Dual emotion and multi-feature fusion based fake news detection. Computer Engineering, 2025, 51 (6): 193- 203. doi: 10.19678/j.issn.1000-3428.0070158
14	TRUEMAN T E , KUMAR A , NARAYANASAMY P , et al. Attention-based C-BiLSTM for fake news detection. Applied Soft Computing, 2021, 110, 107600. doi: 10.1016/j.asoc.2021.107600
15	CHIN J Y, ZHAO K Q, JOTY S, et al. ANR: aspect-based neural recommender[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. New York, USA: ACM Press, 2018: 147-156.
16	MCAULEY J, LESKOVEC J. Hidden factors and hidden topics: understanding rating dimensions with review text[C]//Proceedings of the 7th ACM Conference on Recommender Systems. New York, USA: ACM Press, 2013: 165-172.
17	XU F , SHENG V S , WANG M W . Near real-time topic-driven rumor detection in source microblogs. Knowledge-Based Systems, 2020, 207, 106391. doi: 10.1016/j.knosys.2020.106391
18	王根生, 朱奕, 李胜. 一种融合知识图谱的图注意力神经网络谣言实时检测方法. 数据分析与知识发现, 2024, 8 (6): 95- 106.
	WANG G S , ZHU Y , LI S . A real-time rumor detection method based on the graph attention neural network integrated with the knowledge graph. Data Analysis and Knowledge Discovery, 2024, 8 (6): 95- 106.
19	黄学坚, 王根生, 罗远胜, 等. 融合多元用户特征和内容特征的微博谣言实时检测模型. 小型微型计算机系统, 2022, 43 (12): 2518- 2527.
	HUANG X J , WANG G S , LUO Y S , et al. Weibo rumors real-time detection model based on fusion of multi user features and content features. Journal of Chinese Computer Systems, 2022, 43 (12): 2518- 2527.
20	郭贤伟, 赖华, 余正涛, 等. 融合情绪知识的案件微博评论情绪分类. 计算机学报, 2021, 44 (3): 564- 578.
	GUO X W , LAI H , YU Z T , et al. Emotion classification of case-related microblog comments integrating emotional knowledge. Chinese Journal of Computers, 2021, 44 (3): 564- 578.
21	张换香, 彭俊杰. 基于方面级情感分析的深度语义挖掘模型. 电子学报, 2024, 52 (7): 2307- 2319.
	ZHANG H X , PENG J J . A deep semantic mining model based on aspect-level sentiment analysis. Acta Electronica Sinica, 2024, 52 (7): 2307- 2319.
22	曾雪强, 华鑫, 刘平生, 等. 基于情感轮和情感词典的文本情感分布标记增强方法. 计算机学报, 2021, 44 (6): 1080- 1094.
	ZENG X Q , HUA X , LIU P S , et al. Emotion wheel and lexicon based text emotion distribution label enhancement method. Chinese Journal of Computers, 2021, 44 (6): 1080- 1094.
23	MENG Y, ZHANG Y Y, HUANG J X, et al. Text classification using label names only: a language model self-training approach[EB/OL]. [2025-01-05]. https://arxiv.org/abs/2010.07245.
24	程艳, 尧磊波, 张光河, 等. 基于注意力机制的多通道CNN和BiGRU的文本情感倾向性分析. 计算机研究与发展, 2020, 57 (12): 2583- 2595.
	CHENG Y , YAO L B , ZHANG G H , et al. Text sentiment orientation analysis of multi-channels CNN and BiGRU based on attention mechanism. Journal of Computer Research and Development, 2020, 57 (12): 2583- 2595.
25	OTT M, CHOI Y, CARDIE C, et al. Finding deceptive opinion spam by any stretch of the imagination[EB/OL]. [2025-01-05]. https://arxiv.org/abs/1107.4557.
26	DENG X L, CHEN R Y. Sentiment analysis based online restaurants fake reviews hype detection[EB/OL]. [2025-01-05]. https://link.springer.com/chapter/10.1007/978-3-319-11119-3_1.
27	赵军, 王红. 融合情感极性和逻辑回归的虚假评论检测方法. 智能系统学报, 2016, 11 (3): 336- 342.
	ZHAO J , WANG H . Detection of fake reviews based on emotional orientation and logistic regression. CAAI Transactions on Intelligent Systems, 2016, 11 (3): 336- 342.
28	WU Z Y , PI D C , CHEN J F , et al. Rumor detection based on propagation graph neural network with attention mechanism. Expert Systems with Applications, 2020, 158, 113595. doi: 10.1016/j.eswa.2020.113595
29	陈妍, 张小威, 金赞, 等. 基于加权GraphSAGE和生成对抗网络的医保欺诈识别方法. 系统工程理论与实践, 2024, 44 (2): 732- 751.
	CHEN Y , ZHANG X W , JIN Z , et al. Medical fraud detection method based on weighted GraphSAGE and generative adversarial network. Systems Engineering-Theory & Practice, 2024, 44 (2): 732- 751.
30	任亚峰, 尹兰, 姬东鸿. 基于语言结构和情感极性的虚假评论识别. 计算机科学与探索, 2014, 8 (3): 313- 320.
	REN Y F , YIN L , JI D H . Deceptive reviews detection based on language structure and sentiment polarity. Journal of Frontiers of Computer Science & Technology, 2014, 8 (3): 313- 320.
31	FAN C, YAN H Y, DU J C, et al. A knowledge regularized hierarchical approach for emotion cause analysis[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). [S. l.]: ACL, 2019: 5613-5623.
32	SHIN B, LEE T, CHOI J D. Lexicon integrated CNN models with attention for sentiment analysis[EB/OL]. [2025-01-05]. https://arxiv.org/abs/1610.06272.
33	JINDAL N, LIU B. Opinion spam and analysis[C]//Proceedings of the International Conference on Web Search and Web Data Mining. New York, USA: ACM Press, 2008: 219.
34	LI X L, YU P S, LIU B, et al. Positive unlabeled learning for data stream classification[C]//Proceedings of the 2009 SIAM International Conference on Data Mining. New York, USA: ACM Press, 2009: 259-270.
35	杜姗, 杨敏, 仇蓉蓉. 基于SMOTE-RF与多维特征向量的在线商品虚假评论识别研究. 情报杂志, 2023, 42 (4): 156- 164.
	DU S , YANG M , QIU R R . Research on recognition of fake online product reviews based on SMOTE-RF and multidimensional feature vector. Journal of Intelligence, 2023, 42 (4): 156- 164.
36	张文, 王强, 步超骐, 等. 基于Co-training协同训练的在线虚假评论识别研究. 系统工程理论与实践, 2020, 40 (10): 2669- 2683.
	ZHANG W , WANG Q , BU C Q , et al. A study on deceptive review identification based on Co-training. Systems Engineering-Theory & Practice, 2020, 40 (10): 2669- 2683.
37	王一杰, 崔彩霞. 基于代价敏感图卷积网络的虚假评论检测研究. 太原师范学院学报(自然科学版), 2023, 22 (4): 37- 42.
	WANG Y J , CUI C X . Research on false review detection based on cost-sensitive graph convolutional network. Journal of Taiyuan Normal University (Natural Science Edition), 2023, 22 (4): 37- 42.
38	XU Y Z, LI Q. Attention-based feature fusion network for fake reviews detection[C]//Proceedings of the 3rd International Conference on Artificial Intelligence and Advanced Manufacture. New York, USA: ACM Press, 2022: 666-671.
39	MOHAWESH R , XU S X , TRAN S N , et al. Fake reviews detection: a survey. IEEE Access, 2021, 9, 65771- 65802. doi: 10.1109/ACCESS.2021.3075573
40	GOLDANI M H , MOMTAZI S , SAFABAKHSH R . Detecting fake news with capsule neural networks. Applied Soft Computing, 2021, 101, 106991. doi: 10.1016/j.asoc.2020.106991
41	MA J, GAO W, WONG K F. Detect rumors on Twitter by promoting information campaigns with generative adversarial learning[C]//Proceedings of the World Wide Web Conference. New York, USA: ACM Press, 2019: 3049-3055.
42	张洪志, 但志平, 董方敏, 等. 基于循环生成对抗网络和Wasserstein损失的谣言检测研究. 数据分析与知识发现, 2024, 8 (7): 32- 43.
	ZHANG H Z , DAN Z P , DONG F M , et al. Detecting rumors based on CycleGAN and Wasserstein loss. Data Analysis and Knowledge Discovery, 2024, 8 (7): 32- 43.
43	CHENG M X , LI Y Z , NAZARIAN S , et al. From rumor to genetic mutation detection with explanations: a GAN approach. Scientific Reports, 2021, 11, 5861. doi: 10.1038/s41598-021-84993-1
44	VASWANI A. Attention is all you need[EB/OL]. [2025-01-05]. https://arxiv.org/abs/1706.03762.
45	PENNEBAKER J W, FRANCIS M E, BOOTH R J. Linguistic inquiry and word count: LIWC 2001[EB/OL]. [2025-01-05]. https://www.researchgate.net/publication/239667728_Linguistic_Inquiry_and_Word_Count_LIWC_LIWC2001.
46	HU M Q, LIU B. Mining and summarizing customer reviews[C]//Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM Press, 2004: 168-177.
47	王科, 夏睿. 情感词典自动构建方法综述. 自动化学报, 2016, 42 (4): 495- 511.
	WANG K , XIA R . A survey on automatical construction methods of sentiment lexicons. Acta Automatica Sinica, 2016, 42 (4): 495- 511.
48	徐琳宏, 林鸿飞, 赵晶. 情感语料库的构建和分析. 中文信息学报, 2008, 22 (1): 116- 122.
	XU L H , LIN H F , ZHAO J . Construction and analysis of emotional corpus. Journal of Chinese Information Processing, 2008, 22 (1): 116- 122.
49	ZHENG H Z, XUE M H, LU H, et al. Smoke screener or straight shooter: detecting elite sybil attacks in user-review social networks[EB/OL]. [2025-01-05]. https://arxiv.org/abs/1709.06916.
50	LI F T, HUANG M L, YANG Y, et al. Learning to identify review spam[C]//Proceedings of the 22nd International Joint Conference on Artificial Intelligence. New York, USA: ACM Press, 2011: 2488-2493.
51	SAMADI M , MOUSAVIAN M , MOMTAZI S . Deep contextualized text representation and learning for fake news detection. Information Processing & Management, 2021, 58 (6): 102723.
52	LÜ C, HUANG D M, JIA Q Y, et al. Predictable model for detecting sybil attacks in mobile social networks[C]//Proceedings of the IEEE Wireless Communications and Networking Conference (WCNC). Washington D.C., USA: IEEE Press, 2021: 1-6.
53	WANG Y Q, MA F L, JIN Z W, et al. EANN: event adversarial neural networks for multi-modal fake news detection[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York, USA: ACM Press, 2018: 849-857.
54	ZHANG X Y, CAO J, LI X R, et al. Mining dual emotion for fake news detection[C]//Proceedings of the Web Conference 2021. New York, USA: ACM Press, 2021: 3465-3476.
55	MA J, GAO W, MITRA P, et al. Detecting rumors from microblogs with recurrent neural networks[C]//Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York, USA: ACM Press, 2016: 3818-3824.
56	RAKHLIN A. Convolutional neural networks for sentence classification[EB/OL]. [2025-01-05]. https://github.com/yoonkim/CNN_sentence.
57	LIU Y H, OTT M, GOYAL N, et al. RoBERTa: a robustly optimized BERT pretraining approach[EB/OL]. [2025-01-05]. https://arxiv.org/abs/1907.11692.
58	CUI Y M , CHE W X , LIU T , et al. Pre-training with whole word masking for Chinese BERT. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 29, 3504- 3514. doi: 10.1109/TASLP.2021.3124365

[1]	FU Yan, LIU Peiyi, YE Ou. Named Entity Recognition Method for Unsafe Underground Behaviors in Coal Mines [J]. Computer Engineering, 2026, 52(4): 424-432.
[2]	JIA Jianghao, ZHANG Ziwei, GAO Liting, WEN Juan, XUE Yiming. Text Steganalysis Method Based on Hierarchy-Aware Matching [J]. Computer Engineering, 2026, 52(2): 245-252.
[3]	ZHANG Yimin, HUANG Xiaoying, HUANG Zhengyang, YANG Chaoxiang, WAN Yongjing, JIANG Cuiling. Application of Bionic Design Algorithms Based on Latent Spatial Multi-Scale Fusion of Cross-Domain Images [J]. Computer Engineering, 2025, 51(5): 266-278.
[4]	YIN Zhaoliang, HUANG Yuxin, YU Zhengtao, WANG Guanwen, AI Chuanxian. A Method for Analyzing News Themes Involving Cases with Integrated Crime Classification [J]. Computer Engineering, 2025, 51(4): 208-216.
[5]	YANG Jiahui, YOU Zaijin, NI Lifu, ZHAO Yu, LI Wanying. Study on Optimization of Berth-Quay Crane Emission Reduction Cooperative Scheduling in Container Terminals [J]. Computer Engineering, 2025, 51(10): 381-391.
[6]	ZHENG Yazhou, LIU Wanping, HUANG Dong. A BERT-CNN-GRU Detection Method Based on Attention Mechanism [J]. Computer Engineering, 2025, 51(1): 258-268.
[7]	QU Xiaoya, LI Bing, WEN Liqiang. Research on Event Extraction for Administrative Law Enforcement Case Texts [J]. Computer Engineering, 2024, 50(9): 63-71.
[8]	Han CHEN, Chunlei ZHAO, Haoda JIANG, Chundong WANG. Research on App User Intent Recognition Based on Fusion Model and Semantic Network [J]. Computer Engineering, 2024, 50(8): 50-63.
[9]	LI Xue, WANG Yawen, ZHANG Qianjin. Automatic Naming of Source Code Based on Information Retrieval [J]. Computer Engineering, 2024, 50(6): 304-310.
[10]	LI Tianfang, PU Yuanyuan, ZHAO Zhengpeng, XU Dan, QIAN Wenhua. Image-to-Image Translation Based on CLIP and Dual-Spatially Adaptive Normalization [J]. Computer Engineering, 2024, 50(5): 229-240.
[11]	ZHOU Zhaochen, FANG Qingmao, WU Xiaohong, HU Ping, HE Xiaohai. Machine Reading Comprehension Model Based on MacBERT and Adversarial Training [J]. Computer Engineering, 2024, 50(5): 41-50.
[12]	WANG Minghu, SHI Zhikui, SU Jia, ZHANG Xinsheng. Sequence Recommendation Method Based on RoBERTa and Graph-Enhanced Transformer [J]. Computer Engineering, 2024, 50(4): 121-131.
[13]	Chenmin GAN, Hong TANG, Haolan YANG, Xiaojie LIU, Jie LIU. Abstractive Text Summarization Method Incorporating Convolutional Shrinkage Gating [J]. Computer Engineering, 2024, 50(2): 98-104.
[14]	YUAN Pingyu, QIU Lin. Web Attacks Detection Method Based on BERT with Multi-Model Fusion [J]. Computer Engineering, 2024, 50(11): 197-206.
[15]	SHI Junxiao, CHEN Yanping, MU Zhaonan. Predicate Center Word Recognition Model Fused with Multiscale Span Features [J]. Computer Engineering, 2024, 50(10): 137-144.

Please choose a citation manager

Content to export