融合词性语义扩展信息的事件检测模型

doi:10.19678/j.issn.1000-3428.0066880

摘要/Abstract

摘要：

事件检测是事件抽取中的关键步骤，依赖于触发词进行事件类型分类。现有主流事件检测方法在稀疏标记数据上性能较差，模型过度拟合密集标注的触发词，在稀疏标记的触发词或者未见过的触发词上容易失效。改进方法通常通过扩充更多训练实例来缓解这一问题，但扩充后的数据分布不平衡，存在内置偏差，仍然表现不佳。为此，建立一种融合词性语义扩展信息的事件检测模型。对词粒度扩展信息进行分析，在不增加训练实例的条件下缩小候选触发词的范围，并对候选触发词进行语义扩展，挖掘候选触发词的上下文中蕴含的丰富语义，缓解了标记数据稀疏造成模型训练不充分的情况。通过词性筛选模块寻找候选触发词并对其进行语义扩展挖掘词粒度语义信息，融合句子粒度语义信息提升语义表征的鲁棒性，最终利用Softmax分类器进行分类完成事件检测任务。实验结果表明，该模型在ACE2005和KBP2015数据集上的事件检测任务中的F1值分别达到79.5%和67.5%，有效提升了事件检测性能，并且在稀疏标记数据实验中的F1值达到78.5%，明显改善了标记数据稀疏带来的不良影响。

关键词: 事件检测, 稀疏标记, 词性筛选, 语义扩展, 语义融合, 动态多池化

Abstract:

Event detection is one of the key steps in event extraction, which depends on the identified triggers for event type classification. Current mainstream event detection methods exhibit poor performance on sparsely labeled data, which overfit the model with densely labeled triggers and fail on the sparsely labeled or unseen triggers. Most previous methods mitigate this problem by adding more training examples; however, the expanded data are distributed unevenly, have built-in biases, and still perform poorly. To this end, this study explores word granularity expansion information to mitigate the impact of the problem of sparsely labeled data by reducing the range of candidate triggers, and mining the rich semantic information in the contexts without increasing the number of training instances. First, a part of speech selection module is applied to find candidate triggers and extend their semantics, which digs out word granularity semantic information. Thereafter, sentence granularity semantic information is incorporated to improve the robustness of semantic information. Finally, event types classification is performed by Softmax function, which completes the event detection task. Experimental results on ACE2005 and KBP2015 datasets demonstrate that the model achieves F1 scores of 79.5% and 67.5% in the event detection task, respectively, effectively improving the performance of event detection. The F1 score reaches 78.5% in the sparsely labeled data experiments, thereby alleviating the sparsely labeled data problem significantly.

Key words: event detection, sparse label, part of speech filtering, semantic extension, semantic integration, dynamic multi-pooling

严海宁, 余正涛, 黄于欣, 宋燃, 杨溪. 融合词性语义扩展信息的事件检测模型[J]. 计算机工程, 2024, 50(3): 89-97.

Haining YAN, Zhengtao YU, Yuxin HUANG, Ran SONG, Xi YANG. Event Detection Model Integrating Part of Speech Semantic Extension Information[J]. Computer Engineering, 2024, 50(3): 89-97.

http://www.ecice06.com/CN/Y2024/V50/I3/89

图/表 9

图1 融合词性语义扩展信息的事件检测框架

Fig.1 Framework of event detection integrating part of speech semantic extension information

图2 ACE2005中事件类型样本大小分布

Fig.2 Sample size distribution of event type in the ACE2005 dataset

图3 ACE2005数据集触发词词性统计

Fig.3 Statistics of ACE2005 dataset trigger part of speech

参考文献 32

1	Linguistic Data Consortium. ACE(automatic content extraction) English annotation guidelines for events version 5.4. 3[EB/OL]. [2022-12-07]. https://www.ldc.upenn.edu/.
2	XIANG W, WANG B. A survey of event extraction from text. IEEE Access, 2019, 7, 173111- 173137. doi: 10.1109/ACCESS.2019.2956831
3	AHN D. The stages of event extraction[C]//Proceedings of Workshop on Annotating and Reasoning About Time and Events. New York, USA: ACM Press, 2006: 1-8.
4	李中秋, 洪宇, 王捷, 等. 基于实体画像增强网络的事件检测方法. 中文信息学报, 2022, 36 (8): 81- 91. URL
	LI Z Q, HONG Y, WANG J, et al. Entity profile enhancement network for event detection. Journal of Chinese Information Processing, 2022, 36 (8): 81- 91. URL
5	陈斌, 周勇, 刘兵. 基于卷积双向长短期记忆网络的事件触发词抽取. 计算机工程, 2019, 45 (1): 153- 158. doi: 10.3969/j.issn.1007-130X.2019.01.020
	CHEN B, ZHOU Y, LIU B. Event trigger word extraction based on convolutional bidirectional long short term memory network. Computer Engineering, 2019, 45 (1): 153- 158. doi: 10.3969/j.issn.1007-130X.2019.01.020
6	FERGUSON J, LOCKARD C, WELD D, et al. Semi-supervised event extraction with paraphrase clusters[C]//Proceedings of 2018 Conference of the North American Chapter of the Association for Computational Linguistic. Stroudsburg, USA: Association for Computational Linguistics, 2018: 359-364.
7	ARAKI J, MITAMURA T. Open-domain event detection using distant supervision[C]//Proceedings of the 27th International Conference on Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2018: 878-891.
8	CAO Y X, HU Z K, CHUA T S, et al. Low-resource name tagging learned with weakly labeled data[C]//Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2019: 261-270.
9	CHEN Y B, LIU S L, ZHANG X, et al. Automatically labeled data generation for large scale event extraction[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2017: 409-419.
10	WANG X, HAN X, LIU Z, et al. Adversarial training for weakly supervised event detection[C]//Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1(Long and Short Papers). Stroudsburg, USA: Association for Computational Linguistics, 2019: 998-1008.
11	ZENG Y, FENG Y, MA R, et al. Scale up event extraction learning via automatic training data generation[C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence and 30th Innovative Applications of Artificial Intelligence Conference and 8th AAAI Symposium on Educational Advances in Artificial Intelligence. Palo Alto, USA: AAAI Press, 2018: 6045-6052.
12	TONG M H, XU B, WANG S, et al. Improving event detection via open-domain trigger knowledge[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 5887-5897.
13	JI H, GRISHMAN R. Refining event extraction through cross-document inference[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2008: 254-262.
14	QIN Y, ZHANG Y, ZHANG M, et al. Feature-rich segment-based news event detection on Twitter[C]//Proceedings of International Joint Conference on Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2013: 302-310.
15	LIU S L, CHEN Y B, HE S Z, et al. Leveraging FrameNet to improve automatic event detection[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics(Volume 1: Long Papers). Stroudsburg, USA: Association for Computational Linguistics, 2016: 2134-2143.
16	LIU S L, CHEN Y B, LIU K, et al. Exploiting argument information to improve event detection via supervised attention mechanisms[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics(Volume 1: Long Papers). Stroudsburg, USA: Association for Computational Linguistics, 2017: 1789-1798.
17	CHEN Y B, XU L H, LIU K, et al. Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing(Volume 1: Long Papers). Stroudsburg, USA: Association for Computational Linguistics, 2015: 167-176.
18	NGUYEN T H, GRISHMAN R. Modeling skip-grams for event detection with convolutional neural networks[C]//Proceedings of 2016 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2016: 886-891.
19	LIU X, LUO Z C, HUANG H Y. Jointly multiple events extraction via attention-based graph information aggregation[C]//Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2018: 1247-1256.
20	LAI V D, NGUYEN T N, NGUYEN T H. Event detection: gate diversity and syntactic importance scores for graph convolution neural networks[C]//Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2020: 5405-5411.
21	HU B, LIU Y, CHEN N Y, et al. SEGCN-DCR: a syntax-enhanced event detection framework with decoupled classification rebalance. Neurocomputing, 2022, 481, 55- 66. doi: 10.1016/j.neucom.2022.01.069
22	LIU J, CHEN Y B, LIU K, et al. How does context matter? On the robustness of event detection with context-selective mask generalization[C]//Proceedings of the Findings of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 2523-2532.
23	LI R, ZHAO W L, YANG C, et al. Treasures outside contexts: improving event detection via global statistics[C]//Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2021: 2625-2635.
24	CAO K, WEI C, GAIDON A, et al. Learning imbalanced datasets with label-distribution-aware margin loss[C]//Proceedings of the 33rd International Conference on Neural Information Processing Systems. Berlin, Germany: Springer, 2019: 1567-1578.
25	KHAN S H, HAYAT M, BENNAMOUN M, et al. Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Transactions on Neural Networks and Learning Systems, 2018, 29 (8): 3573- 3587. doi: 10.1109/TNNLS.2017.2732482
26	PETRONI F, ROCKTÄSCHEL T, RIEDEL S, et al. Language models as knowledge bases?[C]//Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2019: 2463-2473.
27	MITAMURA T, LIU Z Z, HOVY E. Overview of TAC-KBP 2015 event nugget track[EB/OL]. [2022-12-07]. https://tac.nist.gov/publications/2015/additional.papers/TAC2015.KBP_Event_Nugget_overview.proceedings.pdf.
28	苗佳, 段跃兴, 张月琴, 等. 基于CNN-BiGRU模型的事件触发词抽取方法. 计算机工程, 2021, 47 (9): 69-74, 83. URL
	MIAO J, DUAN Y X, ZHANG Y Q, et al. Method for extracting event trigger words based on the CNN-BiGRU model. Computer Engineering, 2021, 47 (9): 69-74, 83. URL
29	NGUYEN T H, CHO K, GRISHMAN R. Joint event extraction via recurrent neural networks[C]//Proceedings of 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, USA: Association for Computational Linguistics, 2016: 300-309.
30	SHA L, QIAN F, CHANG B, et al. Jointly extracting event triggers and arguments by dependency-bridge RNN and tensor-based argument interaction[C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence and 30th Innovative Applications of Artificial Intelligence Conference and 8th AAAI Symposium on Educational Advances in Artificial Intelligence. Palo Alto, USA: AAAI Press, 2018: 5916-5923.
31	NGUYEN T H, GRISHMAN R. Graph convolutional networks with argument-aware pooling for event detection[C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence and 30th Innovative Applications of Artificial Intelligence Conference and 8th AAAI Symposium on Educational Advances in Artificial Intelligence. Palo Alto, USA: AAAI Press, 2018: 5900-5907.
32	CUI S Y, YU B W, LIU T W, et al. Edge-enhanced graph convolution networks for event detection with syntactic relation[C]//Proceedings of the Findings of the Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 2329-2339.

[1]	谢雨飞,吕钊. 基于语义扩展与注意力网络的问题细粒度分类[J]. 计算机工程, 2019, 45(1): 165-171,177.
[2]	华昕佳,张帅,李凤荣,赵鲁阳. 带状无线传感器网络间歇性故障检测[J]. 计算机工程, 2015, 41(12): 119-124,129.
[3]	褚衍杰,魏强,李云照. 基于关键词语义与作用域扩展的事件检测[J]. 计算机工程, 2014, 40(8): 273-276,281.
[4]	姜参，马荣娟. WSN中基于压缩感知的异常事件检测方案[J]. 计算机工程, 2014, 40(3): 137-142.
[5]	蔡偃武,高大启,阮彤,蒋锐权. 面向大规模数据的在线新事件检测[J]. 计算机工程, 2014, 40(10): 37-42.
[6]	陈浩, 张书奎, 杨凯. 基于支配集的多传感器集成复合事件检测[J]. 计算机工程, 2013, 39(3): 118-122.
[7]	汪洋, 帅建梅. 基于语义扩展模型的中文网页关键词抽取[J]. 计算机工程, 2012, 38(22): 163-166.
[8]	万静, 王文聪, 易军凯. 一种基于本体的知识库语义扩展搜索方法[J]. 计算机工程, 2012, 38(06): 19-21.
[9]	李晗, 武奇生. 融合颜色相关性和纹理差异的阴影检测方法[J]. 计算机工程, 2011, 37(15): 146-148.
[10]	王永恒;杨圣洪;郭波. 高效的射频识别数据流层次复杂事件检测[J]. 计算机工程, 2010, 36(06): 84-85.
[11]	石时需;郑启伦;曹波. 一种基于区域的视频车辆跟踪系统[J]. 计算机工程, 2008, 34(17): 196-199.
[12]	王颖颖;张　赟;胡乃静. 在线新事件检测系统中的性能提升策略[J]. 计算机工程, 2008, 34(15): 72-74.

选择文件类型/文献管理软件名称

选择包含的内容