Research on Automatic Scoring for English Essay Based on Multi-Scale Context

doi:10.19678/j.issn.1000-3428.0067224

Abstract

Existing automatic essay scoring models do not extract semantic features at different context scales, and they do not measure at the sentence level how closely an essay relates to its topic. This study proposes MSC, a method for automatic scoring of English essays based on multi-scale context. The method uses an XLNet English pre-trained model to extract word and sentence embeddings from the original essay text. This keeps the embeddings consistent with their context when long text sequences are processed, improves the quality of the dynamic semantic vector representation, and addresses polysemy. A one-dimensional convolution module then extracts phrase-level embeddings at different scales. The MSC network combines Built-in Self-Attention Simple Recurrence Units (BSASRU) with a global attention mechanism to capture high-dimensional latent contextual semantic associations at the word, phrase, and sentence levels. It computes the semantic similarity between the sentence vectors and the essay topic to extract topic-level features. All features are fed into a fusion layer, and a linear layer produces the final score. Experimental results on ASAP, a publicly available standard dataset for English essay scoring, show that the MSC model achieves an average Quadratic Weighted Kappa (QWK) of 80.5%. It also achieves the best performance on multiple subsets and outperforms the deep learning scoring models used for comparison, demonstrating its effectiveness for automatic English essay scoring.

Keywords: automatic scoring for English essays, pre-training model, multi-scale context, global attention, topic-level features
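The abstract reports results as Quadratic Weighted Kappa (QWK), which measures agreement between human and model scores while penalizing large disagreements more heavily than small ones. The following is a minimal sketch of the standard QWK definition in plain Python. It is not the authors' evaluation code; the function name, argument names, and the example score range are illustrative assumptions.

```python
def quadratic_weighted_kappa(human, predicted, min_score, max_score):
    """Agreement between two lists of integer scores, weighted by squared distance.

    Hypothetical helper for illustration; scores must lie in [min_score, max_score].
    """
    if len(human) != len(predicted) or not human:
        raise ValueError("score lists must be non-empty and of equal length")
    n_cat = max_score - min_score + 1
    if n_cat < 2:
        raise ValueError("need at least two possible score values")
    n = len(human)

    # Observed counts: observed[i][j] = essays scored i by the human, j by the model.
    observed = [[0] * n_cat for _ in range(n_cat)]
    for h, p in zip(human, predicted):
        observed[h - min_score][p - min_score] += 1

    # Marginal histograms of each rater's scores.
    hist_h = [sum(row) for row in observed]
    hist_p = [sum(observed[i][j] for i in range(n_cat)) for j in range(n_cat)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(n_cat):
        for j in range(n_cat):
            weight = (i - j) ** 2 / (n_cat - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count if the two raters scored independently.
            weighted_expected += weight * hist_h[i] * hist_p[j] / n

    if weighted_expected == 0.0:
        # Both raters gave every essay the same single score.
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


if __name__ == "__main__":
    # Made-up scores on an assumed 1-4 scale, for demonstration only.
    human_scores = [1, 2, 3, 4, 2, 3]
    model_scores = [1, 2, 3, 3, 2, 4]
    print(quadratic_weighted_kappa(human_scores, model_scores, 1, 4))
```

A value of 1 means perfect agreement, 0 means agreement no better than chance, and negative values mean systematic disagreement. The 80.5% in the abstract corresponds to an average QWK of 0.805 on this scale.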

Mingcheng YU, Yagu DANG, Qilin WU, Xu JI, Kexin BI. Research on Automatic Scoring for English Essay Based on Multi-Scale Context[J]. Computer Engineering, 2024, 50(3): 259-266.

URL: http://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0067224

http://www.ecice06.com/EN/Y2024/V50/I3/259

Figures/Tables 11

Fig.1 Overall structure of the MSC model

Fig.2 Word embedding process

Fig.3 Sentence embedding process

Fig.4 Structure of built-in self-attention simple recurrence units model

Fig.5 The average training time among different recurrent networks on subsets
