
Computer Engineering (计算机工程) ›› 2025, Vol. 51 ›› Issue (11): 90-99. doi: 10.19678/j.issn.1000-3428.0069678

• Artificial Intelligence and Pattern Recognition •

  • Supported by: National Natural Science Foundation of China (62072146); Zhejiang Province "Special Support Program for High-Level Talents" Leading Talent in Science and Technology Innovation Project (2022R52043); Key Research and Development Program of Zhejiang Province (2023C03194); Key Research and Development Program of Zhejiang Province (2021C03187)

Time Series Imputation Based on Self-Attention Mechanism

XU Lei, ZENG Yan, YUAN Junfeng*(), YUE Lupeng, YIN Yuyu, ZHANG Jilin, XUE Meiting, HAN Meng   

  1. College of Computer Science, Hangzhou Dianzi University, Hangzhou 310000, Zhejiang, China
  • Received: 2024-04-01 Revised: 2024-05-15 Online: 2025-11-15 Published: 2024-08-21
  • Contact: YUAN Junfeng


Abstract:

As core data for maritime traffic, ship trajectory data can be used for trajectory prediction, early warning, and other tasks, and exhibit pronounced temporal characteristics. However, owing to factors such as harsh marine environments and poor communication reliability, collected ship trajectory data commonly suffer from missing values, and learning from time series that contain missing data significantly degrades the accuracy of time series analysis. The current mainstream solution is to approximately impute the missing data, mainly using convolutional models that reshape the time series along the time axis to capture its local features; however, such models are weak at capturing the global features of long time series. The Transformer captures the relationships among the time points of a series through its core self-attention mechanism, thereby strengthening a model's ability to capture global features. However, because attention is computed by matrix multiplication, it ignores the temporal ordering of the series, and the resulting global feature weights lack time-span dependency. Therefore, to address the problem of capturing global features in long time series, this study proposes GANet, a variant network based on the self-attention mechanism. GANet first obtains a basic global feature weight matrix over the time points through self-attention, and then uses a Gated Recurrent Unit (GRU) to forget and update this matrix along the time axis, yielding a global feature weight matrix with time-span dependency; this matrix is then used to reconstruct the data and impute the missing values. By combining the self-attention and gating mechanisms, GANet captures global features while accounting for the influence of time span on each time point, making the captured global features time-span dependent. Experimental results show that, compared with existing models such as Autoformer and Informer, GANet achieves better imputation performance on the Trajectory, ETT, and Electricity datasets.
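To make the mechanism described in the abstract concrete, the following NumPy sketch illustrates the general idea (it is not the authors' implementation): a self-attention weight matrix is computed over the time points, a GRU cell then forgets and updates its rows along the time axis so the weights become time-span dependent, and the gated weights finally reconstruct the series to fill missing positions. All function names, weight initializations, and dimensions here are illustrative assumptions.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def gru_step(h, x, Wz, Uz, Wr, Ur, Wh, Uh):
    """One standard GRU cell step: update gate z, reset gate r, candidate state."""
    z = 1.0 / (1.0 + np.exp(-(x @ Wz + h @ Uz)))
    r = 1.0 / (1.0 + np.exp(-(x @ Wr + h @ Ur)))
    h_cand = np.tanh(x @ Wh + (r * h) @ Uh)
    return (1.0 - z) * h + z * h_cand

def ganet_sketch(x, mask, rng):
    """x: (T, D) time series; mask: (T, D), 1 = observed, 0 = missing."""
    T, D = x.shape
    x = x * mask                                   # zero out missing entries first
    Wq, Wk, Wv = (0.1 * rng.standard_normal((D, D)) for _ in range(3))
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = softmax(q @ k.T / np.sqrt(D))         # (T, T) basic global weight matrix
    # Run a GRU over the rows of the weight matrix so that earlier weights
    # can be forgotten/updated, giving the weights time-span dependency.
    params = [0.1 * rng.standard_normal((T, T)) for _ in range(6)]
    h = np.zeros(T)
    gated = np.empty_like(scores)
    for t in range(T):
        h = gru_step(h, scores[t], *params)
        gated[t] = h
    weights = softmax(gated)                       # renormalize each row
    recon = weights @ v                            # reconstructed series (T, D)
    return mask * x + (1.0 - mask) * recon         # keep observed, impute missing
```

In a trained model the projection and GRU parameters would be learned against a reconstruction loss on the observed entries; here they are random, so the sketch only demonstrates the data flow, with the final masked sum guaranteeing that observed values pass through unchanged.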

Key words: self-attention mechanism, gated recurrent unit, global feature capture, time span dependency, time series imputation