Multi-mechanism-guided Deep Learning Framework for Industrial Time-series Forecasting

doi:10.19678/j.issn.1000-3428.0069406

Abstract

Industrial time-series forecasting is critical for optimizing production processes and supporting decision-making. Existing deep learning-based methods often underperform because they lack domain knowledge. Prior studies have used mechanistic models to guide deep learning to address this problem; however, these approaches typically consider only a single mechanistic model, ignoring both the case where an industrial process has multiple time-series prediction mechanisms and the inherent complexity of industrial time series (e.g., multiscale dynamics and nonlinearity). To address this issue, this study proposes a general attention-based framework, Multi-Mechanism-guided Deep Learning for Industrial Time-series Forecasting (M-MDLITF). The framework embeds multiple mechanistic models into a deep industrial time-series prediction network to guide training, and integrates the strengths of the different mechanisms into the final prediction through an attention mechanism. As an instantiation of M-MDLITF, the Multi-mechanism Deep Wiener (M-DeepWiener) method uses contextual sliding windows and a Transformer encoder architecture to capture complex patterns in industrial time series. Experimental results on one simulated dataset and two real-world datasets show that M-DeepWiener runs efficiently and is robust, and that it predicts more accurately than the single-mechanism Deep Wiener (DeepWiener), the classical Wiener mechanism, and purely data-driven methods; on the simulated dataset, its error is 20% lower than that of the single-mechanism model DeepWiener-M1.

Key words: industrial time-series forecasting, deep learning, mechanistic model, multi-mechanism integration, complex pattern mining
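The abstract names two building blocks: contextual sliding windows over the series, and attention-based integration of the predictions made under several mechanisms. The following is only a minimal sketch of those two ideas, not the paper's implementation. The function names, the two toy "mechanisms", and the hand-set relevance scores are all assumptions made for illustration; in M-DeepWiener the scores would come from a trained network.

```python
import math


def sliding_windows(series, width):
    """Split a 1-D series into overlapping context windows of length `width`."""
    return [series[i:i + width] for i in range(len(series) - width + 1)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_integrate(predictions, scores):
    """Fuse per-mechanism predictions with softmax attention weights.

    predictions[k] is the forecast produced under mechanism k.
    scores[k] is a relevance score for mechanism k (hypothetical here;
    a trained attention module would normally supply it).
    """
    weights = softmax(scores)
    fused = sum(w * p for w, p in zip(weights, predictions))
    return fused, weights


# Toy usage with two hypothetical mechanisms applied to the latest window.
series = [1.0, 2.0, 3.0, 4.0, 5.0]
windows = sliding_windows(series, width=3)
latest = windows[-1]

preds = [
    latest[-1] + (latest[-1] - latest[-2]),  # mechanism A: linear extrapolation
    sum(latest) / len(latest),               # mechanism B: window mean
]

# Equal scores give equal weights, so the fused value is the plain average.
fused, weights = attention_integrate(preds, scores=[0.0, 0.0])
```

Raising one score relative to the others shifts the fused forecast toward that mechanism, which is the sense in which attention lets the final prediction draw on whichever mechanism fits the current context best.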

LI Jiangxin, WANG Peng, WANG Wei. Multi-mechanism-guided Deep Learning Framework for Industrial Time-series Forecasting[J]. Computer Engineering, 2025, 51(7): 47-58.

https://www.ecice06.com/CN/Y2025/V51/I7/47

Figures/Tables 11

Fig.1 M-MDLITF network architecture

Fig.2 Attention integrator architecture

Fig.3 M-DeepWiener time-series parameter identification module

Fig.4 RMSE of M-DeepWiener and other ablation methods on all test data

Fig.5 RMSE of each independent mechanism and the integrated mechanism in M-DeepWiener-NE and M-DeepWiener-NE-NP on all test data

Fig.6 Influence of M-DeepWiener hyperparameter settings on model prediction error

Fig.7 M-DeepWiener training convergence curve

