Non-convex Temporal Difference Low-Rank Constrained Motion Discontinuous Spatio-Temporal Behavior Understanding

doi:10.19678/j.issn.1000-3428.0260017

Abstract

Unstructured occlusion, motion distortion, and coupled multi-source noise in complex dynamic scenes cause low-rank degradation and partial observation in spatio-temporal data. To address this, the paper proposes a framework for understanding spatio-temporal behavior under discontinuous motion that combines a non-convex temporal difference low-rank constraint with a hierarchical trajectory-to-behavior semantic mapping. First, a non-convex temporal difference low-rank (NTDLR) recovery model based on the Schatten-p norm is constructed and solved with the Alternating Direction Method of Multipliers (ADMM) to reconstruct motion data under high missing rates and noise contamination. Second, structured trajectory clusters are built from the recovered data using multi-object tracking, and trajectory neighborhood interaction features are extracted. Third, a three-level behavior understanding model is proposed: behavior primitive classification with multilayer perceptrons, interaction pattern recognition with graph attention networks, and semantic fusion with behavior narrative generation that incorporates spatio-temporal context. Together these levels give an end-to-end mapping from trajectories to high-level semantics. Experiments show that, at a 60% missing rate, the proposed method recovers data with significantly higher quality than the baseline approaches. It reaches a behavior recognition accuracy of 92.7% on both NTU RGB+D (X-Sub) and the self-built motion dataset BAS, 5.6 percentage points above the best competing method. Ablation studies confirm the contribution of each module: under 60% missing data, the NTDLR recovery module raises the recognition rate from 78.3% to 86.7%, trajectory neighborhood encoding raises it further to 88.2%, and the complete three-level model achieves the best performance with all modules working together. Interaction pattern recognition and semantic description generation also clearly surpass mainstream graph convolutional networks and their variants. The work provides an interpretable and scalable algorithmic framework for understanding discontinuous, interactive motion behavior in complex dynamic scenes.
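The recovery step can be illustrated with a small NumPy sketch. The abstract does not give the objective or the update rules, so everything below is an assumption made for illustration: the objective `mu * ||D X||_Sp^p + (lam / 2) * ||P_Omega(X - M)||_F^2` with `D` the first-order temporal difference operator, the parameter names `lam`, `mu`, `rho`, and the function names themselves. For 0 < p < 1 the Schatten-p quasi-norm raised to the power p is the sum of the singular values each raised to p. The low-rank step here uses a one-step reweighted singular value threshold in place of an exact Schatten-p proximal operator, so this is not the authors' NTDLR algorithm.

```python
import numpy as np

def first_difference(T):
    """(T-1) x T operator D with (D @ X)[t] = X[t+1] - X[t]."""
    return np.diff(np.eye(T), axis=0)

def shrink_schatten_p(Y, p, tau, eps=1e-6):
    """Approximate prox of tau * ||.||_Sp^p (assumed surrogate).

    Each singular value s is reduced by tau * p * (s + eps)**(p - 1),
    i.e. a linearised, reweighted soft threshold, and clipped at zero.
    """
    U, s, Vt = np.linalg.svd(Y, full_matrices=False)
    s_new = np.maximum(s - tau * p * (s + eps) ** (p - 1.0), 0.0)
    return (U * s_new) @ Vt

def recover(M, mask, p=0.5, lam=100.0, mu=0.1, rho=1.0, n_iter=50):
    """ADMM sketch with splitting Z = D X and scaled dual variable U.

    M    : frames x features matrix (values at unobserved entries ignored)
    mask : boolean array, True where M is observed
    """
    T, d = M.shape
    D = first_difference(T)
    L = D.T @ D
    W = mask.astype(float)
    M_obs = np.where(mask, M, 0.0)
    X = M_obs.copy()
    Z = np.zeros((T - 1, d))
    U = np.zeros((T - 1, d))
    ridge = 1e-8 * np.eye(T)  # keeps the system solvable for sparse columns
    for _ in range(n_iter):
        # X-update: one linear system per feature column.
        R = rho * D.T @ (Z - U)
        for j in range(d):
            A = lam * np.diag(W[:, j]) + rho * L + ridge
            X[:, j] = np.linalg.solve(A, lam * M_obs[:, j] + R[:, j])
        # Z-update: shrink singular values of the temporal differences.
        DX = D @ X
        Z = shrink_schatten_p(DX + U, p, mu / rho)
        # Dual update.
        U = U + DX - Z
    return X

# Synthetic usage: a smooth rank-3 "motion" matrix with about 60% missing.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 2.0 * np.pi, 80)
basis = np.stack([np.sin(t), np.cos(t), np.sin(2.0 * t)], axis=1)
X_true = basis @ rng.standard_normal((3, 12))
mask = rng.random(X_true.shape) > 0.6
X_hat = recover(X_true, mask)
rel_err = np.linalg.norm(X_hat - X_true) / np.linalg.norm(X_true)
```

Initialising `Z` and `U` at zero makes the first X-update a plain temporal smoothing of the observed entries, which gives the later low-rank shrinkage a reasonable starting point. The values of `lam`, `mu`, and `rho` are untuned placeholders.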

Qiang Zhenqian, Zhang Yupeng, Wang Danping. Non-convex Temporal Difference Low-Rank Constrained Motion Discontinuous Spatio-Temporal Behavior Understanding[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260017.

强振乾, 张予鹏, 王丹萍. 非凸时序差分低秩约束的运动非连续时空行为理解[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0260017.

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260017
