Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2023, Vol. 49 ›› Issue (5): 277-285,294. doi: 10.19678/j.issn.1000-3428.0064047

• Development Research and Engineering Application • Previous Articles     Next Articles

Fall Detection Method Integrating Multi-Feature and Semantic Graph Convolution Network

CHEN Wenxuan, ZENG Bi, GUO Zhixing   

  1. School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
  • Received:2022-02-28 Revised:2022-06-23 Published:2022-08-30

融合多特征与语义图卷积网络的摔倒检测方法

陈文轩, 曾碧, 郭植星   

  1. 广东工业大学 计算机学院, 广州 510006
  • 作者简介:陈文轩(1996-),男,硕士研究生,主研方向为深度学习、计算机视觉;曾碧(通信作者),教授、博士;郭植星,硕士研究生。
  • 基金资助:
    国家自然科学基金(61703115);广东省自然科学基金(2018A030313868)。

Abstract: Falls have a serious impact on the lives and health of elderly people.Fall detection can reduce the risk of elderly people falling again,thereby ensuring their ability to live and improving their quality of life.Currently,vision-based fall detection methods can achieve good accuracy on experimental datasets,but they do not generalize well to real-world environments and often do not conform to the action judgment logic in practical applications.To solve this problem,this paper proposes a robust fall detection method based on 2D human posture estimation,which combines the optical flow method and human posture-based estimation method. This study designed an optimized framework for fall detection and constructed a detection model that integrates multiple features and semantic graph convolution.The model is trained using a strategy that is more suitable for motion judgment logic to improve the generalization of the fall detection system for a real environment. Experiments on three public datasets,Le2i Fall Detection Dataset,UP Fall Detection Dataset,and Multiple Cameras Fall Detection Dataset,as well as self-collected datasets,show that the overall detection accuracy of the model reaches 98.3%.The model based on the proposed optimization framework and training strategy cooperates with YOLOv3 and Alpha_pose to implement an overall fall detection method that achieves a frame rate of approximately 25FPS on a GTX1060 graphics card.This method exhibits good robustness in real-world testing, and compared to previous vision-based detection methods,it is more suitable for deployment in practical application environments.

Key words: graph convolution network, lightweight network, fall detection, multi-feature fusion, semantic information, training strategy

摘要: 摔倒事件严重影响老年人的生命健康,对摔倒行为进行检测可以降低老年人再次跌倒的风险,从而保证其生活能力以及提高生活质量。目前基于视觉的摔倒检测方法在实验数据集上能够取得较好的精度,但是无法很好地泛化到现实环境中,在实际应用时往往并不符合动作判断逻辑。针对该问题,对比光流法以及基于人体姿态估计的方法,在2D人体姿态估计的基础上提出一种鲁棒的摔倒检测方法。设计一种摔倒检测优化框架,构建融合多特征与语义图卷积的检测模型,采用更贴合动作判断逻辑的训练策略对该模型进行训练,以提高摔倒检测系统在现实环境中的泛化性。在Le2i Fall Detection Dataset、UP Fall Detection Dataset和Multiple Cameras Fall Detection Dataset这3个公开数据集以及自收集数据集上进行实验,结果表明,该模型的总体检测准确率达到98.3%,基于所提优化框架与训练策略的模型配合YOLOv3和Alpha_pose实现的整体摔倒检测方法在GTX1060显卡中帧率达到约25FPS,在现实场景测试中体现出较好的鲁棒性,相较以往的基于视觉的检测方法更适合部署在实际应用环境中。

关键词: 图卷积网络, 轻量级网络, 摔倒检测, 多特征融合, 语义信息, 训练策略

CLC Number: