作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 多媒体技术及应用 • 上一篇    下一篇

面向电力机房监控的音视频融合检测方法研究

袁慧 1,张大伟 2,张珂 3,湛永松 3   

  1. (1.国网湖北省电力公司信息通信公司,武汉 430077; 2.中国科学院自动化研究所,北京 100190;3.桂林电子科技大学 广西高校云计算与复杂系统重点实验室,广西 桂林 541004)
  • 收稿日期:2015-11-16 出版日期:2016-12-15 发布日期:2016-12-15
  • 作者简介:袁慧(1974—),男,高级工程师,主研方向为信息系统运维管理;张大伟,博士研究生;张珂,硕士研究生;湛永松,教授。
  • 基金资助:
    国家自然科学基金(61332017);广西科技计划项目(1598018-6);广西高校图像图形智能处理重点实验室课题(GIIP201403)。

Research on Audio-video Fusion Detection Method for Electrical Room Monitoring

YUAN Hui  1,ZHANG Dawei  2,ZHANG Ke  3,ZHAN Yongsong  3   

  1. (1.Informance and Telecommunication Branch,State Grid Hubei Electric Power Company,Wuhan 430077,China; 2.Institute of Automation,Chinese Academy of Sciences,Beijing 100190,China; 3.Guangxi Colleges and Universities Key Laboratory of Cloud Computing and Complex Systems, Guilin University of Electronic Technology,Guilin,Guangxi 541004,China)
  • Received:2015-11-16 Online:2016-12-15 Published:2016-12-15

摘要: 为实现电力机房重点区域指示灯安全事件的监控,提出一种音视频融合检测方法。采用监控区域彩色图像非线性变换和最大类间方差法自动阈值分割技术进行指示灯定位,将局部区域统计直方图作为视频特征向量。采用连续多帧梅尔频率倒谱系数建立监控区域的音频特征向量。利用主成分分析对连续多帧的音视频融合特征向量进行降维处理,并借助支持向量机对不同安全事件进行分类检测。实验结果表明,与单独采用音频或者视频进行安全事件检测的方法相比,该方法具有较高的检测率和较低的误检率。

关键词: 电力机房, 彩色图像非线性变换, 最大类间方差法, 梅尔频率倒谱系数, 音视频融合, 安全监控

Abstract: An audio-video fusion detection method is proposed in this paper for indicator light security incident surveillance in key areas of the electrical room.The optimal indicator range is determined by color image nonlinear transform for monitoring area and OTSU automatic threshold segmentation technique.The local gray scale histograms are taken as the video feature vector,and the Mel-frequency Cepstral Coefficien(MFCC) for multi frame is selected as the audio feature vector.The audio feature vector and the video feature vector are combined into a relatively large-scale vector.Principal Component Analysis (PCA) is employed for dimensionality reduction of the fused feature vector,and Support Vector Machine(SVM) is used to classify and detect the security incidents.Experimental result show that the proposed method obtains higher detection rate and lower false positive rate over the traditional single security incident detection(only audio-based or video-based) methods.

Key words: electrical room, color image nonlinear transformation, OTSU, Mel-frequency Cepstral Coefficient(MFCC), audio-video fusion, security monitoring

中图分类号: