摘要: 对新闻视频进行结构分析,提出一种基于多模态特征融合的新闻故事单元分割方法。将新闻视频分割成音频流和视频流,选择静音区间为音频候选点,将镜头边界切变点作为视频候选点,做主持人镜头和主题字幕的探测,挑选主持人镜头为候选区间,并记录主题字幕的起始位置和结束位置,利用时间轴融合音频候选点、视频候选点、主持人镜头和主题字幕,对新闻视频进行故事单元分割。实验结果表明,该方法的查全率为83.18%,查准率为83.92%。
关键词:
新闻视频,
多模态特征,
字幕,
音频,
故事单元分割
Abstract: News story unit segmentation method based on multi-modal feature fusion is proposed in this paper by analyzing news video structure. News video is divided into audio stream and video stream. Mute intervals are detected as audio candidate points, and the shot segmentations for news video are detected and shot boundary points are chosen as video candidate points, anchorperson shot and topic caption are detected. Story units are detected by fusing audio candidate points, video candidate points, anchorperson shot and topic caption based on time axis. Experimental results show that this method can get 83.18% in recall and 83.92% in precision.
Key words:
news videom,
ulti-modal feature,
caption,
audio,
story unit segmentation
中图分类号:
刘嘉琦, 封化民, 闫建鹏. 基于多模态特征融合的新闻故事单元分割[J]. 计算机工程, 2012, 38(24): 161-165.
LIU Jia-Qi, BIAN Hua-Min, YAN Jian-Feng. News Story Unit Segmentation Based on Multi-modal Feature Fusion[J]. Computer Engineering, 2012, 38(24): 161-165.