Video Semantic Annotation Based on Visual Word Fuzzy Weighting

doi:10.3969/j.issn.1000-3428.2012.13.039

Computer Engineering, 2012, Vol. 38, Issue (13): 131-133.

HUO Hua, ZHAO Gang

(College of Electronic Information Engineering, Henan University of Science and Technology, Luoyang 471003, China)

Received: 2011-09-05 Online: 2012-07-05 Published: 2012-07-05
About the authors: HUO Hua (b. 1968), male, associate professor and postdoctoral researcher; main research interests: intelligent information processing, Fibre Channel technology, and embedded systems. ZHAO Gang, master's student.
Funding:
National Natural Science Foundation of China (60743008); Henan Province International Science and Technology Cooperation Program (104300510063)

Abstract

Abstract: To address the vector quantization error and visual word ambiguity of the Bag of Visual Words (BoVW) model, this paper proposes a video semantic annotation scheme based on fuzzy weighting of visual words, the Fuzzy Weighting Scheme (FWS). Starting from a K-Nearest Neighbors (KNN) pre-clustering of the training sample set, the scheme trains a One-Class Support Vector Machine (OC-SVM) on the sample subset of each cluster. The visual words that a single local visual feature vector maps to are determined from the spatial distribution of the clustering hyperspheres, and their fuzzy weights are computed from a function of the distance between the sample feature and the center of each clustering hypersphere. FWS is designed to improve the expressiveness and discriminative power of visual words. Experimental results show that the scheme improves video semantic annotation precision by 34% over the TF scheme and by 16% over the VWA scheme.

Key words: video semantic annotation, Bag of Visual Words (BoVW) model, Fuzzy Weighting Scheme (FWS), One-Class Support Vector Machine (OC-SVM), clustering hypersphere, fuzzy membership degree
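
The weighting idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual membership function or the rule for selecting candidate words, so everything specific here is an assumption made for illustration: the Gaussian decay over radius-normalized distance, the hypothetical `reach` parameter, the nearest-center fallback, and the function names. The hypersphere centers and radii are taken as given; in the paper they would come from training an OC-SVM on each cluster, which is not shown.

```python
import math


def fuzzy_weights(feature, centers, radii, reach=1.5):
    """Return {word_index: weight} for one local feature vector.

    Assumed rule (not from the paper): a visual word is a candidate when
    the feature lies within reach * radius of that word's hypersphere
    center. Candidate weights decay with the radius-normalized distance
    and are normalized so that they sum to 1.
    """
    raw = {}
    for k, (center, radius) in enumerate(zip(centers, radii)):
        d = math.dist(feature, center)
        if d <= reach * radius:
            raw[k] = math.exp(-((d / radius) ** 2))
    if not raw:
        # No hypersphere is close enough: fall back to a hard assignment
        # to the nearest center, as in a plain BoVW model.
        nearest = min(range(len(centers)),
                      key=lambda i: math.dist(feature, centers[i]))
        return {nearest: 1.0}
    total = sum(raw.values())
    return {k: w / total for k, w in raw.items()}


def bovw_histogram(features, centers, radii, reach=1.5):
    """Accumulate the fuzzy weights of all features into one histogram."""
    hist = [0.0] * len(centers)
    for feature in features:
        for k, w in fuzzy_weights(feature, centers, radii, reach).items():
            hist[k] += w
    return hist


if __name__ == "__main__":
    centers = [(0.0, 0.0), (4.0, 0.0)]
    radii = [2.0, 2.0]
    # A feature midway between two hyperspheres is shared between both words.
    print(fuzzy_weights((2.0, 0.0), centers, radii))
    print(bovw_histogram([(0.0, 0.0), (2.0, 0.0)], centers, radii))
```

Because each feature's weights sum to 1, the histogram total equals the number of features, which keeps it comparable to a hard-assignment term-frequency histogram.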

CLC Number: TP391

HUO Hua, ZHAO Gang. Video Semantic Annotation Based on Visual Word Fuzzy Weighting[J]. Computer Engineering, 2012, 38(13): 131-133.

https://www.ecice06.com/CN/Y2012/V38/I13/131

