基于MFF-GAN的图像集视觉总结

doi:10.19678/j.issn.1000-3428.0050237

计算机工程 ›› 2019, Vol. 45 ›› Issue (2): 202-206. doi: 10.19678/j.issn.1000-3428.0050237

基于MFF-GAN的图像集视觉总结

张文凯^1,2,3,孙皓^1,2,孙显^1,2,王宏琦^1,2

1.中国科学院电子学研究所,北京 100190; 2.中国科学院空间信息处理与应用系统技术重点实验室,北京 100190; 3.中国科学院大学,北京 100190)

收稿日期:2018-01-23 出版日期:2019-02-15 发布日期:2019-02-15
作者简介:张文凯(1990—),男,博士研究生,主研方向为计算机视觉、图像处理;孙皓、孙显,副研究员、博士;王宏琦,研究员、博士。
基金资助:
国家自然科学基金(41501485)。

Image Set Visual Summarization Based on MFF-GAN

ZHANG Wenkai ^1,2,3,SUN Hao ^1,2,SUN Xian ^1,2,WANG Hongqi ^1,2

1.Institute of Electronics,Chinese Academy of Sciences,Beijing 100190,China; 2.Key Laboratory of Spatial Information Processing and Application System Technology, Chinese Academy of Sciences,Beijing 100190,China; 3.University of Chinese Academy of Sciences,Beijing 100190,China

Received:2018-01-23 Online:2019-02-15 Published:2019-02-15

摘要/Abstract

摘要：

现有图像集视觉总结方法主要使用浅层视觉特征,或者直接应用已训练的卷积神经网络模型提取图像深层特征,选取的图像不具代表性。为此,分析并研究图像集视觉总结的图像特征表示方法,提出多特征图融合生成对抗网络(MFF-GAN)模型。该模型中的判别器通过多特征图融合的方式提取图像特征,使提取的特征能表示图像细节和高层语义信息,并在多特征图融合层后添加自编码网络对特征进行降维,避免特征维度灾难问题。NUS-WIDE数据集上的实验结果验证了MFF-GAN模型的有效性,并表明其能有效提升图像集视觉总结多样性。

关键词: 生成对抗网络, 特征学习, 视觉总结, 多特征图融合, 自编码网络

Abstract:

Existing image set visual summarization methods primarily consider the low-level visual features of images,or deep features,which extracted from trained Convolutional Neural Network(CNN) model.It makes the selected image not representative.In order to solve the problem,this paper analyzes and studies the image feature representation method in the image set visual summarization,proposes a Multi-Feature Fusion Generative Adversarial Networks(MFF-GAN) model.The discriminator in the model extracts image features by means of multi-feature image fusion,so that the extracted features can represent image details and high-level semantic information.To reduce the dimensionality of feature,the encoder network is added after the fusion layer.Experimental results on NUS-WIDE dataset valify the effectiveness of the MFF-GAN model,and show it can improve the diversity of visual summarization.

Key words: Generative Adversarial Network(GAN), feature learning, visual summarization, multi-feature fusion, autoencoder network

中图分类号:

TP391.4

张文凯,孙皓,孙显,王宏琦. 基于MFF-GAN的图像集视觉总结[J]. 计算机工程, 2019, 45(2): 202-206.

ZHANG Wenkai,SUN Hao,SUN Xian,WANG Hongqi. Image Set Visual Summarization Based on MFF-GAN[J]. Computer Engineering, 2019, 45(2): 202-206.

https://www.ecice06.com/CN/Y2019/V45/I2/202

参考文献

［1］SADEGHI F,TENA J R,FARHADI A,et al.Learning to select and order vacation photographs［C］//Proceedings of IEEE Winter Conference on Applications of Computer Vision.Washington D.C.,USA:IEEE Press,2015:510-517.
［2］SIMON I,SNAVELY N,SEITZ S M.Scene summarization for online image collections［C］//Proceedings of the 11th Inter-national Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2007:1-8.
［3］LOWE D G.Distinctive image features from scale-invariant keypoints［J］.International Journal of Computer Vision,2004,60(2):91-110.
［4］TAN L,SONG Y,LIU S,et al.ImageHive:interactive content-aware image summarization［J］.IEEE Computer Graphics and Applications,2011,32(1):46-55.
［5］YANG C,SHEN J,FAN J.Effective summarization of large-scale web images［C］//Proceedings of the 19th ACM International Conference on Multimedia.New York,USA:ACM Press,2011:1145-1148.
［6］YANG C,SHEN J,PENG J,et al.Image collection summarization via dictionary learning for sparse representation［J］.Pattern Recognition,2013,46(3):948-961.
［7］ZHAO Y,HONG R C,JIANG J G.Visual summarization of image collections by fast RANSAC［J］.Neurocomputing,2016,172(C):48-52.
［8］赵烨,蒋建国,洪日昌.基于RANSAC的SIFT匹配优化［J］.光电工程,2014,41(8):58-65.
［9］SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition［EB/OL］.［2017-12-13］.https://arxiv.org/abs/1409.1556.
［10］KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks［C］//Proceedings of the 25th International Conference on Neural Information Processing Systems.New York,USA:ACM Press,2012:1097-1105.
［11］SHEN X,TIAN X.Multi-modal and multi-scale photo collection summarization［J］.Multimedia Tools and Applications,2016,75(5):1-15.
［12］李志明.基于卷积神经网络的虹膜活体检测算法研究［J］.计算机工程,2016,42(5):239-243,248.
［13］GOODFELLOW I J,POUGETABADIE J,MIRZA M,et al.Generative adversarial nets［EB/OL］.［2017-12-13］.http://blog.csdn.net/wspba/article/details/54582391.
［14］RADFORD A,METZ L,CHINTALA S.Unsupervised representation learning with deep convolutional generative adversarial networks［EB/OL］.［2017-12-13］.https://arxiv.org/abs/1511.06434.
［15］LIN D,FU K,WANG Y,et al.MARTA GANs:unsupervised representation learning for remote sensing image classification［J］.IEEE Geoscience and Remote Sensing Letters,2017,14(11):2092-2096.
［16］CHUA T S,TANG J,HONG R,et al.NUS-WIDE:a real-world web image database from National University of Singapore［C］//Proceedings of ACM International Conference on Image and Video Retrieval.New York,USA:ACM Press,2009:48-50.
［17］CAMARGO J E,GONZALEZ F A.Multimodal latent topic analysis for image collection summarization［J］.Information Sciences,2016,328(C):270-287.
［18］DING C H Q,LI T,JORDAN M I.Convex and semi-nonnegative matrix factorizations［J］.IEEE Tran-sactions on Pattern Analysis and Machine Intelligence,2010,32(1):45-55.

[1]	王夙喆, 张雪英, 陈晓玉, 李凤莲, 吴泽林. 基于有效注意力和GAN结合的脑卒中EEG增强算法[J]. 计算机工程, 2024, 50(8): 336-344.
[2]	胡庆. 多尺度融合与双输出U-Net网络的行人重识别[J]. 计算机工程, 2024, 50(6): 102-109.
[3]	张慧妍, 梁勇, 兰景宏, 赵强. 基于记忆模块与过滤式生成对抗网络的入侵检测方法[J]. 计算机工程, 2024, 50(6): 197-207.
[4]	李田芳, 普园媛, 赵征鹏, 徐丹, 钱文华. 基于CLIP和双空间自适应归一化的图像翻译[J]. 计算机工程, 2024, 50(5): 229-240.
[5]	刘帅威, 李智, 王国美, 张丽. 基于Transformer和GAN的对抗样本生成算法[J]. 计算机工程, 2024, 50(2): 180-187.
[6]	何银银, 胡静, 陈志泊, 张荣国. 融合门控变换机制和GAN的低光照图像增强方法[J]. 计算机工程, 2024, 50(2): 247-255.
[7]	张美美, 秦品乐, 柴锐, 曾建潮, 翟双姣, 闫俊义, 冯二燕. 面向急性缺血性脑卒中的CT生成MRI算法[J]. 计算机工程, 2024, 50(2): 317-326.
[8]	戴磊, 曹林, 郭亚男, 张帆, 杜康宁. 基于生成对抗网络的深度伪造跨模型防御方法[J]. 计算机工程, 2024, 50(10): 100-109.
[9]	张学军, 席阿友, 加小红, 张斌, 李梅, 杜晓刚, 黄海燕. 基于深度学习的指纹室内定位对抗样本攻击研究[J]. 计算机工程, 2024, 50(10): 228-239.
[10]	沈梦强, 于文年, 易黎, 宋南. 基于GAN的全时间尺度语音增强方法[J]. 计算机工程, 2023, 49(6): 115-122,130.
[11]	李培育, 张雅丽. 基于改进SRGAN模型的人脸图像超分辨率重建[J]. 计算机工程, 2023, 49(4): 199-205.
[12]	罗嗣卿, 陈慧. 基于生成对抗网络的图像场景转换[J]. 计算机工程, 2023, 49(4): 217-225.
[13]	翟社平, 张宇航, 柏晓夏. 融合实体邻域信息的知识图谱嵌入负采样方法[J]. 计算机工程, 2023, 49(3): 95-104.
[14]	席荣康, 蔡满春, 芦天亮. 基于数据增强与流数据处理的Tor流量分析模型[J]. 计算机工程, 2023, 49(3): 177-184.
[15]	陈锦生, 马文臻, 方少峰, 邹自明. 基于地基气辉图像的大气重力波目标识别[J]. 计算机工程, 2023, 49(11): 13-23.

选择文件类型/文献管理软件名称

选择包含的内容

基于MFF-GAN的图像集视觉总结

Image Set Visual Summarization Based on MFF-GAN

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于MFF-GAN的图像集视觉总结

Image Set Visual Summarization Based on MFF-GAN

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价