基于POWER8的动态自适应池化算法

doi:10.3969/j.issn.1000-3428.2016.05.035

计算机工程

基于POWER8的动态自适应池化算法

景维鹏,张兴革

(东北林业大学信息与计算机工程学院,哈尔滨 150040)

收稿日期:2015-12-16 出版日期:2016-05-15 发布日期:2016-05-13
作者简介:景维鹏(1979-),男,副教授、博士,主研方向为语音识别、云计算;张兴革,硕士研究生。
基金资助:
黑龙江省自然科学基金资助项目(ZD201403);公益性行业科研专项基金资助项目(201504307)。

Dynamic Adaptive Pooling Algorithm Based on POWER8

JING Weipeng,ZHANG Xingge

(College of Information and Computer Engineering,Northeast Forestry University,Harbin 150040,China)

Received:2015-12-16 Online:2016-05-15 Published:2016-05-13

摘要/Abstract

摘要： 针对当前卷积神经网络(CNN)模型中池化层关键语音特征提取效率低下的问题,提出一种基于POWER8架构的动态自适应池化(DA-Pooling)算法。在深度学习工具Caffe上实现CNN模型,输入经过卷积层的梅尔域滤波带系数,提取局部相邻语音的特征数据,通过计算Spearman相关系数确定数据间的相关程度。根据特征权重对具有不同相关性的语音数据动态分配池化算法,以提高池化层对不同相关性数据的适应能力。DA-Pooling利用POWER8的高效浮点运算和多线程并行计算优势,提高了海量语音数据的处理效率。实验结果证明,相比现有主流 Pooling算法,DA-Pooling可提高关键语音数据的识别准确率,保证CNN中语音识别的稳定性。

关键词: 卷积神经网络, POWER8架构, 池化算法, Caffe深度学习工具, 语音特征提取, 数据相关性

Abstract: Aiming at the problem of low efficiency to extract the key speech feature in the pooling layer of the current Convolutional Neural Network(CNN) model,a Dynamic Adaptive Pooling(DA-Pooling) algorithm based on POWER8 architecture is proposed.The algorithm implements a CNN model on the deep learning tool called Caffe.The implementation method is as follows:taking filter bank features by means of the convolutional operation as input firstly,extracting local adjacent acoustic characteristic data,calculating the Spearman correlation coefficient of the extracted data to determine data correlation,making appropriate the pooling algorithm for different correlation of data according to weight.The DA-Pooling algorithm is based on the POWER8’s high-performance processing platform which has high efficient floating-point arithmetic unit and multi thread parallel technology to improve the efficiency of processing massive data.Experimental result shows that DA-Pooling algorithm can improve the recognition accuracy of the key speech data compared with the popular Pooling algorithm,and thereby improve the stability of speech signal recognition in the entire CNN.

Key words: Convolutional Neural Network(CNN), POWER8 architecture, pooling algorithm, Caffe deep learning tool, speech feature extraction, data correlation

中图分类号:

TP393

景维鹏,张兴革. 基于POWER8的动态自适应池化算法[J]. 计算机工程.

JING Weipeng,ZHANG Xingge. Dynamic Adaptive Pooling Algorithm Based on POWER8[J]. Computer Engineering.

https://www.ecice06.com/CN/Y2016/V42/I5/207

参考文献

参考文献［1］Hinton G E,Salakhutdinov R R.Reducing the Dimen-sionality of Data with Neural Networks［J］.Science,2006,313(5786):504-507. ［2］Sinharoy B,van Norstrand J A,Eickemeyer R J,et al.IBM POWER8 Processor Core Microarchitecture［J］.IBM Journal of Research and Development,2015,59(1):1-21. ［3］Abdel-Hamid O,Mohamed A,Jiang H,et al.Applying Convolutional Neural Networks Concepts to Hybrid NN-HMM Model for Speech Recognition［C］//Proceedings of IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2012:4277-4280. ［4］Sainath T N,Mohamed A,Kingsbury B,et al.Deep Convolutional Neural Networks for LVCSR［C］//Pro-ceedings of IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2013:8614-8618. ［5］Vanhoucke V,Senior A,Mao M Z.Improving the Speed of Neural Networks on CPUs［C］//Proceedings of Deep Learning and Unsupervised Feature Learning NIPS Workshop.［S.l.］:NIPS,2011:1-8. ［6］Dean J,Corrado G,Monga R,et al.Large Scale Distributed Deep Networks［Z］.2012. ［7］张佳康,陈庆奎.基于CUDA技术的卷积神经网络识算法［J］.计算机工程,2010,36(15):179-181. ［8］Sainath T N,Kingsbury B,Mohamed A,et al.Improvements to Deep Convolutional Neural Networks for LVCSR［C］//Proceedings of International Con-ference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2013:315- 320. ［9］Boureau Y L,Ponce J,Lecun Y.A Theoretical Analysis of Feature Pooling in Visual Recognition［C］//Pro-ceedings of the 27th International Conference on Machine Learning.Haifa,Israel:IMLS Press,2010:111-118. ［10］Zeiler M D.Stochastic Pooling for Regularization of Deep Convolutional Neural Networks［EB/OL］.［2015-07-16］.http://www.arxiv.org/pdf/1301.3557.pdf. ［11］He K,Zhang X,Ren S,et al.Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recogni-tion［J］.IEEE Transactions on Pattern Analysis & Machine Intelligence,2015,37(9):1904-1916. ［12］Lehmann E L,Abrera H J M.Nonparametrics:Statistical Methods Based on Ranks［J］.International Encyclopedia of Education,2010,83(1977):347-353. ［13］张晴晴,刘勇,潘接林,等.基于卷积神经网络的连续语音识别［J］.工程科学学报,2015,(9):1212-1217. ［14］Jia Y,Shelhamer E,Donahue J,et al.CAFFE:Con-volutional Architecture for Fast Feature Embedding［C］//Proceedings of ACM International Conference on Multimedia.New York,USA:ACM Press,2014:675-678. ［15］Panayotov V,Chen G,Povey D,et al.Librispeech:An ASR Corpus Based on Public Domain Audio Books［C］//Proceedings of International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2015:5206-5210. 编辑陆燕菲

[1]	杨萍, 张汐. 改进DeepLabv3+的道路表面裂缝检测方法[J]. 计算机工程, 2025, 51(4): 261-270.
[2]	张肇鑫, 黄世泽, 张兵杰, 沈拓. 面向交通场景的运动模糊伪装对抗样本生成方法[J]. 计算机工程, 2025, 51(3): 45-53.
[3]	胡书林, 张华军, 邓小涛, 王征华. 结合依存图卷积的中文文本相似度计算研究[J]. 计算机工程, 2025, 51(3): 76-85.
[4]	张树华, 王继业, 赵传奇, 陈宏铭, 郭咏雯. 面向输电线路边缘智能的硬件加速设计[J]. 计算机工程, 2025, 51(2): 213-222.
[5]	张会影, 圣文顺. 基于标记适应的人脸年龄识别优化算法[J]. 计算机工程, 2025, 51(1): 174-181.
[6]	郑雅洲, 刘万平, 黄东. 一种基于注意力机制的BERT-CNN-GRU检测方法[J]. 计算机工程, 2025, 51(1): 258-268.
[7]	易鹏, 杨晔, 严仕嘉. 基于MPCNN模型的sEMG快速迁移学习的手势识别应用研究[J]. 计算机工程, 2025, 51(1): 304-311.
[8]	张鲁, 田春伟, 宋焕生, 刘侍刚. 用于低剂量CT图像去噪的多级双树复小波网络[J]. 计算机工程, 2024, 50(9): 266-275.
[9]	高煜宝, 文志诚. 基于注意力机制的双路解码器图像去噪方法[J]. 计算机工程, 2024, 50(9): 324-332.
[10]	王志浩, 钱沄涛. 基于Swin Transformer的双流遥感图像时空融合超分辨率重建[J]. 计算机工程, 2024, 50(9): 33-45.
[11]	李俊俊, 董建刚, 李坤. 基于Kubernetes的集群节能策略研究[J]. 计算机工程, 2024, 50(9): 82-91.
[12]	王蕾, 党时鹏, 潘丰. 基于卷积神经网络的隐匿性旁路预测模型[J]. 计算机工程, 2024, 50(8): 40-49.
[13]	耿丽丽, 牛保宁. 基于通道相似度熵的卷积神经网络裁剪[J]. 计算机工程, 2024, 50(7): 133-143.
[14]	张洋, 刘畅, 李少青. 基于可控制性度量的图神经网络门级硬件木马检测方法[J]. 计算机工程, 2024, 50(7): 164-173.
[15]	牛瑞婷, 严天峰, 高锐, 王映植. 低信噪比下基于深度学习TCNN-MobileNet的调制识别[J]. 计算机工程, 2024, 50(7): 204-215.

选择文件类型/文献管理软件名称

选择包含的内容

基于POWER8的动态自适应池化算法

Dynamic Adaptive Pooling Algorithm Based on POWER8

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于POWER8的动态自适应池化算法

Dynamic Adaptive Pooling Algorithm Based on POWER8

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价