基于神经网络的AVS-P10开环模式选择算法优化

doi:10.19678/j.issn.1000-3428.0047850

计算机工程 ›› 2018, Vol. 44 ›› Issue (9): 256-262. doi: 10.19678/j.issn.1000-3428.0047850

基于神经网络的AVS-P10开环模式选择算法优化

崔佰会¹,高戈¹,姜林 ^1,2

1.武汉大学国家多媒体软件工程技术研究中心,武汉 430072; 2.东华理工大学软件学院,南昌 330013

收稿日期:2017-07-06 出版日期:2018-09-15 发布日期:2018-09-15
作者简介:崔佰会(1991—),男,硕士研究生,主研方向为音频信号编码;高戈,副教授;姜林,副教授、博士研究生。
基金资助:
国家自然科学基金“基于上下文相关的音频非盲带宽扩展编码研究”(61762005)。

Optimization of AVS-P10 Open-loop Mode Selection Algorithm Based on Neural Network

CUI Baihui ¹,GAO Ge¹,JIANG Lin^1,2

1.National Engineering Research Center for Multimedia Software,Wuhan University,Wuhan 430072,China; 2.Software College,East China University of Technology,Nanchang 330013,China

Received:2017-07-06 Online:2018-09-15 Published:2018-09-15

摘要/Abstract

摘要：

现有的开环模式选择算法依赖信号分类的准确率,但多数情况下准确率较低,造成开环模式下编码音质较差。为此,提出一种改进的基于神经网络的开环模式选择算法。使用神经网络替换原开环模式选择的决策树算法,拟合闭环模式选择结果进行训练得到模式选择分类器,按照闭环模式选择的逻辑过程,运用神经网络预测输入的信号,在ACELP256和TVC256两种编码模式的信噪比取代编码尝试计算得到的信噪比。实验结果表明,与原AVS-P10开环选择方法相比,提出的2种模式在语音分类准确率上分别提升5.96%和18.07%,在音乐分类准确率上分别提升3.84%和20.29%,其主客观编码音质评测明显提升。

关键词: 神经网络, 先进音视频编码, 模式选择, 特征选择, 信号分类, 信噪比估计

Abstract:

The existing open-loop mode selection algorithm relies on the accuracy of signal classification,but in most cases the accuracy is low,resulting in poor coding quality in open-loop mode.Therefore an improved neural network based open-loop mode selection algorithm is proposed.The decision tree algorithm of the original open-loop mode selection is replaced by the neural network,and the closed-loop mode selection is selected for training to obtain the mode selection classifier.According to the logic process of the closed-loop mode selection,the neural network is used to predict the input signal,and the two codes are ACELP256 and TVC256.The signal to noise ratio of the mode replaces the signal-to- noise ratio that the coding attempt is calculated.Experimental results show that the accuracy of the two methods is 5.96% and 18.07%,respectively,and the accuracy of music classification is 3.84% and 20.29%,the performance of subjective and objective tone has high improvement.

Key words: neural network, advanced audio and video coding, mode selection, feature selection, signal classification, Signal to Noise Ratio(SNR) estimation

中图分类号:

TP391

崔佰会,高戈,姜林. 基于神经网络的AVS-P10开环模式选择算法优化[J]. 计算机工程, 2018, 44(9): 256-262.

CUI Baihui,GAO Ge,JIANG Lin. Optimization of AVS-P10 Open-loop Mode Selection Algorithm Based on Neural Network[J]. Computer Engineering, 2018, 44(9): 256-262.

https://www.ecice06.com/CN/Y2018/V44/I9/256

参考文献

［1］数字音视频编解码技术标准工作组.GB/T 20090——2013信息技术先进音视频编码第10部分移动语音与音频编码标准［S］.2013.br/ ［2］BERND G,PETER V.High rate data hiding in ACELP speech codecs［C］//Proceedings of ICASSP’08.Washington D.C.,USA:IEEE Press,2008:4005-4008.br/ ［3］蒋三新.混合音频信号的压缩与重建方法研究［D］.上海:上海交通大学,2015.br/ ［4］LI X,ZHAO L,WEI L,et al.Deep saliency:multi-task deep neural network model for salient object detection［J］.IEEE Transactions on Image Processing,2016,25(8):3919-3930.br/ ［5］LECOMTE J,RICHARD G,LEFEBVIR R.An improved low complexity AMR-WB+ encoder using neural networks for mode selection［C］//Proceedings of Audio Engineering Society Conference.Washington D.C.,USA:IEEE Press,2007:125-132.br/ ［6］LING X U,HUANG B,YANG Z Q.AMR-WB+:a new audio coding standard for 3rd generation mobile audio services［J］.Audio Engineering,2008,2(2).br/ ［7］荣康,涂卫平,姜林.基于随机森林的AVS-P10开环编码质量优化算法［J］.计算机工程与应用,2016,52(24):57-61.br/ ［8］KIRA K,RENDELL L A.The feature selection problem:traditional methods and a new algorithm［C］//Proceedings of the 20th National Conference on Artificial Intelligence.［S.1.］:AAAI Press,1992:129-134.br/ ［9］李晓岚.基于Relief特征选择算法的研究与应用［D］.大连:大连理工大学,2013.br/ ［10］BREIMAN L.Bagging predictors［J］.Machine Learning,1996,24(2):123-140.br/ ［11］VAPNI K,VLADIMIR N.The nature of statistical learning theory［M］.Berlin,Germany:Springer,1995.br/ ［12］白亮.音频分类与分割技术研究［D］.长沙:国防科学技术大学,2004.br/ ［13］LU L,LI S Z,ZHANG H J.Content-based audio segmentation using support vector machines［C］//Proceedings of IEEE International Conference on Multimedia and Expo.Washington D.C.,USA:IEEE Press,2001:749-752.br/ ［14］HORNIK K,STINCHCOMBE M,WHITE H.Multilayer feedforward networks are universal approximators［J］.Neural Networks,1989,2(5):359-366.br/ ［15］BREIMAN L.Bagging predictors［J］.Machine Learning,1996,24(2):123-140.br/

[1]	何杏宇, 周易歆, 罗东旭, 杨桂松. 基于图神经网络和多主体评价的教学资源推荐[J]. 计算机工程, 2024, 50(7): 13-22.
[2]	耿丽丽, 牛保宁. 基于通道相似度熵的卷积神经网络裁剪[J]. 计算机工程, 2024, 50(7): 133-143.
[3]	张洋, 刘畅, 李少青. 基于可控制性度量的图神经网络门级硬件木马检测方法[J]. 计算机工程, 2024, 50(7): 164-173.
[4]	牛瑞婷, 严天峰, 高锐, 王映植. 低信噪比下基于深度学习TCNN-MobileNet的调制识别[J]. 计算机工程, 2024, 50(7): 204-215.
[5]	张溢文, 蔡满春, 陈咏豪, 朱懿, 姚利峰. 融合空间特征的多尺度深度伪造检测方法[J]. 计算机工程, 2024, 50(7): 240-250.
[6]	逯焕宇, 张永宏, 马光义, 谢东林, 田伟. 基于半监督对抗学习的遥感图像水体提取[J]. 计算机工程, 2024, 50(7): 251-263.
[7]	李云航, 潘晴, 田妮莉. 结构相似度优化的混合多尺度医学图像融合[J]. 计算机工程, 2024, 50(7): 264-270.
[8]	张正康, 杨丹, 聂铁铮, 寇月. 基于图结构聚类的自监督学习疾病诊断方法[J]. 计算机工程, 2024, 50(7): 360-371.
[9]	李亚康, 陈刚. 小角中子散射物理模型自动化筛选[J]. 计算机工程, 2024, 50(6): 56-64.
[10]	宋庆增, 刘向东, 许康为, 刘佳辉, 任二祥, 骆丽, 魏琦, 乔飞. 基于卷积神经网络的逐级唤醒存内计算控制器设计[J]. 计算机工程, 2024, 50(6): 328-335.
[11]	更藏措毛, 黄鹤鸣, 杨毅杰. 融合多尺度特征与上下文信息的语音增强方法[J]. 计算机工程, 2024, 50(6): 138-147.
[12]	于洋, 孙芳芳, 吕华, 李扬, 王晓民. 基于多尺度时空注意力网络的微表情检测方法[J]. 计算机工程, 2024, 50(6): 228-235.
[13]	孙文洁, 李宗民, 孙浩淼. 基于图神经网络的多智能体强化学习值函数分解方法[J]. 计算机工程, 2024, 50(5): 62-70.
[14]	游奔, 李晓红, 姚锦, 冯绍杰. 基于多粒度图与注意力机制的半监督短文本分类[J]. 计算机工程, 2024, 50(5): 83-90.
[15]	张宝鑫, 杨丹, 聂铁铮, 寇月. 基于自监督的多视角图协同过滤推荐方法[J]. 计算机工程, 2024, 50(5): 100-110.

选择文件类型/文献管理软件名称

选择包含的内容

基于神经网络的AVS-P10开环模式选择算法优化

Optimization of AVS-P10 Open-loop Mode Selection Algorithm Based on Neural Network

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于神经网络的AVS-P10开环模式选择算法优化

Optimization of AVS-P10 Open-loop Mode Selection Algorithm Based on Neural Network

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价