[1] |
LOIZOU P C.Speech enhancement:theory and practice[M].[S.1.]:CRC Press,2007.
|
[2] |
YUAN Wenhao,SUN Wenzhu,XIA Bin,et al.Improving speech enhancement in unseen noise using deep convolutional neural network[J].Acta Automatica Sinica,2018,44(4):751-759.(in Chinese)袁文浩,孙文珠,夏斌,等.利用深度卷积神经网络提高未知噪声下的语音增强性能[J].自动化学报,2018,44(4):751-759.
|
[3] |
MOHAMMADIHA N,SMARAGDIS P,LEIJON A.Supervised and unsupervised speech enhancement using nonnegative matrix factorization[J].IEEE Transactions on Audio,Speech,and Language Processing,2013,21(10):2140-2151.
|
[4] |
LIU Wenju,NIE Suai,LIANG Shan,et al.Deep learning based speech separation technology and its developments[J].Acta Automatica Sinica,2016,42(6):819-833.(in Chinese)刘文举,聂帅,梁山,等.基于深度学习语音分离技术的研究现状与进展[J].自动化学报,2016,42(6):819-833.
|
[5] |
EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error log-spectral amplitude estimator[J].IEEE Transactions on Acoustics,Speech,and Signal Processing,1985,33(2):443-445.
|
[6] |
COHEN I.Noise spectrum estimation in adverse environments:improved minima controlled recursive averaging[J].IEEE Transactions on Speech and Audio Processing,2003,11(5):466-475.
|
[7] |
ZENG Qingyu,XIAO Qiang,WANG Yao,et al.A dual mmicro-array speech enhancement method[J].Journal of Electronics & Information Technology,2018,40(5):1187-1194.(in Chinese)曾庆宁,肖强,王瑶,等.一种双微阵列语音增强方法[J].电子与信息学报,2018,40(5):1187-1194.
|
[8] |
HAN Wei,ZHANG Xiongwei,MIN Gang,et al.A single-channel speech enhancement approach based on perceptual masking deep neural network[J].Acta Automatica Sinica,2017,43(2):248-258.(in Chinese)韩伟,张雄伟,闵刚,等.基于感知掩蔽深度神经网络的单通道语音增强方法[J].自动化学报,2017,43(2):248-258.
|
[9] |
XU Yong,DU Jun,DAI Lirong,et al.An experimental study on speech enhancement based on deep neural networks[J].IEEE Signal Processing Letters,2014,21(1):65-68.
|
[10] |
XU Yong,DU Jun,DAI Lirong,et al.A regression approach to speech enhancement based on deep neural networks[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2015,23(1):7-19.
|
[11] |
BALDUZZI D,GHIFARY M.Strongly-typed recurrent neural networks[EB/OL].[2019-03-10].https://www.researchgate.net/publication.
|
[12] |
HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural computation,1997,9(8):1735-1780.
|
[13] |
BRADBURY J,MERITY S.Quasi-recurrent neural networks[EB/OL].[2019-03-10].https://www.researchgate.net/publication.
|
[14] |
GAROFOLO J S,LAMEL L F,FISHER W M,et al.TIMIT acoustic-phonetic continuous speech corpus[EB/OL].[2019-03-10].http://catalog.Ldc.upenn.edu/LDC93S1.
|
[15] |
HU G.100 nonspeech environment sounds[EB/OL].[2019-03-10].http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus,html.
|
[16] |
VARGA A,STEENEKEN H J M.Assessment for automatic speech recognition:II.NOISEX-92:a database and an experiment to study the effect of additive noise on speech recognition systems[J].Speech Communication,1993,12(3):247-251.
|
[17] |
YU D,EYERSOLE A,SELTZER M,et al.An introduction to computational networks and the computational network toolkit[EB/OL].[2019-03-10].https://www.microsoft.com/en-us/research/publication.
|
[18] |
RIX A W,BEEREDNS J G,HOLLIER M P,et al.Perceptual evaluation of speech quality:a new method for speech quality assessment of telephone networks and codecs[C]//Proceedings of 2001 IEEE International Conference on Acoustics,Speech,and Signal Processing.Washington D.C.,USA:IEEE Press,2001:749-752.
|
[19] |
TAAL C H,HENDRIKS R C,HEUSDENS R,et al.An algorithm for intelligibility prediction of time frequency weighted noisy speech[J].IEEE Transactions on Audio,Speech,and Language Processing,2011,19(7):2125-2136.
|