[1] AGARWALLA S,SARMA K K.Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech[J].Neural Networks,2016,78:97-111. [2] DESAI N P,LEHMAN C,MUNSON B,et al.Supervised and unsupervised machine learning approaches to classifying chimpanzee vocalizations[J].The Journal of the Acoustical Society of America,2018,143(3):1786-1786. [3] MONAGHAN J J M,GOEHRING T,YANG X,et al.Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners[J].The Journal of the Acoustical Society of America,2017,141(3):1985-1998. [4] FAYEK H M,LECH M,CAVEDON L.Evaluating deep learning architectures for speech emotion recognition[J].Neural Networks,2017,92:60-68. [5] WU Bo,LI Kehuang,GE Fengpei,et al.An end-to-end deep learning approach to simultaneous speech dereverberation and acoustic modeling for robust speech recognition[J].IEEE Journal of Selected Topics in Signal Processing,2017,11(8):1289-1300. [6] LEE J,SKOGLUND J,SHABESTARY T,et al.Phase-sensitive joint learning algorithms for deep learning-based speech enhancement[J].IEEE Signal Processing Letters,2018,25(8):1276-1280. [7] ZHOU Yan,LIU Tao,SHANG Li.Speech sparse decomposition algorithm based on immune matching pursuit[J].Computer Engineering,2012,38(21):161-163,167.(in Chinese)周燕,刘韬,尚丽.基于免疫匹配追踪的语音稀疏分解算法[J].计算机工程,2012,38(21):161-163,167. [8] ABDELKHALIK O,DARANI S.Evolving hidden genes in genetic algorithms for systems architecture optimization[J].Journal of Dynamic Systems,Measurement,and Control,2018,140(10):101015-101026. [9] TANG Min,JIN Jian,LIU Ying,et al.Integrating topic,sentiment,and syntax for modeling online reviews:a topic model approach[J].Journal of Computing and Information Science in Engineering,2019,19(1):6-20. [10] WANG Bo,YU Fengqin.Speech endpoint detection based on multi-scale sample entropy and threshold[J].Computer Engineering,2016,42(12):268-271.(in Chinese)王波,于凤芹.基于多尺度样本熵与阈值的语音端点检测[J].计算机工程,2016,42(12):268-271. [11] XIAO Xi,ZHOU Lu.Speech recognition adaptive clustering feature extraction algorithms based on the k-means algorithm and the normalized intra-class variance[J].Journal of Tsinghua University(Science and Technology),2017,57(8):857-861.(in Chinese)肖熙,周路.基于k均值和基于归一化类内方差的语音识别自适应聚类特征提取算法[J].清华大学学报(自然科学版),2017,57(8):857-861. [12] ZHU Chunli,LI Xin.Speech endpoint detection method based on LMS noise reduction and improved dual-threshold[J].Journal of System Simulation,2017,29(9):1950-1960,1967.(in Chinese)朱春利,李昕.基于LMS减噪与改进的双门限语音端点检测方法[J].系统仿真学报,2017,29(9):1950-1960,1967. [13] SHIN W H,LEE B S,LEE Y K,et al.Speech/non-speech classification using multiple features for robust endpoint detection[C]//Proceedings of 2000 IEEE International Conference on Acoustics,Speech,and Signal Processing.Washington D.C.,USA:IEEE Press,2000:1399-1402. [14] ZHANG Long,XU Xu,CHEN Huang,et al.Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation[J].Speech Communication,2018,97:1-8. [15] VIEN Q T,NGUYEN H X,TRESTIAN R,et al.A hybrid double-threshold based cooperative spectrum sensing over fading channels[J].IEEE Transactions on Wireless Communications,2015,15(3):1821-1834. [16] DUSTOR A.Influence of noise and voice activity detection on speaker verification[C]//Proceedings of International Conference on Computer Networks.Berlin,Germany:Springer,2016:207-215. [17] SCHAFER R W.Homomorphic systems and cepstrum analysis of speech[M].Berlin,Germany:Springer,2008. [18] ZHANG Rui,HU Ruimin,LI Gang,et al.Spectral tilt estimation for speech intelligibility enhancement using RNN based on all-pole model[C]//Proceedings of International Conference on Multimedia Modeling.Berlin,Germany:Springer,2019:144-156. [19] REN Yanzhen,LIU Dengkai,YANG Jing,et al.An AMR adaptive steganographic scheme based on the pitch delay of unvoiced speech[J].Multimedia Tools and Applications,2019,78:8091-8111. |