基于谐波加噪声激励模型的改进语音合成算法

doi:10.3969/j.issn.1000-3428.2016.12.047

计算机工程

基于谐波加噪声激励模型的改进语音合成算法

戈永侃,于凤芹

(江南大学物联网工程学院,江苏无锡 214122)

收稿日期:2015-12-10 出版日期:2016-12-15 发布日期:2016-12-15
作者简介:戈永侃(1991—),男,硕士研究生,主研方向为语音信号处理;于凤芹,教授、博士。

Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model

GE Yongkan,YU Fengqin

(School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China)

Received:2015-12-10 Online:2016-12-15 Published:2016-12-15

摘要/Abstract

摘要： 传统基于隐马尔科夫模型(HMM)的语音合成算法使用高斯白噪声和脉冲串来表示清浊音的激励信号,合成的语音较为嘈杂。为提高合成音质,基于谐波加噪声激励模型,提出一种语音合成算法。将语音信号逆滤波得到声门波信号,对声门波信号进行谐波分析提取谐波成分,并计算谐波成分的线谱对参数作为谐波特征进行HMM训练。在语音合成时根据新生成的特征参数重构出低频段谐波部分与高频段噪声部分,并将两者混合作为语音的激励信号进行语音合成。实验结果表明,与基于脉冲激励的语音合成算法相比,该算法生成的语音频谱更接近自然语音,并且能够有效地减轻合成语音的机器声,提高合成语音的自然度。

关键词: 语音合成, 谐波加噪声模型, 激励信号, 逆滤波, 隐马尔科夫模型

Abstract: The excitation signal used in the traditional Hidden Markov Model(HMM)-based speech synthesis algorithm is either a pulse train or white gaussian noise,and the synthesis speech sounds buzzy.An improved speech synthesis algorithm based on harmonic plus noise excitation model is proposed to enhance the quality of speech.After inverse filtering,the harmonic signal in glottal flow is extracted and modeled by Linear Spectrum Pairs(LSP) coefficients.The LSP coefficients are sent into HMM training as the harmonic feature.In synthesis stage,the harmonic part and the noise part are reconstructed from the newly generated coefficients and mixed together as the excitation of speech signal.Experiment results demonstrate that the excitation generated by this algorithm is more accurate compared with speech synthesis algorithm based on pulsed excitation.The algorithm can effectively relieve machine noise of synthesized speech and improve the naturalness of the speech.

Key words: speech synthesis, harmonic plus noise model, excitation signal, inverse filtering, Hidden Markov Model(HMM)

中图分类号:

TN912.33

戈永侃,于凤芹. 基于谐波加噪声激励模型的改进语音合成算法[J]. 计算机工程.

GE Yongkan,YU Fengqin. Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model[J]. Computer Engineering.

https://www.ecice06.com/CN/Y2016/V42/I12/278

参考文献

参考文献［1］Tokuda K,Nankaku Y,Toda T,et al.Speech Synthesis Based on Hidden Markov Models［J］.Proceedings of the IEEE,2013,101(5):1234-1252. ［2］王仁华,戴礼荣,胡郁,等.基于声学统计建模的新一代语音合成技术［J］.中国科学技术大学学报,2008,38(7):725-734. ［3］Merritt T,King S.Investigating the Shortcomings of HMM Synthesis［C］//Proceedings of the 8th ISCA Workshop on Speech Synthesis.Barcelona,Spain:Atlantis Press,2013:185-190. ［4］Yoshimura T,Tokuda K,Masuko T,et al.Incorporating a Mixed Excitation Model and Postfilter into HMM-based Text-to-speech Synthesis［J］.Systems & Com-puters in Japan,2005,36(12):43-50. (下转第289页) (上接第281页) ［5］Cabral J P,Renals S,Yamagishi J,et al.HMM-based Speech Synthesiser Using the LF-model of the Glottal Source［C］//Proceedings of IEEE International Conference on Acoustics Speech & Signal Processing.Washington D.C.,USA:IEEE Press,2011:4704-4707. ［6］Drugman T,Dutoit T.The Deterministic Plus Stochastic Model of the Residual Signal and Its Applications［J］.IEEE Transactions on Audio Speech & Language Processing,2012,20(3):968-981. ［7］Raitio T,Suni A,Yamagishi J,et al.HMM-based Speech Synthesis Utilizing Glottal Inverse Filtering［J］.IEEE Transactions on Audio Speech & Language Processing,2011,19(1):153-165. ［8］Stylianou Y.Applying the Harmonic Plus Noise Model in Concatenative Speech Synthesis［J］.IEEE Tran-sactions on Speech & Audio Processing,2001,9(1):21-29. ［9］Degottex G,Stylianou Y.Analysis and Synthesis of Speech Using an Adaptive Full-band Harmonic Model［J］.IEEE Transactions on Audio Speech & Language Processing,2013,21(10):2085-2095. ［10］郭威彤,杨鸿武,宋继华,等.面向方言语音合成的文本分析研究［J］.计算机工程,2015,41(9):184-189. ［11］吴义坚,王仁华.基于HMM的可训练中文语音合成［J］.中文信息学报,2006,20(4):75-81. ［12］Pantazis Y,Stylianou Y.Improving the Modeling of the Noise Part in the Harmonic Plus Noise Model of Speech［C］//Proceedings of IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2008:4609-4612. ［13］Kominek J,Black A W.The CMU Arctic Speech Databases［C］//Proceedings of the 5th ISCA Workshop on Speech Synthesis.Pittsburgh,USA:Atlantis Press,2004:223-224. ［14］Klabbers E,Veldhuis R.On the Reduction of Con-catenation Artefacts in Diphone Synthesis［C］//Proceedings of the 8th International Conference on Spoken Language Processing.New York,USA:ACM Press,1998:1983-1986. ［15］Hanson H M,Chuang E S.Glottal Characteristics of Male Speakers:Acoustic Correlates and Comparison with Female Data［J］.Journal of the Acoustical Society of America,1999,106(2):1064-1077. 编辑陆燕菲

[1]	郑文秀, 赵峻毅, 文心怡, 姚引娣. 基于瓶颈复合特征的声学模型建立方法[J]. 计算机工程, 2020, 46(11): 301-305,314.
[2]	吴建伟,李艳玲,臧翰林. 基于改进帧结构的认知网络吞吐量优化方法[J]. 计算机工程, 2018, 44(6): 45-49.
[3]	胡志隆,文畅,谢凯,贺建飚. 联合HMM-UBM与RVM的声纹密码识别算法[J]. 计算机工程, 2018, 44(11): 129-134.
[4]	李勇,魏珰,王柳渝. 基于PSOLA与DCT的情感语音合成方法[J]. 计算机工程, 2017, 43(12): 278-282,291.
[5]	崔建国,高波,蒋丽英,于明月,郑蔚. LSSVM与HMM在航空发动机状态预测中的应用研究[J]. 计算机工程, 2017, 43(10): 310-315.
[6]	郭威彤,杨鸿武,宋继华,顾香,甘振业. 面向方言语音合成的文本分析研究[J]. 计算机工程, 2015, 41(9): 184-189.
[7]	蔡文学,邱珠成,黄晓宇,萧超武,陈康. 基于WiFi 指纹的室内轨迹定位模型[J]. 计算机工程, 2015, 41(6): 76-82.
[8]	肖佳林，赵聿晴，王英. 基于HMM与SVM的语音活动检测[J]. 计算机工程, 2014, 40(1): 203-208.
[9]	乐娟, 赵玺. 基于HMM的京剧机构命名实体识别算法[J]. 计算机工程, 2013, 39(6): 266-271,286.
[10]	徐英进, 蔡莲红. 基于HCSIPA的中英文混合语音合成[J]. 计算机工程, 2013, 39(4): 14-17.
[11]	袁浩, 李海洋, 郑铁然, 韩纪庆. 基于相邻帧特征相似性的快速关键词检出方法[J]. 计算机工程, 2012, 38(7): 287-289.
[12]	王晓燕, 曾庆宁, 粟秀尹. 基于PCA和HMM的心音自动识别系统[J]. 计算机工程, 2012, 38(20): 148-151.
[13]	热娜古丽?达古提, 艾斯卡尔?艾木都拉, 地里木拉提?吐尔逊. 维吾尔语CVC型音节韵律特征声学分析[J]. 计算机工程, 2011, 37(9): 193-195.
[14]	杨扬, 王志良, 杨溢. 数字家庭环境中双手交互技术研究[J]. 计算机工程, 2011, 37(4): 29-30.
[15]	欧阳宁, 宁瑞芳, 莫建文, 张彤, 刘丽群. LHMM熵的聚众事件实时检测[J]. 计算机工程, 2011, 37(20): 160-162.

选择文件类型/文献管理软件名称

选择包含的内容

基于谐波加噪声激励模型的改进语音合成算法

Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于谐波加噪声激励模型的改进语音合成算法

Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价