References
[1]Tokuda K,Nankaku Y,Toda T,et al.Speech Synthesis Based on Hidden Markov Models[J].Proceedings of the IEEE,2013,101(5):1234-1252.
[2]王仁华,戴礼荣,胡郁,等.基于声学统计建模的新一代语音合成技术[J].中国科学技术大学学报,2008,38(7):725-734.
[3]Merritt T,King S.Investigating the Shortcomings of HMM Synthesis[C]//Proceedings of the 8th ISCA Workshop on Speech Synthesis.Barcelona,Spain:Atlantis Press,2013:185-190.
[4]Yoshimura T,Tokuda K,Masuko T,et al.Incorporating a Mixed Excitation Model and Postfilter into HMM-based Text-to-speech Synthesis[J].Systems & Computers in Japan,2005,36(12):43-50.
[5]Cabral J P,Renals S,Yamagishi J,et al.HMM-based Speech Synthesiser Using the LF-model of the Glottal Source[C]//Proceedings of IEEE International Conference on Acoustics Speech & Signal Processing.Washington D.C.,USA:IEEE Press,2011:4704-4707.
[6]Drugman T,Dutoit T.The Deterministic Plus Stochastic Model of the Residual Signal and Its Applications[J].IEEE Transactions on Audio Speech & Language Processing,2012,20(3):968-981.
[7]Raitio T,Suni A,Yamagishi J,et al.HMM-based Speech Synthesis Utilizing Glottal Inverse Filtering[J].IEEE Transactions on Audio Speech & Language Processing,2011,19(1):153-165.
[8]Stylianou Y.Applying the Harmonic Plus Noise Model in Concatenative Speech Synthesis[J].IEEE Transactions on Speech & Audio Processing,2001,9(1):21-29.
[9]Degottex G,Stylianou Y.Analysis and Synthesis of Speech Using an Adaptive Full-band Harmonic Model[J].IEEE Transactions on Audio Speech & Language Processing,2013,21(10):2085-2095.
[10]郭威彤,杨鸿武,宋继华,等.面向方言语音合成的文本分析研究[J].计算机工程,2015,41(9):184-189.
[11]吴义坚,王仁华.基于HMM的可训练中文语音合成[J].中文信息学报,2006,20(4):75-81.
[12]Pantazis Y,Stylianou Y.Improving the Modeling of the Noise Part in the Harmonic Plus Noise Model of Speech[C]//Proceedings of IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2008:4609-4612.
[13]Kominek J,Black A W.The CMU Arctic Speech Databases[C]//Proceedings of the 5th ISCA Workshop on Speech Synthesis.Pittsburgh,USA:Atlantis Press,2004:223-224.
[14]Klabbers E,Veldhuis R.On the Reduction of Concatenation Artefacts in Diphone Synthesis[C]//Proceedings of the 8th International Conference on Spoken Language Processing.New York,USA:ACM Press,1998:1983-1986.
[15]Hanson H M,Chuang E S.Glottal Characteristics of Male Speakers:Acoustic Correlates and Comparison with Female Data[J].Journal of the Acoustical Society of America,1999,106(2):1064-1077.
Editor: 陆燕菲