Research on Text Analysis for Dialect Speech Synthesis

doi:10.3969/j.issn.1000-3428.2015.09.034

Computer Engineering

Previous Articles Next Articles

Research on Text Analysis for Dialect Speech Synthesis

GUO Weitong¹,YANG Hongwu¹,SONG Jihua ²,GU Xiang¹,GAN Zhenye ¹

(1.College of Physics and Electronic Engineering,Northwest Normal University,Lanzhou 730070,China; 2.College of Information Science and Technology,Beijing Normal University,Beijing 100875,China)

Received:2014-09-11 Online:2015-09-15 Published:2015-09-15

面向方言语音合成的文本分析研究

郭威彤¹,杨鸿武¹,宋继华²,顾香¹,甘振业¹

(1.西北师范大学物理与电子工程学院,兰州 730070; 2.北京师范大学信息科学与技术学院,北京 100875)

作者简介:郭威彤(1982-)，女，硕士研究生，主研方向：自然语言处理，模式识别；杨鸿武(通讯作者)、宋继华，教授、博士；顾香，硕士研究生；甘振业，副教授、博士。
基金资助:
国家自然科学基金资助项目(61263036，61262055)；甘肃省杰出青年基金资助项目(1210RJDA007)；甘肃省青年科技研究计划基金资助项目(1208RJYA078)；西北师范大学青年教师科研能力提升计划基金资助项目(NWNU-LKQN-12-27)。

Abstract

Abstract: A text analysis method for converting grapheme to dialect phoneme is proposed for statistical parametric dialect speech synthesis.A set of Speech Assessment Methods Phonetic Alphabet(SAMPA)-based symbols are designed for labeling pronunciation of dialect by comparing the differences between Mandarin and dialect.A set of conversion rules is also designed that can transform Mandarin pronunciation to dialect pronunciation.The text analysis is conducted for Chinese sentences to obtain lexicon words and their initials and finals.A transformation-based error driven learning algorithm is used to obtain the prosodic words and prosodic phrases boundaries.The conversion rules are employed to obtain the SAMPA of dialect initials and dialect finals.The input sentences are converted into context-dependent labels.Test result shows that the proposed method can generate correct context-dependent labels.

Key words: text analysis, grapheme-to-phoneme conversion, Speech Assessment Methods Phonetic Alphabet(SAMPA), speech synthesis, syntactic analysis

摘要： 为实现方言的统计参数语音合成,提出一种从文字到方言读音的文本分析方法。通过对比普通话和方言在声韵母方面的发音异同,设计方言的语音评估方法音标字母(SAMPA),用来标注方言声韵母的读音,得到从普通话读音到方言读音的转换规则。对输入的汉语文本进行分析,获得语法词、声母、韵母信息,使用基于转换的错误驱动学习算法获得语句的韵律词和韵律短语边界,利用普通话读音到方言读音的转换规则,获得方言发音的SAMPA音标,从而将输入的文本转换为统计参数语音合成所需的上下文相关标注。测试结果表明,该方法能较为准确地生成上下文相关标注。

关键词: 文本分析, 字音转换, 语音评估方法音标字母, 语音合成, 语法分析

CLC Number:

TP391

GUO Weitong,YANG Hongwu,SONG Jihua,GU Xiang,GAN Zhenye. Research on Text Analysis for Dialect Speech Synthesis[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2015.09.034.

郭威彤,杨鸿武,宋继华,顾香,甘振业. 面向方言语音合成的文本分析研究[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2015.09.034.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2015.09.034

http://www.ecice06.com/EN/Y2015/V41/I9/184

References

参考文献［1］Chu Min,Lu Shinan.A Text-to-Speech System with High Intelligibility and High Naturalness for Chinese［J］.Chinese Journal of Acoustics,1996,15(1):81-90. ［2］Bourlard H,Dines J,Majimai-Doss M,et al.Current Trends in Multilingual Speech Processing［J］.Sadhana,2011,36(5):885-915. ［3］Yang Hongwu,Keiichiro O,Gan Zhenye,et al.Realizing Ti-betan Speech Synthesis by Speaker Adaptive Training［C］//Proceedings of Signal and Information Pro-cessing Associa-tion Annual Summit and Conference.Washington D.C.,USA:IEEE Press,2013:1-4. ［4］李晓红.面向语音合成的文本处理技术的改进［D］.北京:北京交通大学,2010. ［5］姚金国,代志龙.基于文本分析的知识获取系统设计与实现［J］.计算机工程,2011,37(2):157-159. ［6］陶建华,蔡莲红,赵晟.汉语语音合成中的文本分析和韵律处理［C］//中国中文信息学会20周年学术会议论文集.北京:清华大学出版社,2001:272-279. ［7］陈志刚.中文语音合成系统中文本分析的若干关键技术［D］.合肥:中国科学技术大学,2003. ［8］索南扎西.藏语语音合成关键技术研究［D］.拉萨:西藏大学,2011. ［9］高璐,陈琪,李永宏,等.藏语语音合成中文本分析的若干问题的研究［J］.西北民族大学学报:自然科学版,2010,31(2):27-33. ［10］马欢,吾守尔·斯拉木.维吾尔语文语转换系统文本分析模块初探［J］.计算机工程,2006,32(16):267-268. ［11］姚金国,代志龙.基于HCSIPA的中英文混合语音合成［J］.计算机工程,2013,39(4):14-17. ［12］Pan Nenghuang,Yu Mingshi,Tsai Z.A Chinese to Taiwanese Text-to-Speech System［J］.Communications of Institute of Information and Computing Machinery,2008,11(4):27-38. ［13］李明,蔡莲红,李勇,等.普通话与聊城话的声学特征对比及转换［C］//第7届中国语音学学术会议暨语音学前沿问题国际论坛论文集.北京:北京大学出版社,2006:1-4. ［14］贾珈,蔡莲红,李明,等.汉语普通话与沈阳方言转换的研究［J］.清华大学学报:自然科学版,2009,49(S1):1309-1315. ［15］王兵,苏恩泽.天津话语音合成系统［J］.计算技术与自动化,1995,14(4):37-39. ［16］Guo Weitong,Yang Hongwu,Pei Dong,et al.Prosody Conversion of Chinese Northwest Mandarin Dialect Based on Five Degree Tone Model［J］.JDCTA:International Journal of Digital Content Technology and Its Applications,2012,6(17):323-332. ［17］Zen Hega,Tokuda K,Black A.Statistical Parametric Speech Synthesis［J］.Speech Communication,2009,51(11):1039-1064. ［18］张家騄.汉语普通话机读音标SAMPA-SC［J］.声学学报,2009,34(1):81-86. ［19］贾玉祥,黄德智,刘武.中文语音合成中的文本正则化研究［J］.中文信息学报,2008,22(5):45-51. ［20］杨鸿武,朱玲.基于句法特征的汉语韵律边界预测［J］.西北师范大学学报:自然科学版,2013,49(1):41-45. 编辑刘冰

[1]	WANG Zhihui, WANG Xiaodong. Research on Text Classification Methods Based on Neural Network [J]. Computer Engineering, 2020, 46(3): 11-17.
[2]	ZHANG Pu, LI Xiao, LIU Chang. Evaluation Collocation Extraction Method Based on Rules [J]. Computer Engineering, 2019, 45(8): 217-223.
[3]	LI Yong,WEI Dang,WANG Liuyu. Emotional Speech Synthesis Method Based on PSOLA and DCT [J]. Computer Engineering, 2017, 43(12): 278-282,291.
[4]	HUANG Xuehua,KONG Fang,ZHOU Guodong. Expression Recognition and Anaphora Resolution in Chinese [J]. Computer Engineering, 2016, 42(9): 168-173.
[5]	GE Yongkan,YU Fengqin. Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model [J]. Computer Engineering, 2016, 42(12): 278-281,289.
[6]	FANG Shuang,YIN Junjie,XU Wuping. Web Text Feature Algorithm Based on Similar Image Clustering [J]. Computer Engineering, 2014, 40(12): 161-165,171.
[7]	HU Xi-Qing, LIN Shi-Beng. Research on Evaluation Object Extraction from Web Document [J]. Computer Engineering, 2011, 37(6): 30-31.
[8]	TAO Jin-Guo, DAI Zhi-Long. Design and Implementation of Text Analysis Based Knowledge Acquisition System [J]. Computer Engineering, 2011, 37(2): 157-159.
[9]	FAN Na; CAI Wan-dong; ZHAO Yu. Extraction of Subjective Relation in Opinion Sentences Based on Maximum Entropy Model [J]. Computer Engineering, 2010, 36(2): 4-6.
[10]	ZHAO Hui; LIN Cheng-long; TANG Chao-jing. Automatic Selecting Algorithm of Bimodal Corpus Based on Visual Triphone [J]. Computer Engineering, 2009, 35(17): 1-3.
[11]	ZHANG Chao-meng; LI Zhan-huai; WEN Zong-chen. Query Expansion with LCA Based Concept Tree Pruning [J]. Computer Engineering, 2009, 35(14): 45-48.
[12]	MA Huan;WUSHOUER·Silamu. Research on Text Analyze Model of the Uighur Text to Speech System [J]. Computer Engineering, 2006, 32(16): 267-268.

Please choose a citation manager

Content to export

Research on Text Analysis for Dialect Speech Synthesis

面向方言语音合成的文本分析研究

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 12

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Research on Text Analysis for Dialect Speech Synthesis

面向方言语音合成的文本分析研究

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 12

Recommended Articles

Metrics

Comments