Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering

Previous Articles     Next Articles

Research on Text Analysis for Dialect Speech Synthesis

GUO Weitong 1,YANG Hongwu  1,SONG Jihua  2,GU Xiang  1,GAN Zhenye  1   

  1. (1.College of Physics and Electronic Engineering,Northwest Normal University,Lanzhou 730070,China; 2.College of Information Science and Technology,Beijing Normal University,Beijing 100875,China)
  • Received:2014-09-11 Online:2015-09-15 Published:2015-09-15

面向方言语音合成的文本分析研究

郭威彤1,杨鸿武1,宋继华2,顾香1,甘振业1   

  1. (1.西北师范大学物理与电子工程学院,兰州 730070; 2.北京师范大学信息科学与技术学院,北京 100875)
  • 作者简介:郭威彤(1982-),女,硕士研究生,主研方向:自然语言处理,模式识别;杨鸿武(通讯作者)、宋继华,教授、博士;顾香,硕士研究生;甘振业,副教授、博士。
  • 基金资助:
    国家自然科学基金资助项目(61263036,61262055);甘肃省杰出青年基金资助项目(1210RJDA007);甘肃省青年科技研究计划基金资助项目(1208RJYA078);西北师范大学青年教师科研能力提升计划基金资助项目(NWNU-LKQN-12-27)。

Abstract: A text analysis method for converting grapheme to dialect phoneme is proposed for statistical parametric dialect speech synthesis.A set of Speech Assessment Methods Phonetic Alphabet(SAMPA)-based symbols are designed for labeling pronunciation of dialect by comparing the differences between Mandarin and dialect.A set of conversion rules is also designed that can transform Mandarin pronunciation to dialect pronunciation.The text analysis is conducted for Chinese sentences to obtain lexicon words and their initials and finals.A transformation-based error driven learning algorithm is used to obtain the prosodic words and prosodic phrases boundaries.The conversion rules are employed to obtain the SAMPA of dialect initials and dialect finals.The input sentences are converted into context-dependent labels.Test result shows that the proposed method can generate correct context-dependent labels.

Key words: text analysis, grapheme-to-phoneme conversion, Speech Assessment Methods Phonetic Alphabet(SAMPA), speech synthesis, syntactic analysis

摘要: 为实现方言的统计参数语音合成,提出一种从文字到方言读音的文本分析方法。通过对比普通话和方言在声韵母方面的发音异同,设计方言的语音评估方法音标字母(SAMPA),用来标注方言声韵母的读音,得到从普通话读音到方言读音的转换规则。对输入的汉语文本进行分析,获得语法词、声母、韵母信息,使用基于转换的错误驱动学习算法获得语句的韵律词和韵律短语边界,利用普通话读音到方言读音的转换规则,获得方言发音的SAMPA音标,从而将输入的文本转换为统计参数语音合成所需的上下文相关标注。测试结果表明,该方法能较为准确地生成上下文相关标注。

关键词: 文本分析, 字音转换, 语音评估方法音标字母, 语音合成, 语法分析

CLC Number: