
Computer Engineering (计算机工程)

• Development Research and Engineering Application •

Chinese title: 结合分层条件随机场与标点符号的维吾尔语韵律边界预测

Authors (Chinese names): 姑丽加玛丽·麦麦提艾力 1a, 艾斯卡尔·肉孜 2a, 古力米热·依玛木 1b, 艾斯卡尔·艾木都拉 2b

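  1. (1. Xinjiang Normal University, a. School of Mathematical Sciences; b. School of Literature, Urumqi 830017; 2. Xinjiang University, a. School of Mathematics and System Science; b. School of Software, Urumqi 830046)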
  • Received: 2014-10-22 Publication date: 2015-11-15 Release date: 2015-11-13
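  • About the authors: Guljamal Mamateli (born 1984), female, associate professor, Ph.D., main research interests: speech and language processing; Askar Rozi, lecturer; Gulmire Imam, associate professor; Askar Hamdulla, professor.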
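  • Funding:
    National Natural Science Foundation of China (61462087); Ministry of Education Social Science Fund project (10YJA740027); Scientific Research Program of Universities in the Xinjiang Uyghur Autonomous Region (XJEDU2013S27); Xinjiang Normal University Doctoral and Postdoctoral Research Start-up Fund (XJNUBS1308).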

Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation

Guljamal Mamateli 1a, Askar Rozi 2a, Gulmire Imam 1b, Askar Hamdulla 2b

  1. (1a. School of Mathematical Sciences; 1b. School of Literature, Xinjiang Normal University, Urumqi 830017, China; 2a. School of Mathematics and System Science; 2b. School of Software, Xinjiang University, Urumqi 830046, China)
  • Received: 2014-10-22 Online: 2015-11-15 Published: 2015-11-13

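Abstract (translated from the Chinese): Correct prediction of prosodic structure is an important part of a speech synthesis system with high naturalness. In view of the agglutinative nature of Uyghur, the corresponding prosodic hierarchy is given, and a layered bottom-up method based on Conditional Random Fields (CRF) is used to predict Uyghur prosodic word and prosodic phrase boundaries, with Uyghur morphological features serving as important features of the boundary prediction model. To further correct prosodic boundary prediction errors and to resolve the ambiguity between different prosodic boundaries at punctuation boundaries, a CRF-based punctuation prosodic boundary prediction model is built with the punctuation boundary as its unit and combined with the two-layer bottom-up CRF model, yielding a prosodic boundary prediction method. Different feature templates and models are tested in repeated experiments to obtain better prediction performance. Experimental results show that the method markedly improves the recall of prosodic boundary prediction.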

Keywords (translated from the Chinese): Uyghur, prosodic boundary, hierarchical method, punctuation, morphological features

Abstract: Correct prosodic boundary prediction is crucial for the quality of synthesized speech. This paper presents a prosodic hierarchy for Uyghur, which is an agglutinative language. A two-layer bottom-up hierarchical approach based on Conditional Random Fields (CRF) is used to predict prosodic word and prosodic phrase boundaries. Morphological features are considered useful for prosodic boundary prediction and are added to the feature sets. To further improve the accuracy of prosodic boundary prediction at punctuation sites, a CRF-based prosodic boundary determination method is applied there and integrated with the bottom-up hierarchical approach. Different feature sets and models are compared in extensive repeated experiments to obtain the best prediction performance. Experimental results show that the proposed method clearly improves the recall of prosodic boundary prediction.

Key words: Uyghur language, prosodic boundary, hierarchical approach, punctuation, morphological feature
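
The two-layer bottom-up idea in the abstract can be illustrated with a minimal Python sketch. It shows only the data flow: the first layer labels prosodic word boundaries, and the second layer reuses those labels as features when labeling prosodic phrase boundaries. Everything concrete here is an assumption made for illustration and is not taken from the paper: the feature names, the fixed-length suffix used as a stand-in for a morphological feature, the `B`/`NB` label set, and the rule-based `toy_pw_tagger` and `toy_pph_tagger` placeholders that stand where trained CRF models would go. The punctuation-unit model is not sketched.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Features = Dict[str, str]
# A tagger maps one feature dict per word to one label per word.
# In practice this would be a trained CRF; here it is any callable.
Tagger = Callable[[List[Features]], List[str]]


def word_features(words: Sequence[str], i: int, suffix_len: int = 2) -> Features:
    """Layer-1 features for word i (hypothetical feature set)."""
    word = words[i]
    return {
        "word": word,
        # Crude stand-in for a morphological feature: the last few characters.
        "suffix": word[-suffix_len:],
        "prev": words[i - 1] if i > 0 else "<BOS>",
        "next": words[i + 1] if i + 1 < len(words) else "<EOS>",
    }


def predict_boundaries(
    words: Sequence[str], pw_tagger: Tagger, pph_tagger: Tagger
) -> Tuple[List[str], List[str]]:
    """Run the two layers bottom-up and return (PW labels, PPH labels)."""
    # Layer 1: prosodic word (PW) boundaries from lexical features only.
    layer1 = [word_features(words, i) for i in range(len(words))]
    pw_labels = pw_tagger(layer1)

    # Layer 2: prosodic phrase (PPH) boundaries; the predicted PW labels
    # of the current and previous word are added as extra features.
    layer2 = []
    for i, feats in enumerate(layer1):
        extended = dict(feats)
        extended["pw"] = pw_labels[i]
        extended["pw_prev"] = pw_labels[i - 1] if i > 0 else "<BOS>"
        layer2.append(extended)
    pph_labels = pph_tagger(layer2)
    return pw_labels, pph_labels


def toy_pw_tagger(feats: List[Features]) -> List[str]:
    """Placeholder rule, not a CRF: long words end a prosodic word."""
    return ["B" if len(f["word"]) > 3 else "NB" for f in feats]


def toy_pph_tagger(feats: List[Features]) -> List[str]:
    """Placeholder rule, not a CRF: a sentence-final PW boundary ends a phrase."""
    return ["B" if f["pw"] == "B" and f["next"] == "<EOS>" else "NB" for f in feats]


if __name__ == "__main__":
    # Toy romanized input used only to exercise the pipeline.
    sentence = ["men", "mektepke", "bardim"]
    pw, pph = predict_boundaries(sentence, toy_pw_tagger, toy_pph_tagger)
    for word, pw_label, pph_label in zip(sentence, pw, pph):
        print(word, pw_label, pph_label)
```

Passing the taggers in as callables keeps the sketch independent of any particular CRF toolkit; a trained model could be wrapped in a function with the same signature.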

CLC number: