结合分层条件随机场与标点符号的维吾尔语韵律边界预测

doi:10.3969/j.issn.1000-3428.2015.11.051

计算机工程

结合分层条件随机场与标点符号的维吾尔语韵律边界预测

姑丽加玛丽·麦麦提艾力 ^1a,艾斯卡尔·肉孜 ^2a,古力米热·依玛木^1b,艾斯卡尔·艾木都拉 ^2b

(1.新疆师范大学 a.数学科学学院; b.文学院,乌鲁木齐 830017; 2.新疆大学 a.数学与系统科学学院; b.软件学院,乌鲁木齐 830046)

收稿日期:2014-10-22 出版日期:2015-11-15 发布日期:2015-11-13
作者简介:姑丽加玛丽·麦麦提艾力(1984-),女,副教授、博士,主研方向:语音和语言处理;艾斯卡尔·肉孜,讲师;古力米热·依玛木,副教授;艾斯卡尔·艾木都拉,教授。
基金资助:
国家自然科学基金资助项目(61462087);教育部社科基金资助项目(10YJA740027);新疆维吾尔自治区高校科研计划基金资助项目(XJEDU2013S27);新疆师范大学博士、博士后科研启动基金资助项目(XJNUBS1308)。

Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation

Guljamal Mamateli ^1a,Askar Rozi ^2a,Gulmire Imam ^1b,Askar Hamdulla ^2b

(1a.School of Mathematical Sciences; 1b.School of Literature,Xinjiang Normal University,Urumqi 830017,China;2a.School of Mathematics and System Science; 2b.School of Software,Xinjiang University,Urumqi 830046,China)

Received:2014-10-22 Online:2015-11-15 Published:2015-11-13

摘要/Abstract

摘要： 韵律结构的正确预测是高自然度语音合成系统的重要组成部分。针对维吾尔语的黏着性特点,给出其相应的韵律层次结构,采用基于条件随机场(CRF)的分层自底向上方法预测维吾尔语的韵律词和韵律短语边界,并将维吾尔语形态特征作为韵律边界预测模型的重要特征。为进一步纠正韵律边界预测错误并消除标点符号边界中不同韵律边界之间的歧义,以标点符号边界为单位建立基于CRF的标点符号韵律边界预测模型,并与双层自底向上CRF模型相结合,提出一种韵律边界预测方法。通过对不同的特征模板和模型进行反复实验,以得到更好的韵律边界预测性能。实验结果表明,该方法明显提高了韵律边界的预测召回率。

关键词: 维吾尔语, 韵律边界, 分层方法, 标点符号, 形态特征

Abstract: Correct prosodic boundary prediction is crucial for the quality of synthesized speech.This paper presents the prosodic hierarchy of Uyghur language which belongs to agglutinative language.A two-layer bottom-up hierarchical approach based on Conditional Random Field(CRF) is used for predicting prosodic word and prosodic phrase boundaries.Morphological features are considered useful for prosodic boundary prediction and added into the feature sets.In order to further enhance the accuracy of prosodic boundary prediction at punctuation sites,CRF based prosodic boundary determination method is used and integrated with bottom-up hierarchical approach.Consequently,the best prosodic boundary prediction performance is achieved by large and repeated experiment of different feature sets and different models.Experimental results show that the proposed method obviously improves the recall rate prediction of the prosodic boundary.

Key words: Uyghur language, prosodic boundary, hierarchical approach, punctuation, morphological feature

中图分类号:

TP311

姑丽加玛丽·麦麦提艾力,艾斯卡尔·肉孜,古力米热·依玛木,艾斯卡尔·艾木都拉. 结合分层条件随机场与标点符号的维吾尔语韵律边界预测[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2015.11.051.

Guljamal Mamateli,Askar Rozi,Gulmire Imam,Askar Hamdulla. Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2015.11.051.

http://www.ecice06.com/CN/Y2015/V41/I11/299

参考文献

参考文献［1］Taylor P,Black A W.Assigning Phrase Breaks from Part-of-Speech Sequences［J］.Computer Speech and Language,1998,12:99-117. ［2］Yang Chenyu,Ling Zhenhua,Dai Lirong.Unsupervised Pro-sodic Phrase Boundary Labeling of Mandarin Speech Syn-thesis Database Using Context-dependent HMM［C］//Pro-ceedings of the 38th IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2013:6875-6879. ［3］Chu Min,Qian Yao.Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts［J］.Com-putational Linguistics and Chinese Language Processing,2001,6(1):61-82. ［4］Xu Dawei,Wang Haifeng,Li Guohua,et al.Parsing Hierarchical Prosodic Structure for Mandarin Speech Synthesis［C］//Proceedings of IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2006:1745-1748. ［5］Prahallad K,Raghavendra V,Black A.Learning Speaker-specific Phrase Breaks for Text-to-Speech Systems［C］//Proceedings of Speech Synthesis Workshop.Kyoto,Japan:［s.n.］,2010. ［6］李剑锋,胡国平,王仁华.基于最大熵模型的韵律短语边界预测［J］.中文信息学报,2004,18(5):149-160. ［7］Zhang Xiaonan,Xu Jun,Cai Lianhong.Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-driven Modification［C］//Proceedings of International Symposium on Chinese Spoken Language Processing.Singapore:［s.n.］,2006:149-160. ［8］Liu Fangzhou,Jia Huibin,Tao Jianhua.A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin［C］//Proceedings of the 6th International Symposium on Chinese Spoken Language Processing.Kunming,China:［s.n.］,2008:253-256. (下转第307页) (上接第302页) ［9］古力米热·依玛木,艾斯卡尔·艾木都拉.维吾尔语句韵律层级的人工标注规则研究［C］//第三届全国少数民族青年自然语言信息处理学术研讨会论文集.乌鲁木齐:［出版者不详］,2010:179-182. ［10］Lafferty J,McCallum A,Pereira F.Conditional Random Fields:Probabilistic Models for Segmenting and Labeling Sequence Data［C］//Proceedings of the 18th Inter-national Conference on Machine Learning.Burlington,USA:Morgan Kaufmann Publishers,2001:282-289. ［11］Zhao Ziping,Ma Xirong.Active Learning for the Prediction of Prosodic Phrase Boundaries in Chinese Speech Synthesis Systems Using Conditional Random Fields［C］//Proceedings of the 16th International Con-ference on Software Engineering,Artificial Intelligence,Networking and Parallel/Distributed Computing.Taka-matsu,Japan:［s.n.］,2015:1-5. ［12］Fernandez R,Ramabhadran B.Discriminative Training and Unsupervised Adaptation for Labeling Prosodic Events with Limited Training Data［C］//Proceedings of Conference on International Speech Communication Association.Makuhari,Japan:［s.n.］,2010:1429-1432. ［13］Chiang Chenyu,Wang Yih-Ru,Chen Sin-Horng.Punctuation Generation Inspired Linguistic Features for Mandarin Prosodic Boundary Prediction［C］//Pro-ceedings of the 37th IEEE International Conference on Acoustics,Speech and Signal Processing.Washington D.C.,USA:IEEE Press,2012:4597-4600. 编辑顾逸斐

[1]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[2]	段大高, 梁少虎, 赵振东, 韩忠明. 基于自注意力机制的中文标点符号预测模型[J]. 计算机工程, 2020, 46(5): 291-297.
[3]	穆妮热·穆合塔尔, 李晓, 杨雅婷. 维吾尔语复杂形态对汉维机器翻译的影响研究[J]. 计算机工程, 2020, 46(2): 309-314.
[4]	塞麦提·麦麦提敏, 司马义·阿不都热依木. 维吾尔语停用词抽取方法研究[J]. 计算机工程, 2019, 45(10): 288-292,300.
[5]	王淑媛,田生伟,禹龙,冯冠军,艾山·吾买尔,李圃,赵建国. 基于堆栈降噪自编码的维吾尔语事件共指关系识别[J]. 计算机工程, 2018, 44(6): 305-310.
[6]	罗延根,李晓,蒋同海,杨雅婷,周喜,王磊. 基于词向量的维吾尔语词项归一化方法[J]. 计算机工程, 2018, 44(2): 220-225.
[7]	王俊超,黄浩,徐海华,胡英. 基于迁移学习的低资源度维吾尔语语音识别[J]. 计算机工程, 2018, 44(10): 281-285,291.
[8]	热依莱木·帕尔哈提,孟祥涛,艾斯卡尔·艾木都拉. 基于区分性关键词模型的维吾尔文本情感分类[J]. 计算机工程, 2014, 40(10): 132-136,142.
[9]	姑丽加玛丽.麦提艾力a, 艾斯卡尔.孜b, 古丽娜尔.力a, 艾斯卡尔.木都拉a. 基于分类及最佳匹配读音的维吾尔多音词消歧[J]. 计算机工程, 2013, 38(18): 22-25.
[10]	黄俊, 田生伟, 禹龙, 冯冠军. 基于维吾尔语情感词的句子情感分析[J]. 计算机工程, 2012, 38(9): 183-185.
[11]	胡什乃再尔?阿尔斯兰, 古丽娜尔?艾力, 艾斯卡尔?艾木都拉. 基于自动机的喀什方言音位变化规则研究[J]. 计算机工程, 2012, 38(20): 176-178.
[12]	武晓敏, 达瓦?伊德木草, 吾守尔?斯拉木. 自然语料缺乏的民族语言连续语音识别[J]. 计算机工程, 2012, 38(12): 129-131.
[13]	热娜古丽?达古提, 艾斯卡尔?艾木都拉, 地里木拉提?吐尔逊. 维吾尔语CVC型音节韵律特征声学分析[J]. 计算机工程, 2011, 37(9): 193-195.
[14]	禹龙, 田生伟, 冯冠军. 维吾尔语情感词汇自动识别[J]. 计算机工程, 2011, 37(7): 213-215.
[15]	薛化建, 董兴华, 周喜, 吐尔洪?吾司曼, 李晓. 基于子字单元的维吾尔语语音识别研究[J]. 计算机工程, 2011, 37(20): 208-210.

选择文件类型/文献管理软件名称

选择包含的内容

结合分层条件随机场与标点符号的维吾尔语韵律边界预测

Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

结合分层条件随机场与标点符号的维吾尔语韵律边界预测

Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价