Computer Engineering (计算机工程) ›› 2006, Vol. 32 ›› Issue (10): 189-190, 220.

• Artificial Intelligence and Recognition Technology •

A Speech Database Construction and Unit Selection Method Based on a Decision Tree Model

叶振兴,蔡莲红   

  1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084
  • Publication date: 2006-05-20 Release date: 2006-05-20

A Decision Tree Based Approach for Construction of Speech Database and Unit Selection

YE Zhenxing, CAI Lianhong   

  1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084
  • Online: 2006-05-20 Published: 2006-05-20

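Abstract: To address the small storage capacity and limited computing power of embedded devices, a unit pre-selection algorithm and a unit selection algorithm based on the CART (Classification and Regression Trees) decision tree model are designed. They pick the most representative unit samples from the original speech corpus, which effectively reduces both the size of the speech database and the complexity of the algorithm and meets the needs of embedded TTS (Text-to-Speech) systems. Based on these algorithms, an embedded Chinese TTS system is implemented on a mobile terminal; experimental results show that its synthesized speech has high intelligibility and naturalness.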

Keywords: unit pre-selection; unit selection; decision tree; text-to-speech; embedded system

Abstract: This paper describes a CART-based unit pre-selection algorithm, which selects the most representative units from the original speech corpus to reduce the footprint of the unit inventory, and a low-complexity unit selection algorithm. Based on these two algorithms, an embedded Chinese TTS system is implemented on a mobile device. Experiments show that the system produces speech with high intelligibility and naturalness.

Key words: Unit pre-selection; Unit selection; Decision tree; Text-to-speech (TTS); Embedded system
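
The abstract does not give the details of the pre-selection algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each unit is a pair of context features and a single acoustic value (here a hypothetical pitch in Hz), grows a small CART-style regression tree by asking yes/no questions about the context, and then keeps the unit closest to the mean of each leaf. All names (`best_split`, `grow_leaves`, `preselect`), the feature set, and the toy corpus are illustrative assumptions.

```python
from statistics import mean

# Hypothetical unit record: (context_features: dict[str, str], pitch: float).

def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)

def best_split(units, min_leaf):
    """Return (gain, feature, value) for the question 'feature == value'
    that most reduces the squared error, or None if no split is allowed."""
    parent = sse([p for _, p in units])
    questions = {(f, v) for ctx, _ in units for f, v in ctx.items()}
    best = None
    for feature, value in sorted(questions):
        yes = [p for ctx, p in units if ctx.get(feature) == value]
        no = [p for ctx, p in units if ctx.get(feature) != value]
        if len(yes) < min_leaf or len(no) < min_leaf:
            continue  # both children must keep at least min_leaf units
        gain = parent - sse(yes) - sse(no)
        if best is None or gain > best[0]:
            best = (gain, feature, value)
    return best

def grow_leaves(units, min_leaf=2, min_gain=1e-9):
    """Recursively split the units and return the list of leaf clusters."""
    split = best_split(units, min_leaf)
    if split is None or split[0] <= min_gain:
        return [units]
    _, feature, value = split
    yes = [u for u in units if u[0].get(feature) == value]
    no = [u for u in units if u[0].get(feature) != value]
    return grow_leaves(yes, min_leaf, min_gain) + grow_leaves(no, min_leaf, min_gain)

def preselect(units, keep_per_leaf=1, min_leaf=2):
    """Keep the units nearest to each leaf's mean pitch (assumed criterion)."""
    kept = []
    for leaf in grow_leaves(units, min_leaf):
        center = mean(p for _, p in leaf)
        ranked = sorted(leaf, key=lambda u: abs(u[1] - center))
        kept.extend(ranked[:keep_per_leaf])
    return kept

# Toy corpus: six samples of one syllable in different contexts.
corpus = [
    ({"tone": "1", "position": "initial"}, 220.0),
    ({"tone": "1", "position": "final"}, 210.0),
    ({"tone": "1", "position": "initial"}, 225.0),
    ({"tone": "3", "position": "initial"}, 150.0),
    ({"tone": "3", "position": "final"}, 140.0),
    ({"tone": "3", "position": "final"}, 145.0),
]

if __name__ == "__main__":
    for ctx, pitch in preselect(corpus):
        print(ctx, pitch)
```

On this toy corpus the tree splits once, on tone, and one unit per leaf survives, so six samples shrink to two. A real system would use richer context features and a multi-dimensional acoustic distance; the trade-off between `keep_per_leaf` and database size is the knob that matters for an embedded footprint.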