作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (5): 288-290. doi: 10.3969/j.issn.1000-3428.2011.05.098

• 开发研究与设计技术 • 上一篇    下一篇

基于语音识别和语速修改的语音复读系统

梁青青,杨鸿武,郭威彤,裴 东   

  1. (西北师范大学物理与电子工程学院,兰州 730070)
  • 出版日期:2011-03-05 发布日期:2012-10-31
  • 作者简介:梁青青(1983-),女,硕士研究生,主研方向:语音识 别;杨鸿武(通信作者),教授、博士;郭威彤,助理实验师、硕士;裴 东,副教授
  • 基金资助:
    国家自然科学基金资助面上项目(60875015);教育部科 学研究重点基金资助项目(208146)

Speech Repeating System Based on Speech Recognition and Speaking Rate Modification

LIANG Qing-qing, YANG Hong-wu, GUO Wei-tong, PEI Dong   

  1. (College of Physics and Electronic Engineering, Northwest Normal University, Lanzhou 730070, China)
  • Online:2011-03-05 Published:2012-10-31

摘要: 针对英语学习中的听力练习问题,利用语速修改算法和大词表连续语音识别算法,实现一个面向英语学习的语速可变、字幕同步的数字复读系统,根据字幕选择相应的语音进行复读,并实时调整语速。MOS评测结果表明,系统调节语速后的语音平均MOS得分为4.1,接近原始语音质量。语音识别结果显示,系统对英语听力材料中纯净语音的识别率达到70.8%,能够满足英语听力学习的需要。

关键词: 语速调节, 语音识别, 字幕同步, 语音复读

Abstract: Aiming at the problem of listening exercise in English learning, this paper proposes a speech repeating system which adjusts the speaking rate with TD-PSOLA algorithm and can display the subtitle of speech by large vocabulary connected speech recognition method. With the system, users can select the speech to repeat by selecting subtitle and modify speaking rate of selected speech real-time. The modified speech by system achieve 4.1 of the average Mean Opinion Score(MOS), which is close to the quality of the original voice. Result of speech recognition evaluation shows that the word level accuracy of speech recognition on pure English learning material is 70.8%.

Key words: speaking rate modification, speech recognition, subtitle synchronization, speech repeating

中图分类号: