作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (8): 26-27. doi: 10.3969/j.issn.1000-3428.2009.08.009

• 博士论文 • 上一篇    下一篇

基于听觉模型的特征在英语重音检测中的应用

陈 楠,贺前华,李 韬   

  1. (华南理工大学电子与信息学院,广州 510641)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-04-20 发布日期:2009-04-20

Application of Auditory Model-based Feature in English Lexical Stress Detection

CHEN Nan, HE Qian-hua, LI Tao    

  1. (School of Electronic and Information, South China University of Technology, Guangzhou 510641)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-04-20 Published:2009-04-20

摘要: 对于英语等“重音节拍语言”,重音是一个非常重要的韵律学特征。从听觉模型的角度出发,利用基音同步幅度峰值特征能同时表征瞬时频率和强度信息的特点进行重音检测。使用基音同步幅度峰值特征以及与传统特征的组合对英语连续语音的试验结果表明,新特征能使系统误识率降低1.5%。

关键词: 重音检测, 听觉模型, 基音同步幅度峰值

Abstract: Stress is an important prosodic feature for stress-timed language such as English. This paper presents a new approach using auditory model-based Pitch Synchronous Peak Amplitude(PSPA) feature, which incorporates frequency and intensity information in English lexical stress detection. PSPA feature, along with traditional features and their combinations to English lexical stress detection are evaluated with ISLE database. Experimental results show that the combination of new feature and traditional features demonstrates a 1.5% error rate reduction.

Key words: stress detection, auditory model, Pitch Synchronous Peak Amplitude(PSPA)

中图分类号: