Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering

Previous Articles     Next Articles

Improved Waveform Similarity Overlap-and-Add Time Warping Algorithm Based on Speech Turning Point Detection

LEI Yingsi,YANG Yan   

  1. (School of Electronic and Information Engineering,Lanzhou Jiaotong University,Lanzhou 730070,China)
  • Received:2014-08-11 Online:2015-10-15 Published:2015-10-15

基于语音转折点检测的改进波形相似叠加时长规整算法

雷颖思,杨燕   

  1. (兰州交通大学电子与信息工程学院,兰州 730070)
  • 作者简介:雷颖思(1989-),女,硕士研究生,主研方向:语音信号处理,数字图像处理;杨燕,副教授、博士。
  • 基金资助:
    甘肃省科技厅自然科学基金资助项目(1310RJZA050)。

Abstract: The Waveform Similarity Overlap-and-Add(WSOLA) algorithm neglects the perceptual characteristics of real sound speech signals,and employs uniform time scaling of the entire signal.When sampling rate is low or scaling proportion is large,the scale quality is degraded.Aiming at such problems,an enhanced WSOLA algorithm is proposed through analyzing the acoustic prediction characteristics of human auditory system.This method detects the turning points of the speech using a subband spectrum entropy measure and leaves them intact to ensure the turning points undamaged,while time scaling the remainder of the signal.A local compensate measure is further put forward to correct the whole scale accuracy.Simulation results show that the new algorithm improves the natural degree of the synthetic speech signals with the whole scale proportion unchanged.

Key words: time warping algorithm, Waveform Similarity Overlap-and-Add(WSOLA) algorithm, acoustic prediction, turning point detection, subband spectrum entropy, local compensation method

摘要: 波形相似叠加算法忽略语音本身感知特性,对整段语音统一规整,在采样率较低或规整比例较大时处理效果不佳。为此,通过分析人耳听觉系统的预测特点,提出一种改进的波形相似叠加时长规整算法。采用子带谱熵法检测出语音的转折部分并保持其不变,以保证转折区的语音信息不受损坏,并给出一种局部补偿法以修正整体规整精度。仿真结果表明,该算法在整体规整比例不变的情况下可提高合成语音的自然度。

关键词: 时长规整算法, 波形相似叠加算法, 听觉预测, 转折点检测, 子带谱熵, 局部补偿法

CLC Number: