作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (2): 169-171. doi: 10.3969/j.issn.1000-3428.2012.02.055

• 人工智能及识别技术 • 上一篇    下一篇

语音情感中基于ZCPA的VAP模型

秦宇强 1,2,张雪英 1   

  1. (1. 太原理工大学信息工程学院,太原 030024;2. 太原科技大学经济与管理学院,太原 030024)
  • 收稿日期:2011-07-12 出版日期:2012-01-20 发布日期:2012-01-20
  • 作者简介:秦宇强(1976-),男,讲师、博士,主研方向:情绪语音识别;张雪英,教授、博士生导师
  • 基金资助:
    山西省自然科学基金资助项目(2010011020-1);山西省国际科技合作基金资助项目(2011081047)

ZCPA-based VAP Model in Speech Emotion

QIN Yu-qiang 1,2, ZHANG Xue-ying 1   

  1. (1. College of Information Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2. College of Economics and Management, Taiyuan University of Science and Technology, Taiyuan 030024, China)
  • Received:2011-07-12 Online:2012-01-20 Published:2012-01-20

摘要: 分析一个基于心理学的情感空间模型原理。研究语音情感识别中7种情感(中性、喜悦、愤怒、惊讶、恐惧、悲伤和厌恶)的效价-激励-能量(VAP)维分布状况,根据过零峰值幅度(ZCPA)的最大值、最小值、均值和绝对值方差和,在VAP三维空间中分析维数水平和 ZCPA韵律特征之间的关系。实验结果表明,该情感空间模型原理有助于描述和区分各种语音情感。

关键词: 语音情感识别, 效价维, 激励维, 能量维, 过零峰值幅度

Abstract: This paper presents a conception of emotion space modeling using psychological research for reference. Based on this conception, this paper studies the Valence-Arousal-Power(VAP) distribution of the seven emotions for speech emotional recognition, including joy, anger, surprise, fear, disgust, sadness and neutral, in the three dimensional space of VAP, and analyses the relationship between the dimensional ratings and the Zero Crossings with Peak Amplitudes(ZCPA) prosodic characteristics in terms of maximum, minimum, mean and absolute square difference sum of ZCPA. Experimental results show that the conception of emotion modeling is helpful to describe and distinguish speech emotions.

Key words: speech emotional recognition, valence dimension, arousal dimension, power dimension, Zero Crossings with Peak Amplitudes (ZCPA)

中图分类号: