作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (17): 240-242. doi: 10.3969/j.issn.1000-3428.2010.17.082

• 多媒体技术及应用 • 上一篇    下一篇

改进的BIC说话人分割算法

郑继明1,张 萍2   

  1. (1. 重庆邮电大学数理学院,重庆 400065;2. 重庆邮电大学计算机科学与技术学院,重庆 400065)
  • 出版日期:2010-09-05 发布日期:2010-09-02
  • 作者简介:郑继明(1963-),男,副教授,主研方向:小波分析,多媒体技术;张 萍,硕士研究生
  • 基金资助:
    重庆市教育委员会科学技术研究基金资助项目(KJ080524)

Improved BIC Algorithm for Speaker Segmentation

ZHENG Ji-ming1, ZHANG Ping2   

  1. (1. College of Mathematics and Physics, Chongqing University of Posts and Telecommunications, Chongqing 400065; 2. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065)
  • Online:2010-09-05 Published:2010-09-02

摘要: 针对多人说话改变点检测问题,提出一种改进的BIC说话人分割算法。采用固定窗BIC算法对音频流进行分割,利用基于递归的分割算法和变长窗口的BIC算法确认潜在的分割点。实验结果表明,与其他BIC算法相比,该算法的准确率、召回率和综合性能较高。

关键词: BIC准测, 广播音频分割, 准确率, 召回率

Abstract: To detect the voice change of speakers, an improved Bayesian Information Criterion(BIC) algorithm for speaker segmentation is proposed in this paper. The algorithm detects potential acoustic changes using fix-windows BIC algorithm, and validates the potential acoustic changes by two methods. One is the variable-size analysis window BIC algorithm, the other is based on the recursion of segmentation algorithm. Experimental result shows that the algorithm achieves better results, and compared with the BIC algorithm, the precision ratio, recall ratio and F-measure are improved.

Key words: Bayesian Information Criterion(BIC), broadcasting audio segmentation, precision ratio, recall ratio

中图分类号: