作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 开发研究与工程应用 • 上一篇    下一篇

基于依存信息融合特征的汉语韵律预测

李勇,王柳渝,魏珰   

  1. (重庆邮电大学 自动化学院,重庆 400065)
  • 收稿日期:2016-12-05 出版日期:2018-01-15 发布日期:2018-01-15
  • 作者简介:李勇(1976—),男,副教授、博士,主研方向为无线自组织网络、认知网络、网络协同;王柳渝(通信作者)、魏珰,硕士研究生。
  • 基金资助:
    国家科技重大专项(2015ZX03003011);2015年重庆市物联网产业共性关键技术创新主题专项(cstc2015zdcy-ztzx70007)。

Chinese Prosodic Prediction Based on Dependency Information Fusion Feature

LI Yong,WANG Liuyu,WEI Dang   

  1. (School of Automation,Chongqing University of Posts and Telecommunitions,Chongqing 400065,China)
  • Received:2016-12-05 Online:2018-01-15 Published:2018-01-15

摘要: 针对目前基于浅层语法特征和依存句法单特征的汉语韵律层级预测能力较弱的情况,提出一种改进的汉语韵律预测方法。通过从输入文本的依存句法分析结果中自动提取依存句法单特征,并对其中关键特征进行特征融合,得到依存信息融合特征。将依存句法单特征与融合特征进行韵律层级预测实验对比,选取最优的依存特征组合与浅层语法特征相结合,利用决策树C4.5算法实现韵律结构层级的预测。经过大量的语料训练和测试结果表明,依存信息融合特征相比依存句法单特征整体韵律层级的预测准确率均有所提升,相对于浅层语法特征,韵律词和韵律短语的预测准确率分别提高了5.8%和15.4%。

关键词: 依存句法, 融合特征, C4.5算法, 语料, 韵律词, 韵律短语

Abstract: Aiming at the weak ability of predicting the prosodic level based on the characteristics of shallow grammar and dependence-based syntactic single feature,this paper proposes an improved Chinese prosodic prediction method.By extracting the dependency syntactic single feature from the dependency parsing results of the input text and integrating key features,the dependent information fusion feature is obtained.The syntactic single feature is compared with the fusion feature through the prosodic hierarchical predictive experiment.The optimal dependence feature combination is combined with the shallow grammar feature,and the C4.5 algorithm is used to predict the prosodic hierarchy.After a lot of corpus training and testing,results show that the prosodic predictions of interdependent information fusion features are better than the predicative ones based on dependency syntax.Compared with rhyme hierarchical law prediction based on shallow grammar features,the accuracy of prosodic word is increased by 5.8%,and the precision of prosodic phrase is increased by 15.4%.

Key words: dependency syntax, fusion feature, C4.5 algorithm, corpus, prosodic word, prosodic phrase

中图分类号: