
Computer Engineering, 2011, Vol. 37, Issue (19): 189-190, 197. doi: 10.3969/j.issn.1000-3428.2011.19.062

• Networks and Communications •

Multi-label Classification for Song Emotion Combined with TF-IDF

SUN Xiang-kun, DENG Wei   

  1. (School of Computer Science & Technology, Soochow University, Suzhou 215006, China)
  • Received: 2011-04-11 Online: 2011-10-05 Published: 2011-10-05

结合TF-IDF的歌曲情感多标记分类

孙向琨,邓 伟   

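  1. (School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China)
  • About the authors: SUN Xiang-kun (born 1984), female, master's student, main research interests: image processing and pattern recognition; DENG Wei (corresponding author), associate professor, Ph.D.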

Abstract: This paper proposes a new method that combines the music content and the lyrics of songs for emotion classification. A multi-label k-Nearest Neighbor (kNN) algorithm based on the angle between two vectors is applied to the emotional classification of music content using acoustic features. Term Frequency-Inverse Document Frequency (TF-IDF) rules are applied to the lyrics, and lyric emotion scores are calculated as their emotional features. The combined method uses the correct labels obtained from the lyrics to correct the wrong labels obtained from the music content. In experiments on 396 English songs, the new method raises the accuracy of the original test from 69% to 74%.
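The two building blocks named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so everything here is an assumption: the standard tf × log(N/df) weighting, the toy lyrics, the hypothetical emotion lexicon `lexicon`, and the helper names `tf_idf`, `emotion_scores`, and `angle`. It is not the authors' implementation.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: tf-idf weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            t: (c / len(doc)) * math.log(n / df[t])
            for t, c in counts.items()
        })
    return weights

def emotion_scores(weights, lexicon):
    """Sum the TF-IDF weights of the terms listed under each emotion label."""
    return {label: sum(weights.get(t, 0.0) for t in terms)
            for label, terms in lexicon.items()}

def angle(u, v):
    """Angle in radians between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    # Clamp to guard against rounding just outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, dot / norm)))

# Hypothetical toy data, not taken from the paper.
lyrics = [["love", "happy", "sun"],
          ["cry", "alone", "rain"],
          ["love", "cry", "night"]]
lexicon = {"happy": {"happy", "sun"}, "sad": {"cry", "alone", "rain"}}
scores = [emotion_scores(w, lexicon) for w in tf_idf(lyrics)]
```

In this sketch each song gets one score per emotion label, and a smaller `angle` between two acoustic feature vectors would mark them as closer neighbors. How the paper turns these quantities into label corrections is not stated in the abstract and is left out.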

Key words: multi-label classification, song emotion classification, multi-label k-Nearest Neighbor(kNN) algorithm, Term Frequency-Inverse Document Frequency(TF-IDF)

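Abstract (translated from the Chinese): A song emotion analysis method is proposed that combines Term Frequency-Inverse Document Frequency (TF-IDF) rules with multi-label classification. The music content of songs, represented by acoustic features, is classified with a multi-label k-nearest neighbor algorithm that uses vector angles. TF-IDF rules are applied to the lyrics to calculate lyric emotion scores, which serve as emotional features. The method is used to correct class labels that were misclassified for the lyric content. The algorithm is tested on 396 English songs, and the results show that, compared with other methods, it raises classification accuracy from 69% to 74%.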

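Key words (translated from the Chinese): multi-label classification, song emotion classification, multi-label k-nearest neighbor algorithm, term frequency-inverse document frequency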
