[1] 李海峰, 陈婧, 马琳, 等.维度语音情感识别研究综述[J].软件学报, 2020, 31(8):2465-2491. LI H F, CHEN J, MA L, et al.Dimensional speech emotion recognition review[J].Journal of Software, 2020, 31(8):2465-2491.(in Chinese) [2] 王忠民, 刘戈, 宋辉.基于多核学习特征融合的语音情感识别方法[J].计算机工程, 2019, 45(8):248-254. WANG Z M, LIU G, SONG H.Speech emotion recognition method based on multiple kernel learning feature fusion[J].Computer Engineering, 2019, 45(8):248-254.(in Chinese) [3] MAO Q R, DONG M, HUANG Z W, et al.Learning salient features for speech emotion recognition using convolutional neural networks[J].IEEE Transactions on Multimedia, 2014, 16(8):2203-2213. [4] MAO Q R, DONG M, HUANG Z W, et al.Learning salient features for speech emotion recognition using convolutional neural networks[J].IEEE Transactions on Multimedia, 2014, 16(8):2203-2213. [5] XIE Y, LIANG R Y, LIANG Z L, et al.Speech emotion classification using attention-based LSTM[J].IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019, 27(11):1675-1685. [6] ZHANG T, ZHENG W M, CUI Z, et al.Spatial-temporal recurrent neural network for emotion recognition[J].IEEE Transactions on Cybernetics, 2019, 49(3):839-847. [7] 张会云, 黄鹤鸣.基于异构并行神经网络的语音情感识别[J].计算机工程, 2022, 48(4):113-118. ZHANG H Y, HUANG H M.Speech emotion recognition based on heterogeneous parallel neural network[J].Computer Engineering, 2022, 48(4):113-118.(in Chinese) [8] JIANG P X, XU X Z, TAO H W, et al.Convolutional-recurrent neural networks with multiple attention mechanisms for speech emotion recognition[J].IEEE Transactions on Cognitive and Developmental Systems, 2022, 14(4):1564-1573. [9] LI H Q, TU M, HUANG J, et al.Speaker-invariant affective representation learning via adversarial training[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing.Washington D.C., USA:IEEE Press, 2020:7144-7148. [10] FAN W, XU X, XING X, et al.Adaptive domain-aware representation learning for speech emotion recognition[C]//Proceedings of INTERSPEECH.Shanghai, China:[s.n.], 2020:4089-4093. [11] CHEN Y P, DAI X Y, LIU M C, et al.Dynamic convolution:attention over convolution kernels[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2020:11027-11036. [12] LI J J, CHEN E P, DING Z M, et al.Maximum density divergence for domain adaptation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(11):3918-3930. [13] HOU Q B, ZHOU D Q, FENG J S.Coordinate attention for efficient mobile network design[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2021:13708-13717. [14] WANG K X, AN N, LI B N, et al.Speech emotion recognition using Fourier parameters[J].IEEE Transactions on Affective Computing, 2015, 6(1):69-75. [15] YI L, MAK M W.Improving speech emotion recognition with adversarial data augmentation network[J].IEEE Transactions on Neural Networks and Learning Systems, 2022, 33(1):172-184. [16] BUSSO C, BULUT M, LEE C C, et al.IEMOCAP:interactive emotional dyadic motion capture database[J].Language Resources and Evaluation, 2008, 42(4):335-359. [17] SUN Y, WEN G, WANG J.Weighted spectral features based on local Hu moments for speech emotion recognition[J].Biomedical Signal Processing and Control, 2015, 18:80-90. [18] WEN G, LI H, HUANG J, et al.Random deep belief networks for recognizing emotions from speech signals[J].Computational Intelligence and Neuroscience, 2017, 2017:1945630. [19] JIANG P X, FU H L, TAO H W, et al.Parallelized convolutional recurrent neural network with spectral features for speech emotion recognition[J].IEEE Access, 2019, 7:90368-90377. [20] LI S Z, XING X F, FAN W Q, et al.Spatiotemporal and frequential cascaded attention networks for speech emotion recognition[J].Neurocomputing, 2021, 448:238-248. [21] STUHLSATZ A, MEYER C, EYBEN F, et al.Deep neural networks for acoustic emotion recognition:raising the benchmarks[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing.Washington D.C., USA:IEEE Press, 2011:5688-5691. [22] CHEN M Y, HE X J, YANG J, et al.3-D convolutional recurrent neural networks with attention model for speech emotion recognition[J].IEEE Signal Processing Letters, 2018, 25(10):1440-1444. [23] ZHANG S Q, ZHANG S L, HUANG T J, et al.Speech emotion recognition using deep convolutional neural network and discriminant temporal pyramid matching[J].IEEE Transactions on Multimedia, 2018, 20(6):1576-1590. [24] MIRSAMADI S, BARSOUM E, ZHANG C.Automatic speech emotion recognition using recurrent neural networks with local attention[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing.Washington D.C., USA:IEEE Press, 2017:2227-2231. [25] ZHAO Z P, ZHENG Y, ZHANG Z X, et al.Exploring spatio-temporal representations by integrating attention-based bidirectional-LSTM-RNNs and FCNs for speech emotion recognition[C]//Proceedings of INTERSPEECH.Hyderabad, India:[s.n.], 2018:272-276. [26] ANDO A, MASUMURA R, KAMIYAMA H, et al.Speech emotion recognition based on multi-label emotion existence model[C]//Proceedings of INTERSPEECH.Graz, Austria:[s.n.], 2019:2818-2822. |