|
融合LSTM和注意力机制的音乐分类推荐方法
|
Abstract:
[1] | 刘杨. 个性化音乐推荐系统的研究与实现[M]. 北京: 北京邮电大学, 2014. |
[2] | 陈雅茜. 音乐推荐系统及相关技术研究[J]. 计算机工程与应用, 2012, 48(18): 9-16. |
[3] | Ness, S.R., Theocharis, A., Tzanetakis, G., et al. (2009) Im-proving Automatic Music Tag Annotation Using Stacked Generalization of Probabilistic SVM Outputs. International Conference on Multimedia, Vancouver, October 2009, 705-708. https://doi.org/10.1145/1631272.1631393 |
[4] | Huang, Y.S., Chou, S.Y. and Yang, Y.H. (2018) Pop Music Highlighter: Marking the Emotion Keypoints. Audio and Speech Processing. |
[5] | Mirsamadi, S., Barsoum, E. and Zhang, C. (2017) Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, 19 June 2017, 2227-2231. https://doi.org/10.1109/ICASSP.2017.7952552 |
[6] | Piczak, K.J. (2015) Environmental Sound Classi-fication with Convolutional Neural Networks. IEEE 25th International Workshop on Machine Learning for Signal Pro-cessing (MLSP), Boston, MA, 1-6.
https://doi.org/10.1109/MLSP.2015.7324337 |
[7] | Zhang, Z., Xu, S., Cao, S., et al. (2018) Deep Convolutional Neural Network with Mixup for Environmental Sound Classification. Chinese Conference on Pattern Recognition and Computer Vision (PRCV), Springer, Cham, 356-367.
https://doi.org/10.1007/978-3-030-03335-4_31 |
[8] | Hinto, G., Deng, L., Yu, D., et al. (2012) Deep Neural Net-works for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups. IEEE Signal Pro-cessing Magazine, 29, 82-97.
https://doi.org/10.1109/MSP.2012.2205597 |
[9] | Van Den Oord, A., Dieleman, S., Zen, H., et al. (2016) Wavenet: A Generative Model for Raw Audio. SSW, 125. arXiv:1609.03499. |
[10] | Palo, H.K., Mohanty, M. and Chandra, M. (2015) Computational Vision and Robotics. Advances in Intelligent Systems and Computing, 332, 63-70. |
[11] | Roddy, C. (2001) Emotion Recognition in Human-Computer Interaction. Signal Processing Magazine, 18, 32-80.
https://doi.org/10.1109/79.911197 |
[12] | 张燕, 唐振民, 李燕萍. 面向推荐系统的音乐特征抽取[J]. 计算机工程与应用, 2011, 47(5): 130-133. |
[13] | Zhang, L., Wu, D., Han, X., et al. (2016) Feature Extraction of Under-water Target Signal Using Mel Frequency Cepstrum Coefficients Based on Acoustic Vector Sensor. Journal of Sensors, 4, 1-11.
https://doi.org/10.1155/2016/7864213 |
[14] | Gers, F.A., Schmidhube, J. and Cummins, F. (1999) Learning to Forget: Continual Prediction with LSTM. 9th International Conference on Artificial Neural Networks: ICANN’99, 850-855. https://doi.org/10.1049/cp:19991218 |