Author:
Jermsittiparsert Kittisak,Abdurrahman Abdurrahman,Siriattakul Parinya,Sundeeva Ludmila A.,Hashim Wahidah,Rahim Robbi,Maseleno Andino
Publisher
Springer Science and Business Media LLC
Subject
Computer Vision and Pattern Recognition,Linguistics and Language,Human-Computer Interaction,Language and Linguistics,Software
Reference16 articles.
1. Cakır, E., Heittola, T., & Virtanen, T. (2016). Domestic audio tagging with convolutional neural networks. In IEEE AASP challenge on detection and classification of acoustic scenes and events (DCASE 2016), (pp. 1–2).
2. Cakır, E., Parascandolo, G., Heittola, T., Huttunen, H., & Virtanen, T. (2017). Convolutional recurrent neural networks for polyphonic sound event detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1291–1303.
3. Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition, (pp. 1251–1258).
4. El Ayadi, M., Kamel, M. S., & Karray, F. (2011). Survey on speech emotion recognition: Features, classification schemes, and databases. Pattern Recognition, 44(3), 572–587.
5. Hershey, S., Chaudhuri, S., Ellis, D.P., Gemmeke, J. F., Jansen, A., Moore, R. C., Plakal, M., Platt, D., Saurous, R. A., & Seybold, B. et al. (2017). CNN architectures for largescale audio classification. In 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP), (pp. 131–135). IEEE.
Cited by
34 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献