Spectrogram based multi-task audio classification-Reference-Cited by-同舟云学术

Spectrogram based multi-task audio classification

Published:2017-12-26 Issue:3 Volume:78 Page:3705-3722
ISSN:1380-7501
Container-title:Multimedia Tools and Applications
language:en
Short-container-title:Multimed Tools Appl

Author:

Zeng Yuni,Mao Hua,Peng Dezhong,Yi Zhang

Funder

National Natural Science Foundation of China

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Hardware and Architecture,Media Technology,Software

Link

http://link.springer.com/article/10.1007/s11042-017-5539-3/fulltext.html

Reference38 articles.

1. Amodei D, Anubhai R, Battenberg E, Case C, Casper J, Catanzaro B, Chen J, Chrzanowski M, Coates A, Diamos G, Elsen E, Engel J, Fan L, Fougner C, Hannun AY, Jun B, Han T, LeGresley P, Li X, Lin L, Narang S, Ng AY, Ozair S, Prenger R, Qian S, Raiman J, Satheesh S, Seetapun D, Sengupta S, Wang C, Yi W, Wang Z, Bo X, Xie Y, Yogatama D, Zhan J, Zhu Z (2016) Deep speech 2: End-to-end speech recognition in english and mandarin. In: Proceedings of the 33nd international conference on machine learning, pp 173–182

2. Boureau Y-L, Ponce J, LeCun Y (2010) A theoretical analysis of feature pooling in visual recognition. In: Proceedings of the 27th international conference on machine learning, pp 111–118

3. Bouvrie J (2006) Notes on convolutional neural networks. Neural Nets 2006:1–8

4. Caruana R (1997) Multitask learning. Mach Learn 28(1):41–75

5. Chen L, Mao X, Xue Y-L, Cheng LL (2012) Speech emotion recognition: features and classification models. Digital Signal Process 22(6):1154–1160

Cited by 123 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. MPSA-DenseNet: A novel deep learning model for English accent classification;Computer Speech & Language;2025-01

2. Speech emotion recognition based on bi-directional acoustic–articulatory conversion;Knowledge-Based Systems;2024-09

3. A Methodical Framework Utilizing Transforms and Biomimetic Intelligence-Based Optimization with Machine Learning for Speech Emotion Recognition;Biomimetics;2024-08-26

4. Multi-level LSTM framework with hybrid sonic features for human–animal conflict evasion;The Visual Computer;2024-08-05

5. Unveiling Social Anxiety: Analyzing Acoustic and Linguistic Traits in Impromptu Speech within a Controlled Study;ACM Journal on Computing and Sustainable Societies;2024-06-20