1. Speech/music segmentation using entropy and dynamism features in a hmm classification framework;Ajmera;Speech Communication,2003
2. A comparison of features for speech, music discrimination;Carey,1999
3. Choi, K., Fazekas, G., & Sandler, M. (2016). Explaining deep convolutional neural networks on music classification. arXiv:1607.02444.
4. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition;Dahl;IEEE Transactions on Audio, Speech, and Language Processing,2012
5. Imagenet: A large-scale hierarchical image database;Deng,2009