1. Arandjelovic, R., Zisserman, A.: Look, listen and learn. In: IEEE International Conference on Computer Vision, pp. 609–617 (2017). https://doi.org/10.1109/ICCV.2017.73
2. de Benito-Gorron, D., Lozano-Diez, A., Toledano, D.T., Gonzalez-Rodriguez, J.: Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset. EURASIP J. Audio Speech Music Process. 2019(1), 1–18 (2019). https://doi.org/10.1186/s13636-019-0152-1
3. Choi, K., Fazekas, G., Sandler, M.B., Cho, K.: Transfer learning for music classification and regression tasks. In: Cunningham, S.J., Duan, Z., Hu, X., Turnbull, D. (eds.) Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017, Suzhou, China, 23–27 October 2017, pp. 141–149 (2017). https://ismir2017.smcnus.org/wp-content/uploads/2017/10/12_Paper.pdf
4. Cramer, J., Wu, H.H., Salamon, J., Bello, J.: Look, listen, and learn more: design choices for deep audio embeddings. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3852–3856 (05 2019). https://doi.org/10.1109/ICASSP.2019.8682475
5. Doukhan, D., Lechapt, E., Evrard, M., Carrive, J.: Ina’s mirex 2018 music and speech detection system. In: Music Information Retrieval Evaluation eXchange (MIREX 2018) (2018)