1. Birajdar G, Patil M (2019) Speech and music classification using spectrogram based statistical descriptors and extreme learning machine. Multimed Tools Appl 78(11):15141–15168
2. Bird S, Klein E, Loper E (2009) Natural language processing with Python, O’Reilly Media
3. Bordwell D, Thompson K, Smith J (2016) Film art: an introduction, McGraw-hill education; 11 edition, ISBN-13: 978–1259534959
4. Cerisara C, Král P, Lenc L (2018) On the effects of using word2vec representations in neural networks for dialogue act recognition. Comput Speech Lang 47:175–193
5. Cun Y, Bengio Y (1995) Convolutional networks for images, speech, and time series, The Handbook of Brain Theory and Neural Networks, M. A. Arbib, Ed. Cambridge, MA: MIT Press, 255–258