1. Alexander, A., Mars, M., Tingey, J., Yu, H.: Audio-enhanced segment retrieval within the Spotify podcasts dataset. Technical report, University College London (2021)
2. Clifton, A., et al.: 100,000 podcasts: a Spoken English document corpus. In: Proceedings of the 28th International Conference on Computational Linguistics (COLING) (2020)
3. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics (2019)
4. Eyben, F., et al.: The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for voice research and affective computing. IEEE Trans. Affect. Comput. 7(2), 190–202 (2016)
5. Eyben, F., Wöllmer, M., Schuller, B.: Opensmile: the Munich versatile and fast open-source audio feature extractor. In: Proceedings of the International Conference on Multimedia - MM 2010. ACM Press (2010)