1. MuSe 2023 Challenge: Multimodal Prediction of Mimicked Emotions, Cross-Cultural Humour, and Personalised Recognition of Affects
2. Shahin Amiriparian, Maurice Gerczuk, Sandra Ottl, Nicholas Cummins, Michael Freitag, Sergey Pugachevskiy, Alice Baird, and Björn W. Schuller. 2017. Snore Sound Classification Using Image-Based Deep Spectrum Features. In Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20--24, 2017. 3512--3516.
3. Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6--12, 2020, virtual.
4. Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, and Armand Joulin. 2021. Emerging Properties in Self-Supervised Vision Transformers. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10--17, 2021. 9630--9640.
5. ViTFER: Facial Emotion Recognition with Vision Transformers