1. Duration Robust Weakly Supervised Sound Event Detection
2. Audio Captioning Transformer;mei,2021
3. Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval;koizumi,2020
4. BERTScore: Evaluating Text Generation with BERT;zhang,2020
5. Audio Captioning using Gated Recurrent Units;eren,2021