1. PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
2. AST: audio spectrogram trans-former;gong;CoRR,2021
3. Efficient training of audio transformers with patchout;koutini;CoRR,2021
4. An image is worth 16×16 words: Transformers for image recognition at scale;dosovitskiy;9th International Conference on Learning Representations ICLR 2021,0
5. Training data-efficient image transformers & distillation through attention;touvron;Proc of the 38th Int Conf on Machine Learning ICML 2021 Virtual Event,0