1. Refining action segmentation with hierarchical video representations;Ahn,2021
2. Quo vadis, action recognition? A new model and the Kinetics dataset;Carreira,2017
3. Action segmentation with joint self-supervised temporal domain adaptation;Chen,2020
4. On the relationship between self-attention and convolutional layers;Cordonnier,2020
5. CoAtNet: marrying convolution and attention for all data sizes;Dai;CoRR,2021