1. Hand-transformer: nonautoregressive structured modeling for 3d hand pose estimation;huang;Proceedings of the European Conference on Computer Vision (ECCV),2020
2. Generative pretraining from pixels;chen;Proceedings of the International Conference on Machine Learning (ICML),2020
3. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018
4. Scene Graph Generation With External Knowledge and Image Reconstruction
5. Factorizable Net: An Efficient Subgraph-Based Framework for Scene Graph Generation