1. Point 4D transformer networks for spatiotemporal modeling in point cloud videos;fan;CVPR,2021
2. PointContrast: Unsupervised pretraining for 3D point cloud understanding;xie;ECCV,2020
3. An image is worth 16x16 words: Transformers for image recognition at scale;dosovitskiy;ICLRE,2021
4. Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction
5. PSTNet: Point spatiotemporal convolution on point cloud sequences;fan;ICLRE,2021