1. Gaussian Error Linear Units (GELUs); Hendrycks, 2016
2. MLP-Mixer: An all-MLP Architecture for Vision; Tolstikhin, 2021
3. Deep Residual Learning for Image Recognition
4. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks; Tan; International Conference on Machine Learning, 2019
5. Searching for MobileNetV3