1. Xception: deep learning with depthwise separable convolutions;Chollet,2017
2. Distance-IoU loss: faster and better learning for bounding box regression;Zheng,2020
3. On layer normalization in the transformer architecture;Xiong,2020
4. Loshchilov L., Hutter F. Decoupled weight decay regularization [EB/OL]. https://arxiv.org/abs/1711.05101v3, 2021.
5. Deep residual learning for image recognition;He,2016