1. He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. Spatial pyramid pooling in deep convolutional networks for visual recognition. In ECCV, 2014.
2. Girshick Ross, Donahue Jeff, Darrel Trevor, and Malik Jitendra. Region-based convolutional networks for accurate object detection and segmentation. IEEE transactions on pattern analysis and machine intelligence, 2015.
3. Ren Shaoqing, He Kaiming, Girshick Ross, and Sun Jian. Faster r-cnn: Towards realtime object detection with region proposal networks. In NIPS, 2015.
4. Lin Tsung-Yi, Goyal Priya, Girshick Ross, He Kaiming, and Dollar Piotr. Focal Loss for dense object detection. arXiv preprint arXiv:1708.02002, 2017.
5. Farhadi Ali, Endres Ian, Hoiem Derek, and Forsyth Devid. Describing objects by their attributes. In Computer Vision and Pattern Recognition, 2009.