1. Parallelized stochastic gradient descent;zinkevich;Advances in Neural Information Processing Systems (NIPS),2010
2. Automatic differentiation in pytorch;paszke,2017
3. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
4. Microsoft coco: Common objects in context;lin;Proceedings of the European Conference on Computer Vision (ECCV),2014
5. Very deep convolutional networks for large-scale image recognition;simonyan;CoRR,2015