1. Abu-El-Haija, S., Kothari, N., Lee, J., Natsev, P., Toderici, G., Varadarajan, B., & Vijayanarasimhan, S. (2016). Youtube-8m: A large-scale video classification benchmark. arXiv:1609.08675
2. An, X., Zhu, X., Xiao, Y., Wu, L., Zhang, M., Gao, Y., Qin, B., Zhang, D., & Fu, Y. (2020). Partial fc: Training 10 million identities on a single machine. arXiv:2010.05222
3. Anderson, C. (2006). The long tail: Why the future of business is selling less of more. Hachette Books.
4. Anderson, P., Fernando, B., Johnson, M., & Gould, S. (2016). Spice: Semantic propositional image caption evaluation. In Proceedings of the European conference on computer vision (pp. 382–398).
5. Andrej, K., George, T., Sanketh, S., Thomas, L., Rahul, S., & Li, F.F. (2014). Large-scale video classification with convolutional neural networks. In Proceedings of the IEEE international conference on computer vision (pp. 1725–1732).