1. C. Aggarwal. Neural networks and deep learning: A textbook. Springer, 2018.
2. R. Ahuja, T. Magnanti, and J. Orlin. Network flows: Theory, algorithms, and applications, Prentice Hall, 1993.
3. A. Bryson. A gradient method for optimizing multi-stage allocation processes. Harvard University Symposium on Digital Computers and their Applications, 1961.
4. M. Garey and D. Johnson. Computers and Intractability, Freeman, 2002.
5. I. Goodfellow, Y. Bengio, and A. Courville. Deep learning. MIT Press, 2016.