1. Optimal Approximation with Sparsely Connected Deep Neural Networks
2. Chaudhari, P., Choromanska, A., Soatto, S., LeCun, Y., Baldassi, C., Borgs, C., Chayes, J., Sagun, L., and Zecchina, R. (2016), “Entropy-SGD: Biasing Gradient Descent Into Wide Valleys,” arXiv no. 1611.01838.