1. Bengio, Y., LeCun, Y., et al.: Scaling learning algorithms towards AI. Large-scale Kernel Mach. 34, 1–41 (2007)
2. Utgoff Hinton, G.E., Osindero, S., Teh, Y.-W.: Many-layered learning. Neural Comput. 14, 2497–2529 (2002). MIT Press
3. Hinton, G.E., Osindero, S., Teh, Y.-W.: A fast learning algorithm for deep belief nets. Neural Comput. 18, 1527–1554 (2006). MIT Press
4. Freund, Y., Haussler, D.: Unsupervised learning of distributions of binary vectors using two layer networks, Computer Research Laboratory, University of California, Santa Cruz (1994)
5. Bengio, Y., Lamblin, P., Dan, P., et al.: Greedy layer-wise training of deep networks. In: Advances in Neural Information Processing Systems, vol. 19, p. 153. MIT Press (2007)