1. Abney, S. (1997). Stochastic attribute-value grammars. Computational Linguistics, 23, 597–618.
2. Anthony, M. and P. L. Bartlett. (1999). Neural Network Learning: Theoretical Foundations. Cambridge University Press.
3. Bartlett, P. L. (1998). The sample complexity of pattern classification with neural networks: the size of the weights is more important than the size of the network. IEEE Transactions on Information Theory, 44(2): 525–536, 1998.
4. Block, H. D. (1962). The perceptron: A model for brain functioning. Reviews of Modern Physics, 34, 123–135.
5. Bod, R. (1998). Beyond Grammar: An Experience-Based Theory of Language. CSLI Publications/Cambridge University Press.