1. Hinton, G.E., McClelland, J.L., Rumelhart, D.E.: Distributed representations. https://web.stanford.edu/~jlmcc/papers/PDP/Chapter3.pdf
2. Harris, Z.S.: Distributional structure. Word (1954)
3. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
4. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
5. Chalapathy, R., Borzeshi, E.Z., Piccardi, M.: Bidirectional LSTM-CRF for clinical concept extraction. arXiv preprint arXiv:1611.08373 (2016). https://arxiv.org/abs/1611.08373v1