1. Seger, C. (2018). An Investigation of Categorical Variable Encoding Techniques in Machine Learning: Binary versus One-Hot And Feature Hashing. [Independent Thesis Basic Level (Degree of Bachelor), Royal Institute of Technology].
2. A vector space model for automatic indexing;Salton;Commun. ACM,1975
3. Cavnar, W.B., and Trenkle, J.M. (1994, January 26–28). N-gram-based text categorization. Proceedings of the SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, NV, USA.
4. An information-theoretic perspective of tf–idf measures;Aizawa;Inf. Process. Manag.,2003
5. Gradient-based learning applied to document recognition;LeCun;Proc. IEEE,1998