1. Androutsopoulos, I., Paliouras, G., Karkaletsis, V., Sakkis, G., Spyropoulos, C., Stamatopoulos, P., 2000. Learning to filter spam e-mail: a comparison of a naive Bayesian and a memory-based approach. In: Proceedings of the Workshop on Machine Learning and Textual Information Access, Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD-2000), pp. 1–13.
2. Broder, A.Z., 1997. On the resemblance and containment of documents.
3. Brutlag, J., Meek, C., 2000. Challenges of the email domain for text classification. In: Proceedings of the 17th International Conference on Machine Learning, pp. 103–110 (July).
4. Caropreso, M.F., Matwin, S., Sebastiani, F., 2001. A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In: Chin, A.G. (Ed.), Text Databases and Document Management: Theory and Practice. Idea Group, pp. 78–102.
5. Clark, P., Niblett, T., 1989. The CN2 induction algorithm. Machine Learning.