1. Almuallim, H., & Dietterich, T. G. (1991). Learning with many irrelevant features. In AAAI (pp. 547–552).
2. Apte, C., Damerau, F., & Weiss, S. (1998). Text mining with decision trees and decision rules. In Workshop on learning from text and the web – Conference on automated learning and discovery.
3. Distributional word clusters vs. words for text categorization;Bekkerman;Journal of Machine Learning Research,2003
4. Multilabel text categorization based on a new linear classifier learning method and a category-sensitive refinement method;Chang;Expert Systems with Applications,2008
5. Feature selection for text classification with Naive Bayes;Chen;Expert Systems with Applications,2009