1. The ClueWeb09 Dataset;Callan,2009
2. Efficient and effective spam filtering and re-ranking for large web datasets;Cormack;Inf. Retr.,2011
3. Indexing without spam;Zuccon,2011
4. Effects of spam removal on search engine efficiency and effectiveness;Crane,2012
5. The Lemur Project and Its ClueWeb12 Dataset;Callan,2012