1. An, A., Wang, Y., 2001. Comparisons of classification methods for screening potential compounds. In: IEEE International Conference on Data Mining, 2001.
2. Bradley, 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition.
3. Coppock, D.S., 2002. Data Modeling and Mining: Why Lift? Published in DM Review online. Available from: .
4. Kolcz, A., Chowdhury, A., Alspector, J., 2003. Data duplication: An imbalance problem. In: Workshop on Learning from Imbalanced Data Sets (ICML).
5. Lee, 2000. Noisy replication in skewed binary classification. Comput. Statist. Data Anal.