1. MLI: an API for distributed machine learning;Sparks,2013
2. Hadoop: The Definitive Guide;White,2009
3. Spark: cluster computing with working sets;Zaharia,2010
4. A parallel distributed Weka framework for big data mining using Spark;Koliopoulos,2015
5. Do we need hundreds of classifiers to solve real world classification problems?;Fernández-Delgado;J. Mach. Learn. Res.,2014