1. Apache Hadoop Project. Apache Hadoop, 2018. (Online; accessed May 2018).
2. Apache Spark: Lightning-fast cluster computing. Apache spark, 2018. (Online; accessed May 2018).
3. K. Bache, M. Lichman, UCI machine learning repository, 2013.
4. M.A. Beyer, D. Laney, 3d data management: controlling data volume, velocity and variety, 2001.
5. J. Bins, B.A. Draper, Feature selection from huge feature sets, in: Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, vol. 2, 2001, pp. 159–165.