1. Sayad, S. (2011). Real Time Data Mining, Self-Help Publishers.
2. Learning from imbalanced data: Open challenges and future directions;Prog. Artif. Intell.,2016
3. (2021, December 24). Spark 2.1.0 Documentation. Available online: https://spark.apache.org/docs/2.1.0/.
4. Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauly, M., Franklin, M.J., Shenker, S., and Stoica, I. (2012, January 25–27). Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. Proceedings of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12), San Jose, CA, USA.
5. (2021, December 04). Apache Hadoop- MapReduce. Available online: https://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html.