1. Apache Hadoop. http://hadoop.apache.org/. Accessed 12 Feb 2022
2. Apache spark$$^{\rm TM}$$ - unified engine for large-scale data analytics. http://spark.apache.com. Accessed 12 Feb 2022
3. Armbrust, M., et al.: Spark SQL: relational data processing in spark. In: Sellis, T.K., Davidson, S.B., Ives, Z.G. (eds.) Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31–June 4 2015, pp. 1383–1394. ACM (2015). https://doi.org/10.1145/2723372.2742797
4. Borthakur, D.: The Hadoop distributed file system: architecture and design. Hadoop Project Website 11(2007), 21 (2007)
5. Davidson, A., Or, A.: Optimizing shuffle performance in spark. University of California, Berkeley-Department of Electrical Engineering and Computer Sciences, Technical report (2013)