Affiliation:
1. Arkansas State University, USA
Abstract
This chapter discusses Open Source Software and associated technologies for the processing of Big Data. This includes discussions of Hadoop-related projects, the current top open source data tools and frameworks such as SMACK that is acronym for open source technologies Spark, Mesos, Akka, Cassandra, and Kafka that together compose the ingestion, aggregation, analysis, and storage layers for Big Data processing. Tabular summaries and categories for 38 Open Source Statistical Software (OSSS) are provided that include for each listing of features and URLs for free downloads. The current challenges of Big Data and Open Source Software are also discussed.
Reference44 articles.
1. Apache Avro. (2013). Retrieved May 17, 2018 from https://cwiki.apache.org/confluence/display/AVRO/Index
2. Free and Open Source Software Licenses Explained
3. Bui, A. (2018). The big data stack: Powering data lakes, data warehouses and beyond. Retrieved July 8, 2019 from https://blog.panoply.io/the-big-data-stack-powering-data-lakes-data-warehouses-and-beyond