1. Hadoop.Apache Hadoop documentation 2019.https://hadoop.apache.org/docs/r2.7.3/
2. Apache Flink: stream and batch processing in a single engine;Carbone P;IEEE Data Eng Bull,2015
3. YuY IsardM FetterlyD BudiuM ErlingssonU GundaPK CurreyJ.DryadLINQ: a system for general‐purpose distributed data‐parallel computing using a high‐level language. InProceedings of the 8th USENIX Conference on Operating Systems Design and Implementation OSDI'08.USENIX Association:Berkeley CA USA;2008 p.1–14.http://dl.acm.org/citation.cfm?id=1855741.1855742
4. BeamA.Apache Beam: an advanced unified programming model 2016.https://beam.apache.org/
5. ZahariaM ChowdhuryM FranklinMJ ShenkerS StoicaI.Spark: cluster computing with working sets. InProceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing HotCloud'10.USENIX Association:Berkeley CA USA;2010 p.10.http://dl.acm.org/citation.cfm?id=1863103.1863113