1. A model and survey of distributed data-intensive systems;Margara;ACM Comput. Surv.,2023
2. Apache Spark: A unified engine for big data processing;Zaharia;Commun. ACM,2016
3. Discretized streams: Fault-tolerant streaming computation at scale;Zaharia,2013
4. The dataflow model: A practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing;Akidau;Proc. VLDB Endow.,2015
5. Apache Flink™: Stream and batch processing in a single engine;Carbone;IEEE Data Eng. Bull.,2015