1. [n.d.]. IMDb Non-Commercial Datasets. https://developer.imdb.com/non-commercial-datasets/. Accessed: 2023-02-12.
2. [n.d.]. Intel HiBench -- Big Data Benchmark. https://github.com/Intel-bigdata/HiBench. Accessed: 2023-02-25.
3. Distributed join algorithms on thousands of cores
4. Matt Calder, Xun Fan, Zi Hu, Ethan Katz-Bassett, John Heidemann, and Ramesh Govindan. 2013. Mapping the Expansion of Google's Serving Infrastructure (IMC '13). Association for Computing Machinery, New York, NY, USA, 313--326.
5. On random sampling over joins