1. E. H. M. Sha, N. L. Passos. Efficient polynomial-time nested loop fusion with full parallelism. (1999).
2. Verification of loop parallelisations;Blom,2015
3. Improving memory hierarchy performance through combined loop interchange and multi-level fusion;Yi;Int. J. High Perform. Comput. Appl.,2004
4. DeNovo: Rethinking the memory hierarchy for disciplined parallelism;Choi,2011
5. A nested loop fusion algorithm based on cost analysis;Jie,2012