1. Bondhugula U, Baskaran M, Krishnamoorthy S, Ramanujam J, Rountev A, Sadayappan P (2007) Affine transformations for communication minimal parallelization and locality optimization of arbitrarily nested loop sequences. Technical Report, The Ohio State University (OSU-CISRC-5/07-TR43)
2. Gordon S (2004) Numerical solution of partial differential equations: finite difference methods, 3rd edn. Clarendon Press, Oxford
3. Cong J, Huang M, Zou Y (2011) Accelerating fluid registration algorithm on multi-FPGA platforms. In: 21st International Conference on Field Programmable Logic and Applications, IEEE, Sep 2011, pp 50–57.
https://doi.org/10.1109/FPL.2011.20
4. Taflove A, Hagness S (1995) Computational electrodynamics: the finite-difference time-domain method, 2nd edn. Artech House, Boston
5. Han D, Xu S, Chen L, Huang L (2011) PADS: a pattern-driven stencil compiler-based tool for reuse of optimizations on GPGPUs. In: 17th International Conference on Parallel and Distributed Systems (ICPADS), IEEE, Dec 2011, pp 308–315.
https://doi.org/10.1109/ICPADS.2011.94