1. Cooley, J.W., Tukey, J.W.: An algorithm for the machine calculation of complex Fourier series. Math. Comput. 19, 297–301 (1965)
2. Strong, J.P.: The Fourier transform on mesh connected processing arrays such as massively parallel processors. CAPAIDM 19, 190–196 (1986)
3. He, S., Torkelson, M.: A systolic array implementation of common factor algorithm to compute DFT. In: International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN), pp. 374–381 (1994)
4. Sinha, B.P., Mukherjee, A.: Parallel sorting algorithm using multiway merge and its implementation on a multi-mesh network. J. Parallel Distrib. Comput. 60, 891–960 (2000)
5. JáJá, J.: An Introduction to Parallel Algorithms. Addison-Wesley (1992)