1. Synchronous and asynchronous algorithms for matrix transposition on MCAP;Azari,1988
2. Complete exchange on a circuit switched mesh;Bokhari,1992
3. ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers;Choi,1992
4. The design of scalable software libraries for distributed memory concurrent computers;Choi,1992
5. PUMMA: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers;Choi;Concurrency: Practice and Experience,1994