Cooperative rendezvous protocols for improved performance and overlap. Chakraborty, S., Bayatpour, M., Hashmi, J., Subramoni, H., & Panda, D. K. In SC18: International Conference for High Performance Computing, Networking, Storage and Analysis, pages 361–373, 2018. IEEE.
With the emergence of larger multi-/many-core clusters and new areas of HPC applications, performance of large message communication is becoming more important. MPI libraries use different rendezvous protocols to perform large message communication. However, existing rendezvous protocols do not take the overall communication pattern into account or make optimal use of the Sender and the Receiver CPUs. In this work, we propose a cooperative rendezvous protocol that can provide up to 2× improvement in intra-node bandwidth and latency for large messages. We also propose designs to dynamically choose the best rendezvous protocol for each message based on the overall communication pattern. Finally, we show how these improvements can increase the overlap of intra-node communication and computation with inter-node communication and lead to application level benefits at scale. We evaluate the proposed designs on three different architectures - Intel Xeon, Knights Landing, and OpenPOWER against state-of-the-art MPI libraries including MVAPICH2 and Open MPI. Compared to existing designs, the proposed designs show benefits of up to 19% with Graph500, 16% with CoMD, and 10% with MiniGhost.
@inproceedings{chakraborty2018cooperative,
  title={{Cooperative rendezvous protocols for improved performance and overlap}},
  author={Chakraborty, Sourav and Bayatpour, Mohammadreza and Hashmi, J and Subramoni, Hari and Panda, Dhabaleswar K.},
  booktitle={SC18: International Conference for High Performance Computing, Networking, Storage and Analysis},
  pages={361--373},
  year={2018},
  abstract={With the emergence of larger multi-/many-core clusters and new areas of HPC applications, performance of large message communication is becoming more important. MPI libraries use different rendezvous protocols to perform large message communication. However, existing rendezvous protocols do not take the overall communication pattern into account or make optimal use of the Sender and the Receiver CPUs. In this work, we propose a cooperative rendezvous protocol that can provide up to 2× improvement in intra-node bandwidth and latency for large messages. We also propose designs to dynamically choose the best rendezvous protocol for each message based on the overall communication pattern. Finally, we show how these improvements can increase the overlap of intra-node communication and computation with inter-node communication and lead to application level benefits at scale. We evaluate the proposed designs on three different architectures - Intel Xeon, Knights Landing, and OpenPOWER against state-of-the-art MPI libraries including MVAPICH2 and Open MPI. Compared to existing designs, the proposed designs show benefits of up to 19\% with Graph500, 16\% with CoMD, and 10\% with MiniGhost.},
  organization={IEEE}
}
