Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGA BT - Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings. Arap, O., Brown, G., Himebaugh, B., & Swany, M. Volume 8632 LNCS. Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGA BT - Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings, pages 632-643. Springer International Publishing, 2014.
Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGA BT  - Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings [link]Website  abstract   bibtex   
Collective operations play a key role in the performance of many high performance computing applications and are central to the widely used Message Passing Interface (MPI) programming model. In this paper we explore the use of programmable networking devices to accelerate the implementation of collective operations by offloading functionality to the underlying network. In our work we utilize a networked FPGA in conjunction with commercial OpenFlow switches supporting multicast. The union of hardware configurable network interfaces with Software Defined Networking (SDN) provides a significant opportunity to improve the performance of MPI applications that rely heavily on collective operations. The programmable interfaces implement collective operations in hardware using OpenFlow supported multicast. In our 8-node cluster, we observed up to 12% reduction in MPI_Allreduce latency in dynamic schemes employing SDN; and up to 22% reduction in static topologies. The results suggest more benefits if our approach is deployed in larger settings with low latency switches.
@inBook{
 title = {Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGA BT  - Euro-Par 2014 Parallel Processing: 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings},
 type = {inBook},
 year = {2014},
 identifiers = {[object Object]},
 pages = {632-643},
 volume = {8632 LNCS},
 websites = {https://doi.org/10.1007/978-3-319-09873-9_53},
 publisher = {Springer International Publishing},
 city = {Cham},
 editors = {[object Object],[object Object],[object Object]},
 id = {6e20c6ab-9acb-3f39-87af-a1921d0eb79c},
 created = {2018-03-05T18:20:22.438Z},
 file_attached = {false},
 profile_id = {42d295c0-0737-38d6-8b43-508cab6ea85d},
 group_id = {9d761a94-2f2d-31ce-a8c3-50aa6d668643},
 last_modified = {2018-03-05T18:20:22.438Z},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Arap2014},
 source_type = {CHAP},
 private_publication = {false},
 abstract = {Collective operations play a key role in the performance of many high performance computing applications and are central to the widely used Message Passing Interface (MPI) programming model. In this paper we explore the use of programmable networking devices to accelerate the implementation of collective operations by offloading functionality to the underlying network. In our work we utilize a networked FPGA in conjunction with commercial OpenFlow switches supporting multicast. The union of hardware configurable network interfaces with Software Defined Networking (SDN) provides a significant opportunity to improve the performance of MPI applications that rely heavily on collective operations. The programmable interfaces implement collective operations in hardware using OpenFlow supported multicast. In our 8-node cluster, we observed up to 12% reduction in MPI_Allreduce latency in dynamic schemes employing SDN; and up to 22% reduction in static topologies. The results suggest more benefits if our approach is deployed in larger settings with low latency switches.},
 bibtype = {inBook},
 author = {Arap, Omer and Brown, Geoffrey and Himebaugh, Bryce and Swany, Martin},
 book = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)}
}

Downloads: 0