TopoOpt: Co-optimizing Network Topology and Parallelization Strategy for Distributed Training Jobs. Wang, W., Khazraee, M., Zhong, Z., Ghobadi, M., Jia, Z., Mudigere, D., Zhang, Y., & Kewitsch, A. In Balakrishnan, M. & Ghobadi, M., editors, 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023, Boston, MA, April 17-19, 2023, pages 739–767, 2023. USENIX Association.
TopoOpt: Co-optimizing Network Topology and Parallelization Strategy for Distributed Training Jobs [link]Paper  bibtex   
@inproceedings{DBLP:conf/nsdi/WangKZGJM0K23,
  author       = {Weiyang Wang and
                  Moein Khazraee and
                  Zhizhen Zhong and
                  Manya Ghobadi and
                  Zhihao Jia and
                  Dheevatsa Mudigere and
                  Ying Zhang and
                  Anthony Kewitsch},
  editor       = {Mahesh Balakrishnan and
                  Manya Ghobadi},
  title        = {TopoOpt: Co-optimizing Network Topology and Parallelization Strategy
                  for Distributed Training Jobs},
  booktitle    = {20th {USENIX} Symposium on Networked Systems Design and Implementation,
                  {NSDI} 2023, Boston, MA, April 17-19, 2023},
  pages        = {739--767},
  publisher    = {{USENIX} Association},
  year         = {2023},
  url          = {https://www.usenix.org/conference/nsdi23/presentation/wang-weiyang},
  timestamp    = {Thu, 11 May 2023 17:08:22 +0200},
  biburl       = {https://dblp.org/rec/conf/nsdi/WangKZGJM0K23.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0