Understanding and Analyzing Interconnect Errors and Network Congestion on a Large Scale HPC System. Kumar, M., Gupta, S., Patel, T., Wilder, M., Shi, W., Fu, S., Engelmann, C., & Tiwari, D. In 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2018, Luxembourg City, Luxembourg, June 25-28, 2018, pages 107–114, 2018. IEEE Computer Society.
Understanding and Analyzing Interconnect Errors and Network Congestion on a Large Scale HPC System [link]Paper  doi  bibtex   
@inproceedings{DBLP:conf/dsn/KumarGPWSFET18,
  author       = {Mohit Kumar and
                  Saurabh Gupta and
                  Tirthak Patel and
                  Michael Wilder and
                  Weisong Shi and
                  Song Fu and
                  Christian Engelmann and
                  Devesh Tiwari},
  title        = {Understanding and Analyzing Interconnect Errors and Network Congestion
                  on a Large Scale {HPC} System},
  booktitle    = {48th Annual {IEEE/IFIP} International Conference on Dependable Systems
                  and Networks, {DSN} 2018, Luxembourg City, Luxembourg, June 25-28,
                  2018},
  pages        = {107--114},
  publisher    = {{IEEE} Computer Society},
  year         = {2018},
  url          = {https://doi.org/10.1109/DSN.2018.00023},
  doi          = {10.1109/DSN.2018.00023},
  timestamp    = {Mon, 05 Feb 2024 00:00:00 +0100},
  biburl       = {https://dblp.org/rec/conf/dsn/KumarGPWSFET18.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0