Understanding PCIe performance for end host networking. Neugebauer, R., Antichi, G., Zazo, J. F., Audzevich, Y., López-Buedo, S., & Moore, A. W. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, pages 327–341, Aug, 2018. doi abstract bibtex 2 downloads © 2018 Copyright held by the owner/author(s). In recent years, spurred on by the development and availability of programmable NICs, end hosts have increasingly become the enforcement point for core network functions such as load balancing, congestion control, and application specific network offloads. However, implementing custom designs on programmable NICs is not easy: many potential bottlenecks can impact performance. This paper focuses on the performance implication of PCIe, the de-facto I/O interconnect in contemporary servers, when interacting with the host architecture and device drivers. We present a theoretical model for PCIe and pcie-bench, an open-source suite, that allows developers to gain an accurate and deep understanding of the PCIe substrate. Using pcie-bench, we characterize the PCIe subsystem in modern servers. We highlight surprising differences in PCIe implementations, evaluate the undesirable impact of PCIe features such as IOMMUs, and show the practical limits for common network cards operating at 40Gb/s and beyond. Furthermore, through pcie-bench we gained insights which guided software and future hardware architectures for both commercial and research oriented network cards and DMA engines.
@InProceedings{Neugebauer2018Understanding,
author = {Neugebauer, Rolf and Antichi, Gianni and Zazo, Jos{\'e} Fernando and Audzevich, Yury and L{\'o}pez-Buedo, Sergio and Moore, Andrew W.},
booktitle = {Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication},
title = {{Understanding PCIe performance for end host networking}},
year = {2018},
month = {Aug},
pages = {327--341},
abstract = {© 2018 Copyright held by the owner/author(s). In recent years, spurred on by the development and availability of programmable NICs, end hosts have increasingly become the enforcement point for core network functions such as load balancing, congestion control, and application specific network offloads. However, implementing custom designs on programmable NICs is not easy: many potential bottlenecks can impact performance. This paper focuses on the performance implication of PCIe, the de-facto I/O interconnect in contemporary servers, when interacting with the host architecture and device drivers. We present a theoretical model for PCIe and pcie-bench, an open-source suite, that allows developers to gain an accurate and deep understanding of the PCIe substrate. Using pcie-bench, we characterize the PCIe subsystem in modern servers. We highlight surprising differences in PCIe implementations, evaluate the undesirable impact of PCIe features such as IOMMUs, and show the practical limits for common network cards operating at 40Gb/s and beyond. Furthermore, through pcie-bench we gained insights which guided software and future hardware architectures for both commercial and research oriented network cards and DMA engines.},
day = {7},
doi = {10.1145/3230543.3230560},
isbn = {9781450355674},
publicationstatus = {published},
}
Downloads: 2
{"_id":"zK7LDbxx8gL6Ek6a7","bibbaseid":"neugebauer-antichi-zazo-audzevich-lpezbuedo-moore-understandingpcieperformanceforendhostnetworking-2018","authorIDs":["23FvDKJM79risFbvk","2WRdLjsYbtmJQGXTj","3TdN7rFeMEefY9Hk7","3rTL65wF3DnHKjPrH","4D4M2oJhgoxJWmkcN","5e544e2f7a758fde010001b9","5e5456f588d190df01000080","5e546e797f0f44de01000043","5e54722a7f0f44de01000067","5e554f16e89e5fde0100002a","5e56ad6de177dede01000101","5e592f5fe60e02de010000a8","5e5b8cd399fee0df0100052f","5e5d95b05726ecdf0100009d","5e5e61c85f9e7bee010001b2","5e5ecf0f8c261adf010000e3","5e60f44831c7d3de0100009c","5e610d751cc34ede01000001","5e63cd1278da4fdf01000009","5e64eac85fc0b7de01000024","5e657177de41b9df01000190","5e666d8f46e828de010002c0","5e66782c152d6bde010000ad","5e66bc684b4a62de01000140","5e671b0e0355fddf0100001a","5e67421e4d24a6df0100009a","5e680874c1fce0de01000287","5e6902b38457bade0100024c","5e69eebe3aab3cdf0100027c","5e6b02dd69285ede010000d4","6dtSpmYM2v5HiJ9ko","7PYozaA4XxD8TMSWu","7v67D8gAyyrYcYL8n","8FAeKWJrSQzGoH9xE","92btiJpxxLrqAXgMn","9RzNnQ6CDTvwDBvLk","9xFPbGnxzbTckjz9c","AKtnLn9iXuKfsKnvb","BFbxJdJp2sdQE6jab","C42bg96aJPxALZDMj","DtNfHaeJrKpfNoJ6Y","EjCF3Z7mromd37b46","FAdZj59doyPG95Ft6","FS3CaSTZahuktxKRK","FSHFxTSeaaMHc8JFN","FjBPfgDjAs9qKbrk5","FsZDw63cXDZGCg8cM","FyuCzEsmg5eZwps34","G7Zffx7BGjho2AJZF","GWQKt32StcKocqn5B","GkmLPhgRgdfPyWMpr","GuCyBeEq4CLqBYY4e","HEsiueufKRKp7GbNg","HeDJuNtADqRvbfrNq","HjfA3eJtFSPtgMBym","Jqijff5rjodf97X53","KtL6LDN8YAwLxHa65","Mkn8H8agB7ogrFqWN","NzXzmqRTuu45HnK2v","PrcWsJASY7LaFZhBB","PwRjbkKnSWC6H3n5k","R7kWfTSiqEYweDNJJ","RnYAzvE3YT6XCvYbL","RzmZwHWjFC9KDceD6","S2grxY7kvZtXmkWaz","SFyHCv2aSe3LSrsig","T24gLu9jAvzXQKDdi","TetaaoYbFZFGavtxb","TrJkGmeXPf45TAc2W","WtXExMgBkc2WjYBJt","XA7WBWWEgKAXwNJCd","Xx5DKm38fz55YJDuS","YAHhHms8w9X3eEygF","YkX9xP5ow83tk9puf","YxYksTSKXdNyMqJsi","ZL6Dbz49T8BDz5vKk","aFMaT3CuDYnbfTuK2","aYh8WXnpTgutjdTMd","appbKSaMFf3DMDdyh","bPre4gjTqMby95fE9","bwBtZ44dECqBAqL6B","byXTwmgkSDjBepE6s","dQHw4Zt3JvgfJe4LH","fRHmMjA45atDn2XE2","fswYidNG7RDWyDvav","gGhwKTHH8tmmv276q","h2nvFJ6oT6Ts8LGw8","iWm4sAET4pyRiymhf","ij4rWKxtaweP2mmoQ","j399RSNqBHhuysXLG","jBsGo5nSsKF9Aswd7","jE5oX3AQSLtsnWFq5","jQhrZRo2jakTNZXE2","jXnTqkWvn2uys2LvQ","jYoTKtmcqmnjxyeYG","nCcpY7DvabFtejfsb","nmJWcKmR9YysKb2rn","pDgAmdf6bhwdtM7xi","pmYukAPu4EpuZAnZm","qGrsAHNkXL2yfxqju","qsecNeipc9bA8BJCt","r9c22nKBKGFJSB2La","srLPnyAzJWoRtHW8E","tN3iZEGhSw5dfH6Xz","tkFNTSf2DZXQJ2S68","uhtv8BNREAX7LsJYP","uokAikkyvYZZBbmbj","v2nSToyinuWzoQD4m","vhEMZTdeqfxeNkLCx","vve5WiBtmQqb2NB9Z","vwrt7tPuWdGZn9rfW","wbMcKHXZfdcWNjsvh","xBPtJ8k7YjPfQvS3C","xnzKrBDzkkPg97c7S","y8abYWAnJv92KiDwR","ycjRe3yAeg8atdvL6","ycyAerqYNXsBXxcPX","ykErHr8DwXQjFCSFG","zyNiT5NQSkPSNGs43"],"author_short":["Neugebauer, R.","Antichi, G.","Zazo, J. F.","Audzevich, Y.","López-Buedo, S.","Moore, A. W."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","author":[{"propositions":[],"lastnames":["Neugebauer"],"firstnames":["Rolf"],"suffixes":[]},{"propositions":[],"lastnames":["Antichi"],"firstnames":["Gianni"],"suffixes":[]},{"propositions":[],"lastnames":["Zazo"],"firstnames":["José","Fernando"],"suffixes":[]},{"propositions":[],"lastnames":["Audzevich"],"firstnames":["Yury"],"suffixes":[]},{"propositions":[],"lastnames":["López-Buedo"],"firstnames":["Sergio"],"suffixes":[]},{"propositions":[],"lastnames":["Moore"],"firstnames":["Andrew","W."],"suffixes":[]}],"booktitle":"Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication","title":"Understanding PCIe performance for end host networking","year":"2018","month":"Aug","pages":"327–341","abstract":"© 2018 Copyright held by the owner/author(s). In recent years, spurred on by the development and availability of programmable NICs, end hosts have increasingly become the enforcement point for core network functions such as load balancing, congestion control, and application specific network offloads. However, implementing custom designs on programmable NICs is not easy: many potential bottlenecks can impact performance. This paper focuses on the performance implication of PCIe, the de-facto I/O interconnect in contemporary servers, when interacting with the host architecture and device drivers. We present a theoretical model for PCIe and pcie-bench, an open-source suite, that allows developers to gain an accurate and deep understanding of the PCIe substrate. Using pcie-bench, we characterize the PCIe subsystem in modern servers. We highlight surprising differences in PCIe implementations, evaluate the undesirable impact of PCIe features such as IOMMUs, and show the practical limits for common network cards operating at 40Gb/s and beyond. Furthermore, through pcie-bench we gained insights which guided software and future hardware architectures for both commercial and research oriented network cards and DMA engines.","day":"7","doi":"10.1145/3230543.3230560","isbn":"9781450355674","publicationstatus":"published","bibtex":"@InProceedings{Neugebauer2018Understanding,\n author = {Neugebauer, Rolf and Antichi, Gianni and Zazo, Jos{\\'e} Fernando and Audzevich, Yury and L{\\'o}pez-Buedo, Sergio and Moore, Andrew W.},\n booktitle = {Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication},\n title = {{Understanding PCIe performance for end host networking}},\n year = {2018},\n month = {Aug},\n pages = {327--341},\n abstract = {© 2018 Copyright held by the owner/author(s). In recent years, spurred on by the development and availability of programmable NICs, end hosts have increasingly become the enforcement point for core network functions such as load balancing, congestion control, and application specific network offloads. However, implementing custom designs on programmable NICs is not easy: many potential bottlenecks can impact performance. This paper focuses on the performance implication of PCIe, the de-facto I/O interconnect in contemporary servers, when interacting with the host architecture and device drivers. We present a theoretical model for PCIe and pcie-bench, an open-source suite, that allows developers to gain an accurate and deep understanding of the PCIe substrate. Using pcie-bench, we characterize the PCIe subsystem in modern servers. We highlight surprising differences in PCIe implementations, evaluate the undesirable impact of PCIe features such as IOMMUs, and show the practical limits for common network cards operating at 40Gb/s and beyond. Furthermore, through pcie-bench we gained insights which guided software and future hardware architectures for both commercial and research oriented network cards and DMA engines.},\n day = {7},\n doi = {10.1145/3230543.3230560},\n isbn = {9781450355674},\n publicationstatus = {published},\n}\n\n","author_short":["Neugebauer, R.","Antichi, G.","Zazo, J. F.","Audzevich, Y.","López-Buedo, S.","Moore, A. W."],"key":"Neugebauer2018Understanding","id":"Neugebauer2018Understanding","bibbaseid":"neugebauer-antichi-zazo-audzevich-lpezbuedo-moore-understandingpcieperformanceforendhostnetworking-2018","role":"author","urls":{},"metadata":{"authorlinks":{"moore, a":"https://www.cl.cam.ac.uk/~awm22/publications/index.html"}},"downloads":2,"html":""},"bibtype":"inproceedings","biburl":"www.cl.cam.ac.uk/~awm22/tmp/jabref-master.bib","creationDate":"2020-02-24T22:29:03.614Z","downloads":2,"keywords":[],"search_terms":["understanding","pcie","performance","end","host","networking","neugebauer","antichi","zazo","audzevich","lópez-buedo","moore"],"title":"Understanding PCIe performance for end host networking","year":2018,"dataSources":["P4oCWtW2FaLzS5ueC","Z2XGYjE5q4GTmpP4E","LoreFsEEN9tXGMfht","d8e849HkrdPEWFP7g"]}