Server-side Prediction of Source IP Addresses using Density Estimation

Server-side Prediction of Source IP Addresses using Density Estimation. Goldstein, M., Reif, M., Stahl, A., & Breuel, T. In Fourth International Conference on Availability, Reliability and Security, pages 82-89, 2009.

Paper abstract bibtex

Source IP addresses are often used as a major feature for user modeling in computer networks. Particularly in the field of Distributed Denial of Service (DDoS) attack detection and mitigation traffic models make extensive use of source IP addresses for detecting anomalies. Typically the real IP address distribution is strongly undersampled due to a small amount of observations. Density estimation overcomes this shortage by taking advantage of IP neighborhood relations. In many cases simple models are implicitly used or chosen intuitively as a network based heuristic. In this paper we review and formalize existing models including a hierarchical clustering approach first. In addition, we present a modified k-means clustering algorithm for source IP density estimation as well as a statistical motivated smoothing approach using the Nadaraya-Watson kernel-weighted average. For performance evaluation we apply all methods on a 90 days real world dataset consisting of 1.3 million different source IP addresses and try to predict the users of the following next 10 days. ROC curves and an example DDoS mitigation scenario show that there is no uniformly better approach: k-means performs best when a high detection rate is needed whereas statistical smoothing works better for low false alarm rate requirements like the DDoS mitigation scenario.

@inproceedings{ mendeley_3976653282,
  author    = {Markus Goldstein and Matthias Reif and Armin Stahl and Thomas Breuel},
  title     = {Server-side Prediction of Source IP Addresses using Density Estimation},
  series   = {Fourth International Conference on Availability, Reliability and Security}, 
  abstract   = {Source IP addresses are often used as a major feature for user modeling in computer networks. Particularly in the field of Distributed Denial of Service (DDoS) attack detection and mitigation traffic models make extensive use of source IP addresses for detecting anomalies. Typically the real IP address distribution is strongly undersampled due to a small amount of observations. Density estimation overcomes this shortage by taking advantage of IP neighborhood relations. In many cases simple models are implicitly used or chosen intuitively as a network based heuristic. In this paper we review and formalize existing models including a hierarchical clustering approach first. In addition, we present a modified k-means clustering algorithm for source IP density estimation as well as a statistical motivated smoothing approach using the Nadaraya-Watson kernel-weighted average. For performance evaluation we apply all methods on a 90 days real world dataset consisting of 1.3 million different source IP addresses and try to predict the users of the following next 10 days. ROC curves and an example DDoS mitigation scenario show that there is no uniformly better approach: k-means performs best when a high detection rate is needed whereas statistical smoothing works better for low false alarm rate requirements like the DDoS mitigation scenario.},
  booktitle   = {Fourth International Conference on Availability, Reliability and Security},
  pages   = {82-89},
  url   = {http://madm.dfki.de/publication&amp;pubid=4111} ,
  year   = {2009}
}

Downloads: 0

{"_id":{"_str":"53429bec0e946d920a00224a"},"__v":22,"authorIDs":["5457f0e12abc8e9f370008ea","545f592f6aaec20d23000bcc","546ecea75ac8e5e30d00000c"],"author_short":["Goldstein, M.","Reif, M.","Stahl, A.","Breuel, T."],"bibbaseid":"goldstein-reif-stahl-breuel-serversidepredictionofsourceipaddressesusingdensityestimation-2009","bibdata":{"bibtype":"inproceedings","type":"inproceedings","author":[{"firstnames":["Markus"],"propositions":[],"lastnames":["Goldstein"],"suffixes":[]},{"firstnames":["Matthias"],"propositions":[],"lastnames":["Reif"],"suffixes":[]},{"firstnames":["Armin"],"propositions":[],"lastnames":["Stahl"],"suffixes":[]},{"firstnames":["Thomas"],"propositions":[],"lastnames":["Breuel"],"suffixes":[]}],"title":"Server-side Prediction of Source IP Addresses using Density Estimation","series":"Fourth International Conference on Availability, Reliability and Security","abstract":"Source IP addresses are often used as a major feature for user modeling in computer networks. Particularly in the field of Distributed Denial of Service (DDoS) attack detection and mitigation traffic models make extensive use of source IP addresses for detecting anomalies. Typically the real IP address distribution is strongly undersampled due to a small amount of observations. Density estimation overcomes this shortage by taking advantage of IP neighborhood relations. In many cases simple models are implicitly used or chosen intuitively as a network based heuristic. In this paper we review and formalize existing models including a hierarchical clustering approach first. In addition, we present a modified k-means clustering algorithm for source IP density estimation as well as a statistical motivated smoothing approach using the Nadaraya-Watson kernel-weighted average. For performance evaluation we apply all methods on a 90 days real world dataset consisting of 1.3 million different source IP addresses and try to predict the users of the following next 10 days. ROC curves and an example DDoS mitigation scenario show that there is no uniformly better approach: k-means performs best when a high detection rate is needed whereas statistical smoothing works better for low false alarm rate requirements like the DDoS mitigation scenario.","booktitle":"Fourth International Conference on Availability, Reliability and Security","pages":"82-89","url":"http://madm.dfki.de/publication&pubid=4111","year":"2009","bibtex":"@inproceedings{ mendeley_3976653282,\n author = {Markus Goldstein and Matthias Reif and Armin Stahl and Thomas Breuel},\n title = {Server-side Prediction of Source IP Addresses using Density Estimation},\n series = {Fourth International Conference on Availability, Reliability and Security}, \n abstract = {Source IP addresses are often used as a major feature for user modeling in computer networks. Particularly in the field of Distributed Denial of Service (DDoS) attack detection and mitigation traffic models make extensive use of source IP addresses for detecting anomalies. Typically the real IP address distribution is strongly undersampled due to a small amount of observations. Density estimation overcomes this shortage by taking advantage of IP neighborhood relations. In many cases simple models are implicitly used or chosen intuitively as a network based heuristic. In this paper we review and formalize existing models including a hierarchical clustering approach first. In addition, we present a modified k-means clustering algorithm for source IP density estimation as well as a statistical motivated smoothing approach using the Nadaraya-Watson kernel-weighted average. For performance evaluation we apply all methods on a 90 days real world dataset consisting of 1.3 million different source IP addresses and try to predict the users of the following next 10 days. ROC curves and an example DDoS mitigation scenario show that there is no uniformly better approach: k-means performs best when a high detection rate is needed whereas statistical smoothing works better for low false alarm rate requirements like the DDoS mitigation scenario.},\n booktitle = {Fourth International Conference on Availability, Reliability and Security},\n pages = {82-89},\n url = {http://madm.dfki.de/publication&pubid=4111} ,\n year = {2009}\n}\n\n\n","author_short":["Goldstein, M.","Reif, M.","Stahl, A.","Breuel, T."],"key":"mendeley_3976653282","id":"mendeley_3976653282","bibbaseid":"goldstein-reif-stahl-breuel-serversidepredictionofsourceipaddressesusingdensityestimation-2009","role":"author","urls":{"Paper":"http://madm.dfki.de/publication&pubid=4111"},"downloads":0},"bibtype":"inproceedings","biburl":"http://data.bibbase.org/author/armin-stahl/?format=bibtex","downloads":0,"keywords":[],"search_terms":["server","side","prediction","source","addresses","using","density","estimation","goldstein","reif","stahl","breuel"],"title":"Server-side Prediction of Source IP Addresses using Density Estimation","year":2009,"dataSources":["nET7ozMv4rRPd4sNC"]}