Online Anomaly Detection Leveraging Stream-Based Clustering and Real-Time Telemetry. Putina, A. & Rossi, D. IEEE Transactions on Network and Service Management, 18(1):839–854, March, 2021. Conference Name: IEEE Transactions on Network and Service Management
doi  abstract   bibtex   
Recent technology evolution allows network equipment to continuously stream a wealth of “telemetry” information, which pertains to multiple protocols and layers of the stack, at a very fine spatial-grain and high-frequency. This deluge of telemetry data clearly offers new opportunities for network control and troubleshooting, but also poses a serious challenge for what concerns its real-time processing. We tackle this challenge by applying streaming machine-learning techniques to the continuous flow of control and data-plane telemetry data, with the purpose of real-time detection of anomalies. In particular, we implement an anomaly detection engine that leverages DenStream, an unsupervised clustering technique, and apply it to features collected from a large-scale testbed comprising tens of routers traversed up to 3Terabit/s worth of real application traffic. We contrast DenStream with offline algorithms such as DBScan and Local Outlier Factor (LOF), as well as online algorithms such as the windowed version of DBScan, ExactSTORM, Continuous Outlier Detection (COD) and Robust Random Cut Forest (RRCF). Our experimental campaign compares these seven algorithms under both accuracy and computational complexity viewpoints: results testify that DenStream (i) achieves detection results on par with RRCF, the best performing algorithm and (ii) is significantly faster than other approaches, notably over two orders of magnitude faster than RRCF. In spirit with the recent trend toward reproducibility of results, we make our code available as open source to the scientific community.
@article{putina_online_2021,
	title = {Online {Anomaly} {Detection} {Leveraging} {Stream}-{Based} {Clustering} and {Real}-{Time} {Telemetry}},
	volume = {18},
	issn = {1932-4537},
	doi = {10.1109/TNSM.2020.3037019},
	abstract = {Recent technology evolution allows network equipment to continuously stream a wealth of “telemetry” information, which pertains to multiple protocols and layers of the stack, at a very fine spatial-grain and high-frequency. This deluge of telemetry data clearly offers new opportunities for network control and troubleshooting, but also poses a serious challenge for what concerns its real-time processing. We tackle this challenge by applying streaming machine-learning techniques to the continuous flow of control and data-plane telemetry data, with the purpose of real-time detection of anomalies. In particular, we implement an anomaly detection engine that leverages DenStream, an unsupervised clustering technique, and apply it to features collected from a large-scale testbed comprising tens of routers traversed up to 3Terabit/s worth of real application traffic. We contrast DenStream with offline algorithms such as DBScan and Local Outlier Factor (LOF), as well as online algorithms such as the windowed version of DBScan, ExactSTORM, Continuous Outlier Detection (COD) and Robust Random Cut Forest (RRCF). Our experimental campaign compares these seven algorithms under both accuracy and computational complexity viewpoints: results testify that DenStream (i) achieves detection results on par with RRCF, the best performing algorithm and (ii) is significantly faster than other approaches, notably over two orders of magnitude faster than RRCF. In spirit with the recent trend toward reproducibility of results, we make our code available as open source to the scientific community.},
	number = {1},
	journal = {IEEE Transactions on Network and Service Management},
	author = {Putina, Andrian and Rossi, Dario},
	month = mar,
	year = {2021},
	note = {Conference Name: IEEE Transactions on Network and Service Management},
	keywords = {Anomaly detection, Anomaly detection algorithms, Feature extraction, Principal component analysis, Protocols, Real-time systems, Support vector machines, Telemetry, machine learning, model driven telemetry, network monitoring and measurements, stream learning},
	pages = {839--854},
}

Downloads: 0