Improving the Optics of Active Outage Detection (extended). Baltra, G. & Heidemann, J. Technical Report ISI-TR-733, USC/Information Sciences Institute, May, 2019.
Improving the Optics of Active Outage Detection (extended) [link]Paper  abstract   bibtex   
There is a growing interest in carefully observing the reliability of the Internet's edge. Outage information can inform our understanding of Internet reliability and planning, and it can help guide operations. Outage detection algorithms using active probing from third parties have been shown to be accurate for most of the Internet, but inaccurate for blocks that are sparsely occupied. Our contributions include a definition of outages, which we use to determine how many independent observers are required to determine global outages. We propose a new \emphFull Block Scanning (FBS) algorithm that gathers more information for sparse blocks to reduce false outage reports. We also propose \emphISP Availability Sensing (IAS) to detect maintenance activity using only external information. We study a year of outage data and show that FBS has a True Positive Rate of 86%, and show that IAS detects maintenance events in a large U.S. ISP.
@TechReport{Baltra19a,
	author = 	"Guillermo Baltra and John Heidemann",
	title = 	"Improving the Optics of Active Outage Detection (extended)",
	institution = 	"USC/Information Sciences Institute",
	year = 		2019,
	sortdate = 		"2018-05-16", 
	project = "ant, lacanic, divoice, iiovadr",
	jsubject = "routing",
	number =	"ISI-TR-733",
	month =		may,
	jlocation =	"johnh: pafile",
	keywords =	"network outage detection",
	url =		"https://ant.isi.edu/%7ejohnh/PAPERS/Baltra19a.html",
	pdfurl =	"https://ant.isi.edu/%7ejohnh/PAPERS/Baltra19a.pdf",
	myorganization =	"USC/Information Sciences Institute",
	copyrightholder = "authors",
	abstract = "
There is a growing interest in carefully observing the reliability of
the Internet's edge.  Outage information can inform our understanding
of Internet reliability and planning, and it can help guide
operations.  Outage detection algorithms using active probing from
third parties have been shown to be accurate for most of the Internet,
but inaccurate for blocks that are sparsely occupied.  Our
contributions include a definition of outages, which we use to
determine how many independent observers are required to determine
global outages.  We propose a new \emph{Full Block Scanning} (FBS)
algorithm that gathers more information for sparse blocks to reduce
false outage reports.  We also propose \emph{ISP Availability Sensing}
(IAS) to detect maintenance activity using only external information.
We study a year of outage data and show that FBS has a True Positive
Rate of 86\%, and show that IAS detects maintenance events in a large
U.S.~ISP.
",
}

Downloads: 0