Census and Survey of the Visible Internet (extended). Heidemann, J., Pradkin, Y., Govindan, R., Papadopoulos, C., Bartlett, G., & Bannister, J. Technical Report ISI-TR-2008-649, USC/Information Sciences Institute, February, 2008.
Census and Survey of the Visible Internet (extended) [link]Paper  abstract   bibtex   
Prior measurement studies of the Internet have explored traffic and topology, but have largely ignored edge hosts. While the number of Internet hosts is very large, and many are hidden behind firewalls or in private address space, there is much to be learned from examining the population of \emphvisible hosts, those with public unicast addresses that respond to messages. In this paper we introduce two new approaches to explore the visible Internet. Applying statistical population sampling, we use \emphcensuses to walk the entire Internet address space, and \emphsurveys to probe frequently a fraction of that space. We then use these tools to evaluate address usage, where we find that only 3.6% of allocated addresses are actually occupied by visible hosts, and that occupancy is unevenly distributed, with a quarter of responsive /24 address blocks (subnets) less than 5% full, and only 9% of blocks more than half full. We also show that only about 34%7emillion (about 16% of responsive addresses) are very stable, while the rest are used intermittently, with a median occupancy of 81 minutes. Finally, we show that many firewalls are visible, measuring significant diversity in the distribution of firewalled block size. To our knowledge, we are the first to take a census of edge hosts in the visible Internet since 1982, to evaluate the accuracy of active probing for address census and survey, and to quantify these aspects of the Internet.
@TechReport{Heidemann08a_200802,
	author = 	"John Heidemann and Yuri Pradkin and Ramesh
                         Govindan and Christos Papadopoulos and 
                         Genevieve Bartlett and Joseph Bannister",
	title = 	"Census and Survey of the Visible Internet (extended)",
	institution = 	"USC/Information Sciences Institute",
	year = 		2008,
	sortdate = 		"2008-02-01",
	project = "ant, lander, madcat",
	jsubject = "omitted",
	number =	"ISI-TR-2008-649",
	month =		feb,
	location =	"johnh: pafile",
	keywords =	"census, survey, internet address space",
	url =		"http://www.isi.edu/%7ejohnh/PAPERS/Heidemann08a_200802.html",
	pdfurl =	"http://www.isi.edu/%7ejohnh/PAPERS/Heidemann08a_200802.pdf",
	myorganization =	"USC/Information Sciences Institute",
	copyrightholder = "authors",
	abstract = "
Prior measurement studies of the Internet have explored traffic and
topology, but have largely ignored edge hosts.  While the number of
Internet hosts is very large, and many are hidden behind firewalls or
in private address space, there is much to be learned from examining
the population of \emph{visible} hosts, those with public unicast
addresses that respond to messages.  In this paper we introduce two
new approaches to explore the visible Internet.  Applying statistical
population sampling, we use \emph{censuses} to walk the entire
Internet address space, and \emph{surveys} to probe frequently a
fraction of that space.  We then use these tools to evaluate address
usage, where we find that only 3.6\% of allocated addresses are
actually occupied by visible hosts, and that occupancy is unevenly
distributed, with a quarter of responsive /24 address blocks (subnets)
less than 5\% full, and only 9\% of blocks more than half full.  We
also show that only about 34%7emillion (about 16\% of responsive
addresses) are very stable, while the rest are used intermittently,
with a median occupancy of 81 minutes.  Finally, we show that many
firewalls are visible, measuring significant diversity in the
distribution of firewalled block size.  To our knowledge, we are the
first to take a census of edge hosts in the visible Internet since
1982, to evaluate the accuracy of active probing for address census
and survey, and to quantify these aspects of the Internet.
",
}

Downloads: 0