A comparison of sampling techniques for web graph characterization. Becchetti, L., Castillo, C., & Donato, D. In LinkKDD, PA, USA, 2006.
We present a detailed statistical analysis of the characteristics of partial Web graphs obtained by sub-sampling a large collection of Web pages. We show that in general the macroscopic properties of the Web are better represented by a shallow exploration of a large number of sites than by a deep exploration of a limited set of sites. We also describe and quantify the bias induced by the different sampling strategies, and show that it can be significant even if the sample covers a large fraction of the collection.
@inproceedings{ Becchetti2006,
  abstract = {We present a detailed statistical analysis of the characteristics of partial Web graphs obtained by sub-sampling a large collection of Web pages. We show that in general the macroscopic properties of the Web are better represented by a shallow exploration of a large number of sites than by a deep exploration of a limited set of sites. We also describe and quantify the bias induced by the different sampling strategies, and show that it can be significant even if the sample covers a large fraction of the collection.},
  address = {PA, USA},
  author = {Becchetti, Luca and Castillo, Carlos and Donato, D},
  booktitle = {LinkKDD},
  file = {references/AComparisonOfSamplingTechniquesForWebGraphCharacterization.pdf},
  keywords = {BFS bias,sampling techniques,web,www},
  mendeley-tags = {BFS bias,sampling techniques,web,www},
  title = {{A comparison of sampling techniques for web graph characterization}},
  url = {http://ailab.ijs.si/dunja/LinkKDD2006/Papers/becchetti.pdf http://www.chato.cl/papers/donato_2006_comparing_sampling_techniques.pdf http://ailab.ijs.si/dunja/linkkdd2006/Papers/becchetti.pdf http://academic.research.microsoft.com/Publication/4568540/a-comparison-of-sampling-techniques-for-web-graph-characterization},
  year = {2006}
}
