logo Netzschleuder network catalogue, repository and centrifuge

Problems with this dataset? Open an issue.
You may also take a look at the source code.
The network in this dataset can be loaded directly from graph-tool with:
import graph_tool.all as gt
g = gt.collection.ns["google_web"]

google_web — Old Google web graph (2002)


A web graph representing a crawl of a portion of the general WWW, from a 2002 Google Programming contest.1

  1. Description obtained from the ICON project. 

Informational Web graph Unweighted
  • J. Leskovec, K. Lang, A. Dasgupta, M. Mahoney. "Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters." Internet Mathematics 6(1) 29--123 (2009), http://arxiv.org/abs/0810.1355
Upstream URL OK
Tip: hover your mouse over a table header to obtain a legend.
Name Nodes Edges $\left<k\right>$ $\sigma_k$ $\lambda_h$ $\tau$ $r$ $c$ $\oslash$ $S$ Kind Mode NPs EPs gt GraphML GML csv
google_web 916,428 5,105,039 5.57 39.82 101.24 3384.37 -0.05 0.09 24 0.93 Directed Unipartite 21.5 MiB 41.0 MiB 40.3 MiB 38.9 MiB
None drawing