Problems with this dataset? Open an issue.
You may also take a look at the source code.
The networks in this dataset can be loaded directly from graph-tool with:(and likewise for the other networks available.)import graph_tool.all as gt g = gt.collection.ns["soc_net_comms/friendster"]
Snapshots of LiveJournal, Friendster, Orkut, and YouTube online social networks, as well as DBLP and Amazon. Node metadata represents a post hoc definition of a 'community' that a node belongs to, derived from topical labels of the node or interest-based 'groups' that a node links to.1
Friendster is an on-line gaming network. Before re-launching as a game website, Friendster was a social networking site where users can form friendship edge each other. Friendster social network also allows users form a group which other members can then join. We consider such user-defined groups as communities. For the social network, we take the induced subgraph of the nodes that either belong to at least one community or are connected to other nodes that belong to at least one community. This data is provided by The Web Archive Project, where the full graph is available.
LiveJournal is a free on-line blogging community where users declare friendship each other. LiveJournal also allows users form a group which other members can then join.
Orkut is a free on-line social network where users form friendship each other. Orkut also allows users form a group which other members can then join. This data is provided by Alan Mislove et al.
Youtube is a video-sharing web site that includes a social network. In the Youtube social network, users form friendship each other and users can create groups which other users can join. This data is provided by Alan Mislove et al.
The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. We construct a co-authorship network where two authors are connected if they publish at least one paper together. Publication venue, e.g, journal or conference, defines an individual community; authors who published to a certain journal or conference form a community.
The Amazon Network was collected by crawling the website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently co-purchased with product j, the graph contains an undirected edge from i to j. Each product category provided by Amazon defines each community.
Name | Nodes | Edges | $\left<k\right>$ | $\sigma_k$ | $\lambda_h$ | $\tau$ | $r$ | $c$ | $\oslash$ | $S$ | Kind | Mode | NPs | EPs | gt | GraphML | GML | csv |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
friendster | 65,608,366 | 1,806,067,135 | 55.06 | 137.81 | 830.62 | 2153.71 | 0.19 | 0.02 | 37 | 1.00 | Undirected | Unipartite | name communities | 6.177 GiB | 9.254 GiB | 9.256 GiB | 9.250 GiB | |
livejournal | 3,997,962 | 34,681,189 | 17.35 | 42.96 | 447.52 | 8545.04 | 0.05 | 0.13 | 21 | 1.00 | Undirected | Unipartite | name communities | 109.6 MiB | 187.9 MiB | 181.8 MiB | 178.6 MiB | |
orkut | 3,072,441 | 117,185,083 | 76.28 | 154.78 | 776.75 | 190.23 | 0.02 | 0.04 | 9 | 1.00 | Undirected | Unipartite | name communities | 556.1 MiB | 781.3 MiB | 765.1 MiB | 884.4 MiB | |
youtube | 1,134,890 | 2,987,624 | 5.27 | 50.75 | 185.14 | 492.62 | -0.04 | 0.01 | 24 | 1.00 | Undirected | Unipartite | name communities | 9.3 MiB | 16.2 MiB | 15.1 MiB | 16.5 MiB | |
dblp | 317,080 | 1,049,866 | 6.62 | 10.01 | 114.72 | 372.79 | 0.27 | 0.31 | 23 | 1.00 | Undirected | Unipartite | name communities | 7.7 MiB | 11.0 MiB | 9.6 MiB | 9.5 MiB | |
amazon | 334,863 | 925,872 | 5.53 | 5.76 | 20.74 | 10782.00 | -0.06 | 0.21 | 47 | 1.00 | Undirected | Unipartite | name communities | 10.0 MiB | 14.0 MiB | 13.8 MiB | 14.8 MiB |