Networks with group metadata

Description

Snapshots of the LiveJournal, Friendster, Orkut, and YouTube online social networks, as well as the DBLP co-authorship and Amazon co-purchasing networks. Node metadata represents a post hoc definition of a 'community' that a node belongs to, derived from topical labels of the node or interest-based 'groups' that a node links to. [1]

Friendster is an on-line gaming network. Before re-launching as a game website, Friendster was a social networking site where users could form friendship edges with each other. The Friendster social network also allowed users to form groups which other members could then join. We consider such user-defined groups as communities. For the social network, we take the induced subgraph of the nodes that either belong to at least one community or are connected to other nodes that belong to at least one community. This data is provided by The Web Archive Project, where the full graph is available.
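The induced-subgraph step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used to build the dataset: the function name `induced_community_subgraph`, the edge-list input, and the dictionary of community memberships are all hypothetical.

```python
def induced_community_subgraph(edges, communities):
    """Keep nodes that belong to a community or neighbour a node that does.

    edges: iterable of (u, v) pairs for an undirected graph (assumed format).
    communities: dict mapping a community name to a set of member nodes
                 (assumed format).
    """
    # Every node that belongs to at least one community.
    members = set().union(*communities.values())

    # Add every node adjacent to a community member.
    keep = set(members)
    for u, v in edges:
        if u in members:
            keep.add(v)
        if v in members:
            keep.add(u)

    # Induced subgraph: only edges with both endpoints in the kept set.
    kept_edges = [(u, v) for u, v in edges if u in keep and v in keep]
    return keep, kept_edges


# Toy example: node 1 is the only community member on a path 1-2-3-4-5.
edges = [(1, 2), (2, 3), (3, 4), (4, 5)]
communities = {"chess": {1}}
nodes, sub_edges = induced_community_subgraph(edges, communities)
```

In the toy path graph, only node 1 and its direct neighbour survive, so every edge further along the path is dropped.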

LiveJournal is a free on-line blogging community where users declare friendship with each other. LiveJournal also allows users to form groups which other members can then join.

Orkut is a free on-line social network where users form friendships with each other. Orkut also allows users to form groups which other members can then join. This data is provided by Alan Mislove et al.

YouTube is a video-sharing web site that includes a social network. In the YouTube social network, users form friendships with each other and can create groups which other users can join. This data is provided by Alan Mislove et al.

The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. We construct a co-authorship network where two authors are connected if they have published at least one paper together. Each publication venue, e.g., a journal or conference, defines an individual community: the authors who have published in a given journal or conference form a community.
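As a rough sketch of this construction, the snippet below builds co-authorship edges and venue communities from a list of papers. The input format (a list of `(authors, venue)` pairs) and the function name `build_coauthorship` are assumptions made for illustration; they do not reflect the actual DBLP data format.

```python
from collections import defaultdict
from itertools import combinations


def build_coauthorship(papers):
    """Return (edges, communities) from (authors, venue) records.

    papers: iterable of (list_of_author_names, venue_name) pairs
            (hypothetical input format).
    """
    edges = set()
    communities = defaultdict(set)
    for authors, venue in papers:
        unique = sorted(set(authors))
        # Connect every pair of co-authors; sorting keeps (a, b) canonical,
        # so each undirected edge is stored only once.
        for a, b in combinations(unique, 2):
            edges.add((a, b))
        # Everyone who published at this venue joins its community.
        communities[venue].update(unique)
    return edges, dict(communities)


papers = [
    (["ann", "bob", "cy"], "KDD"),
    (["bob", "dee"], "ICML"),
    (["ann", "bob"], "ICML"),
]
edges, communities = build_coauthorship(papers)
```

Because edges are stored in a set, authors who wrote several papers together still share a single edge, which matches the "at least one paper" rule.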

The Amazon network was collected by crawling the Amazon website. It is based on the "Customers Who Bought This Item Also Bought" feature. If a product i is frequently co-purchased with product j, the graph contains an undirected edge between i and j. Each product category provided by Amazon defines a community.
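The step from "also bought" lists to undirected edges can be illustrated with a short sketch. The `also_bought` mapping and the function name `copurchase_edges` are hypothetical, and the source does not say what threshold counts as "frequently" co-purchased, so this example simply treats every listed pair as an edge.

```python
def copurchase_edges(also_bought):
    """Collapse per-product 'also bought' lists into undirected edges.

    also_bought: dict mapping a product id to a list of co-purchased
                 product ids (assumed input format).
    """
    edges = set()
    for item, others in also_bought.items():
        for other in others:
            if other != item:  # ignore self-references
                # Sort the pair so (i, j) and (j, i) collapse to one edge.
                edges.add(tuple(sorted((item, other))))
    return edges


also_bought = {"A": ["B", "C"], "B": ["A"], "C": []}
edges = copurchase_edges(also_bought)
```

Here the reciprocal listings A→B and B→A yield one undirected edge.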

1. Description obtained from the ICON project.

Networks
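Name | Nodes | Edges | ⟨k⟩ | σ_k | λ_h | τ | r | c | ⌀ | S | Kind | Mode | NPs | EPs | gt | GraphML | GML | csv
--- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | ---
friendster | 65,608,366 | 1,806,067,135 | 55.06 | 137.81 | 830.62 | 2153.71 | 0.19 | 0.02 | 37 | 1.00 | Undirected | Unipartite | name, communities | | 6.177 GiB | 9.254 GiB | 9.301 GiB | 9.250 GiB
livejournal | 3,997,962 | 34,681,189 | 17.35 | 42.96 | 447.52 | 8545.04 | 0.05 | 0.13 | 21 | 1.00 | Undirected | Unipartite | name, communities | | 109.6 MiB | 187.9 MiB | 183.0 MiB | 178.6 MiB
orkut | 3,072,441 | 117,185,083 | 76.28 | 154.78 | 776.75 | 190.23 | 0.02 | 0.04 | 9 | 1.00 | Undirected | Unipartite | name, communities | | 556.1 MiB | 781.3 MiB | 767.1 MiB | 884.4 MiB
youtube | 1,134,890 | 2,987,624 | 5.27 | 50.75 | 185.14 | 492.62 | -0.04 | 0.01 | 24 | 1.00 | Undirected | Unipartite | name, communities | | 9.3 MiB | 16.2 MiB | 15.0 MiB | 16.5 MiB
dblp | 317,080 | 1,049,866 | 6.62 | 10.01 | 114.72 | 372.79 | 0.27 | 0.31 | 23 | 1.00 | Undirected | Unipartite | name, communities | | 7.7 MiB | 11.0 MiB | 9.6 MiB | 9.5 MiB
amazon | 334,863 | 925,872 | 5.53 | 5.76 | 20.74 | 10782.00 | -0.06 | 0.21 | 47 | 1.00 | Undirected | Unipartite | name, communities | | 10.0 MiB | 14.0 MiB | 13.8 MiB | 14.8 MiB

Columns: ⟨k⟩ is the mean degree, σ_k the standard deviation of the degree, λ_h the largest eigenvalue of the non-backtracking (Hashimoto) matrix, τ the random-walk mixing time, r the degree assortativity, c the global clustering coefficient, ⌀ the pseudo-diameter, and S the fraction of nodes in the largest component. NPs and EPs list node and edge properties. The last four columns give download sizes per file format.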
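In every row, the value that immediately follows the edge count is consistent with the mean degree of an undirected graph, 2E/N. The short check below recomputes it from the node and edge counts listed above; the dictionary layout and the helper name `mean_degree` are illustrative choices only.

```python
# (nodes, edges) copied from the table above.
counts = {
    "friendster": (65_608_366, 1_806_067_135),
    "livejournal": (3_997_962, 34_681_189),
    "orkut": (3_072_441, 117_185_083),
    "youtube": (1_134_890, 2_987_624),
    "dblp": (317_080, 1_049_866),
    "amazon": (334_863, 925_872),
}


def mean_degree(num_nodes, num_edges):
    """Mean degree of an undirected graph: each edge adds 2 to the degree sum."""
    return 2 * num_edges / num_nodes


# Round to two decimals, the precision used in the table.
recomputed = {name: round(mean_degree(n, m), 2) for name, (n, m) in counts.items()}
```

The same identity is a quick sanity check after loading any of these networks: if the loaded graph's mean degree differs from the listed value, edges were probably duplicated or dropped during parsing.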