# Netzschleuder network catalogue, repository and centrifuge

The networks in this dataset can be loaded directly from graph-tool with:

```python
import graph_tool.all as gt
g = gt.collection.ns["soc_net_comms/friendster"]
```

(and likewise for the other networks available.)
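As a minimal sketch of what "likewise" means here, the snippet below builds the lookup key for each network listed on this page. The `collection/network` key pattern is inferred from the single example above, and `NETWORKS`, `dataset_key`, and `load` are hypothetical helper names, not part of graph-tool. Calling `load` assumes graph-tool is installed and that the download succeeds; note that the larger networks are several GiB in size.

```python
# Network names as listed in the table on this page.
NETWORKS = ["friendster", "livejournal", "orkut", "youtube", "dblp", "amazon"]


def dataset_key(network):
    """Build the key used to index gt.collection.ns (pattern assumed from the example above)."""
    if network not in NETWORKS:
        raise KeyError(f"unknown network: {network!r}")
    return f"soc_net_comms/{network}"


def load(network):
    """Fetch one network. graph-tool is imported lazily, so the helper
    above can be used without it being installed."""
    import graph_tool.all as gt
    return gt.collection.ns[dataset_key(network)]


if __name__ == "__main__":
    for name in NETWORKS:
        print(dataset_key(name))
```

Starting with one of the smaller networks, for instance `load("dblp")`, is probably the gentlest way to try this out.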

# soc_net_comms — Networks with group metadata

Description

Snapshots of LiveJournal, Friendster, Orkut, and YouTube online social networks, as well as DBLP and Amazon. Node metadata represents a post hoc definition of a 'community' that a node belongs to, derived from topical labels of the node or interest-based 'groups' that a node links to. [1]

Friendster is an on-line gaming network. Before relaunching as a game website, Friendster was a social networking site where users could form friendship edges with each other. The Friendster social network also allowed users to form groups that other members could then join. We treat such user-defined groups as communities. For the social network, we take the induced subgraph of the nodes that either belong to at least one community or are connected to other nodes that belong to at least one community. This data is provided by The Web Archive Project, where the full graph is available.
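The induced-subgraph rule described above can be illustrated with a small pure-Python sketch. This is not the original authors' code; the function name `community_induced_subgraph` and the toy data are hypothetical, and the input is assumed to be an edge list plus a mapping from community name to member set.

```python
def community_induced_subgraph(edges, communities):
    """Keep nodes that are in at least one community, or adjacent to such a node.

    edges: list of (u, v) pairs of an undirected graph.
    communities: dict mapping a community name to a set of member nodes.
    Returns (kept_nodes, kept_edges).
    """
    # Every node that belongs to at least one community.
    members = set().union(*communities.values())

    # Every node adjacent to at least one community member.
    neighbours = set()
    for u, v in edges:
        if u in members:
            neighbours.add(v)
        if v in members:
            neighbours.add(u)

    keep = members | neighbours
    # Induced subgraph: only edges with both endpoints in the kept set.
    kept_edges = [(u, v) for u, v in edges if u in keep and v in keep]
    return keep, kept_edges


# Toy path graph 1-2-3-4-5 with a single community {1, 2}.
edges = [(1, 2), (2, 3), (3, 4), (4, 5)]
communities = {"chess": {1, 2}}
nodes, sub_edges = community_induced_subgraph(edges, communities)
print(sorted(nodes), sub_edges)
```

In the toy example, node 3 is retained because it neighbours a community member, while nodes 4 and 5 are dropped along with their edges.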

LiveJournal is a free on-line blogging community where users declare friendship with each other. LiveJournal also allows users to form groups that other members can then join.

Orkut is a free on-line social network where users form friendships with each other. Orkut also allows users to form groups that other members can then join. This data is provided by Alan Mislove et al.

YouTube is a video-sharing web site that includes a social network. In the YouTube social network, users form friendships with each other and can create groups that other users can join. This data is provided by Alan Mislove et al.

The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. We construct a co-authorship network where two authors are connected if they have published at least one paper together. Each publication venue, e.g., a journal or conference, defines an individual community: authors who published in a given journal or conference form a community.
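A minimal sketch of this construction, assuming the input is a list of `(venue, authors)` records; the function name `coauthorship_network` and the toy records are hypothetical and only illustrate the rule stated above.

```python
from itertools import combinations


def coauthorship_network(papers):
    """Build co-authorship edges and venue communities.

    papers: list of (venue, [author, ...]) records.
    Returns (edges, communities), where edges is a set of sorted author
    pairs and communities maps each venue to the set of its authors.
    """
    edges = set()
    communities = {}
    for venue, authors in papers:
        unique = sorted(set(authors))
        # Two authors are connected if they share at least one paper;
        # using a set of sorted pairs avoids duplicate undirected edges.
        for a, b in combinations(unique, 2):
            edges.add((a, b))
        # Everyone who published in this venue joins its community.
        communities.setdefault(venue, set()).update(unique)
    return edges, communities


papers = [
    ("VenueA", ["ann", "bob", "cy"]),
    ("VenueB", ["bob", "dee"]),
    ("VenueA", ["ann", "bob"]),
]
edges, communities = coauthorship_network(papers)
print(sorted(edges))
print({venue: sorted(members) for venue, members in communities.items()})
```

Because an author can publish in several venues, the resulting communities may overlap, as `bob` does in the toy data.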

The Amazon network was collected by crawling the Amazon website. It is based on the "Customers Who Bought This Item Also Bought" feature. If a product i is frequently co-purchased with product j, the graph contains an undirected edge between i and j. Each product category provided by Amazon defines a community.

1. Description obtained from the ICON project.

Tags
Online, Social, Collaboration, Informational, Relatedness, Unweighted, Metadata
Upstream URL OK
https://snap.stanford.edu/data/com-Friendster.html
Networks
| Name | Nodes | Edges | $\left<k\right>$ | $\sigma_k$ | $\lambda_h$ | $\tau$ | $r$ | $c$ | $\oslash$ | $S$ | Kind | Mode | NPs | EPs | gt | GraphML | GML | csv |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| friendster | 65,608,366 | 1,806,067,135 | 55.06 | 137.81 | 830.62 | 2153.71 | 0.19 | 0.02 | 37 | 1.00 | Undirected | Unipartite | name, communities | | 6.177 GiB | 9.254 GiB | 9.301 GiB | 9.250 GiB |
| livejournal | 3,997,962 | 34,681,189 | 17.35 | 42.96 | 447.52 | 8545.04 | 0.05 | 0.13 | 21 | 1.00 | Undirected | Unipartite | name, communities | | 109.6 MiB | 187.9 MiB | 183.0 MiB | 178.6 MiB |
| orkut | 3,072,441 | 117,185,083 | 76.28 | 154.78 | 776.75 | 190.23 | 0.02 | 0.04 | 9 | 1.00 | Undirected | Unipartite | name, communities | | 556.1 MiB | 781.3 MiB | 767.1 MiB | 884.4 MiB |
| youtube | 1,134,890 | 2,987,624 | 5.27 | 50.75 | 185.14 | 492.62 | -0.04 | 0.01 | 24 | 1.00 | Undirected | Unipartite | name, communities | | 9.3 MiB | 16.2 MiB | 15.0 MiB | 16.5 MiB |
| dblp | 317,080 | 1,049,866 | 6.62 | 10.01 | 114.72 | 372.79 | 0.27 | 0.31 | 23 | 1.00 | Undirected | Unipartite | name, communities | | 7.7 MiB | 11.0 MiB | 9.6 MiB | 9.5 MiB |
| amazon | 334,863 | 925,872 | 5.53 | 5.76 | 20.74 | 10782.00 | -0.06 | 0.21 | 47 | 1.00 | Undirected | Unipartite | name, communities | | 10.0 MiB | 14.0 MiB | 13.8 MiB | 14.8 MiB |
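As a quick sanity check on the table, and assuming the $\left<k\right>$ column denotes the mean degree, the value for an undirected graph with $N$ nodes and $E$ edges should equal $2E/N$. The sketch below recomputes it from the Nodes and Edges columns; the `TABLE` dictionary and `mean_degree` helper are hypothetical names introduced for illustration.

```python
# (nodes, edges, reported <k>) copied from the table on this page.
TABLE = {
    "friendster": (65_608_366, 1_806_067_135, 55.06),
    "livejournal": (3_997_962, 34_681_189, 17.35),
    "orkut": (3_072_441, 117_185_083, 76.28),
    "youtube": (1_134_890, 2_987_624, 5.27),
    "dblp": (317_080, 1_049_866, 6.62),
    "amazon": (334_863, 925_872, 5.53),
}


def mean_degree(num_nodes, num_edges):
    """Mean degree of an undirected graph: each edge contributes to two degrees."""
    return 2 * num_edges / num_nodes


for name, (n, m, reported) in TABLE.items():
    computed = mean_degree(n, m)
    print(f"{name:12s} computed={computed:.2f} reported={reported:.2f}")
```

The recomputed values should agree with the reported column to two decimal places, which supports reading $\left<k\right>$ as the mean degree.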