Problems with this dataset? Open an issue.
You may also take a look at the source code.
The network in this dataset can be loaded directly from graph-tool with:import graph_tool.all as gt g = gt.collection.ns["wiki_article_words"]
A bipartite network of English Wikipedia articles and the words they contain. The edge weight gives the number of times a word appeared in the connected article.1
Name | Nodes | Edges | $\left<k\right>$ | $\sigma_k$ | $\lambda_h$ | $\tau$ | $r$ | $c$ | $\oslash$ | $S$ | Kind | Mode | NPs | EPs | gt | GraphML | GML | csv |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
wiki_article_words | 276,739 | 2,941,902 | 21.26 | 138.32 | 818.58 | 2.23 | -0.35 | 0.00 | 4 | 1.00 | Undirected | Bipartite | weight | 8.7 MiB | 17.9 MiB | 14.7 MiB | 15.0 MiB |