logo Netzschleuder network catalogue, repository and centrifuge

Problems with this dataset? Open an issue.
You may also take a look at the source code.
The network in this dataset can be loaded directly from graph-tool with:
import graph_tool.all as gt
g = gt.collection.ns["wiki_article_words"]

wiki_article_words — Wikipedia article-words (en) (2010)


A bipartite network of English Wikipedia articles and the words they contain. The edge weight gives the number of times a word appeared in the connected article.1

  1. Description obtained from the ICON project. 

Informational Language Weighted
Upstream URL OK
Tip: hover your mouse over a table header to obtain a legend.
Name Nodes Edges $\left<k\right>$ $\sigma_k$ $\lambda_h$ $\tau$ $r$ $c$ $\oslash$ $S$ Kind Mode NPs EPs gt GraphML GML csv
wiki_article_words 276,739 2,941,902 21.26 138.32 818.58 2.23 -0.35 0.00 4 1.00 Undirected Bipartite weight 8.7 MiB 17.9 MiB 14.7 MiB 15.0 MiB
None drawing