Problems with this dataset? Open an issue.
You may also take a look at the source code.
The network in this dataset can be loaded directly from graph-tool with:import graph_tool.all as gt g = gt.collection.ns["microsoft_concept"]
A bipartite network of relations among "instances" and "concepts," like named entities, actions, objects, etc., mined from the World Wide Web and assembled using statistical techniques into a database called Probase. Edge weights represent the frequency of association between a concept and an instance. Access to data must be requested; this version is the "core IsA" graph.1
Name | Nodes | Edges | $\left<k\right>$ | $\sigma_k$ | $\lambda_h$ | $\tau$ | $r$ | $c$ | $\oslash$ | $S$ | Kind | Mode | NPs | EPs | gt | GraphML | GML | csv |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
microsoft_concept | 16,936,670 | 33,377,320 | 3.94 | 196.69 | 744.91 | 17939.31 | -0.06 | 0.00 | 22 | 0.89 | Undirected | Unipartite | name | count | 230.3 MiB | 357.2 MiB | 346.7 MiB | 341.5 MiB |