Skip to content

aaasen/kapok

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kapok

A Knowledge Graph of Wikipedia.

A graph of how Ayn Rand relates to other historical figures. Orange nodes represent categories while purple nodes represent articles. This visualisation was created with the Neo4J graph browser using a small subset of the Wikipedia graph; it is not completely accurate.

Description

Kapok aims to create a knowledge graph from Wikipedia. In this graph, each node is an article, and links between articles are the edges between nodes.

Structure

Kapok is split into 3 modular sections:

  • Parsing: extracting relevant data from a 45GB archive of Wikipedia
  • Graph: morphing the parsed data into a graph for analysis
  • Visualisation: creating interesting visualisations with the data

The parsing section of Kapok could be easily extended to replace aging Wikimedia tools like MWDumper. I'll probably do this soon.

About

A knowledge graph of Wikipedia

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published