Olly Butters edited this page Jul 27, 2016 · 9 revisions

The file tree looks a little like:

- cache*
-- processed
--- merged    <- Intermediate files
-- raw        <- Raw copies of cached data
--- doi
--- pubmed
--- zotero
- config
- data*        <- Output CSV files etc.
- html*        <- Output HTML files
- inputs
- log*
- source        <- All the source code
-- add          <- Add extra metadata (geocode, citations etc)
-- analyse      <- Do some stats
-- bibliography <- Create bibliographic files
-- clean        <- Clean up the metadata
-- get          <- Get the metadata
-- html         <- Make the static html files

Folders marked with a * are generated when the program runs and can be safely deleted, although deleting the cache will make the next run take much longer.
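
As a rough illustration of that clean-up, here is a minimal Python sketch that removes the generated folders and keeps the cache unless told otherwise. It is not part of the project's source: the function name `clean_generated`, the `keep_cache` flag, and the assumption that the starred folders sit directly under the project root are all hypothetical.

```python
import shutil
from pathlib import Path

# Folders marked with a * in the tree above.
# Assumption: they live directly under the project root.
GENERATED = ("cache", "data", "html", "log")


def clean_generated(root, keep_cache=True):
    """Delete the generated folders under root and return the names removed."""
    removed = []
    for name in GENERATED:
        # Keeping the cache avoids a much slower next run.
        if keep_cache and name == "cache":
            continue
        target = Path(root) / name
        if target.is_dir():
            shutil.rmtree(target)
            removed.append(name)
    return removed
```

Calling `clean_generated(".")` from the project root would remove `data`, `html` and `log` if they exist. Passing `keep_cache=False` also removes `cache`, at the cost of a slower next run.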