OllyButters edited this page Jun 4, 2020 · 2 revisions

The pipeline outputs a series of datasets intended for analysis in other applications. These datasets are written to the data folder and include:

  • Lists of authors
  • Lists of journals
  • Abstract, title and keyword text, in both raw and lemmatized form