Skip to content

data_collected_for_ContentMine

Richard Smith-Unna edited this page Jul 2, 2014 · 6 revisions

Every journal scraper in the collection targets the same data:

Core metadata

  • publisher
  • journal
  • title
  • authors
  • date
  • doi
  • volume
  • issue
  • firstpage
  • description

Creative content

  • abstract
  • fulltext HTML
  • fulltext PDF
  • supplementary materials
  • figures
  • figure captions

Connections

  • references

Permissions

  • license information
  • copyright assignment
Clone this wiki locally