Skip to content

An application that collects data from Google Scholar and Crossref

License

Notifications You must be signed in to change notification settings

ericgcc/crosscholar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

18b6478 · Jan 20, 2019

History

3 Commits
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019
Jan 20, 2019

Repository files navigation

crosscholar

Crosscholar is an application that collects scholarly data from Google Scholar and Crossref.

It's a command line scraper made with python that queries and parses Google Scholar's output crossing each record with crossref. This project is inspired by PyScholar(https://github.com/dnlcrl/PyScholar), and until now, is able to scrap Google Scholar more than 10 hours in a row, without being banned.

  • Free software: MIT License

Features

  • Gets a list of authors related to a query string.
  • For each author, gets data like name and citations distributed in the time.
  • Also for each author, extracts the data related to each work.
  • When getting work's data, the program connects to crossref to verify the record and get the DOI number.
  • Each work is retrieved with title, total citations, citations distribution, wos citations, publisher, authors, work type, volume, issue and page, when the information is available.

Todo

  • Writting a command line tool.
  • Improve code quality.

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

About

An application that collects data from Google Scholar and Crossref

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published