Automatic crawler for three major scientific libraries (IEEE xplore, ACM, ScienceDirect) for Literature Reviews.

lukehali/PyLitReview


PyLitReview

19.05.2022: Updated for Python 3.10 (fixed deprecated Selenium code; added enums and match statements)

Quick Guide

  • Install Selenium
  • Download the ChromeDriver version that matches your installed Google Chrome
  • Enter your settings in config_template.py, then rename the file to config.py
  • Create your keyword list
  • Usage: crawl(keywords, Library.library, SearchWhere.searchWhere)
    • keywords must be a list of lists of strings
    • library options:
      • IEEE
      • ACM
      • ScienceDirect (does not work in headless mode)
    • searchWhere options:
      • Title
      • TitleAbstract (Title OR Abstract; does not work for ACM)
      • Abstract
      • Text (All)
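
The options above can be sketched as follows. This is a minimal illustration, not the repository's code: the two enums are stand-ins that copy the option names from the list (the exact member names in the repository, in particular for "Text (All)", are an assumption), and validate_keywords and is_supported are hypothetical helpers added here only to show the expected shape of the arguments. The keyword terms are made-up examples.

```python
from enum import Enum


# Stand-in enums mirroring the option names listed above. The real
# definitions live in the repository and may differ in detail.
class Library(Enum):
    IEEE = "IEEE"
    ACM = "ACM"
    ScienceDirect = "ScienceDirect"


class SearchWhere(Enum):
    Title = "Title"
    TitleAbstract = "TitleAbstract"
    Abstract = "Abstract"
    Text = "Text"  # assumed member name for the "Text (All)" option


def validate_keywords(keywords):
    """Hypothetical helper: True if keywords is a list of lists of strings."""
    if not isinstance(keywords, list):
        return False
    for group in keywords:
        if not isinstance(group, list):
            return False
        if not all(isinstance(term, str) for term in group):
            return False
    return True


def is_supported(library, search_where):
    """Hypothetical helper: reject the combination documented as not working."""
    # TitleAbstract is listed above as not working for ACM.
    return not (library is Library.ACM and search_where is SearchWhere.TitleAbstract)


# Example keyword structure: a list of lists of strings.
keywords = [
    ["machine learning", "deep learning"],
    ["literature review", "survey"],
]

if validate_keywords(keywords) and is_supported(Library.IEEE, SearchWhere.Title):
    # With the repository's own crawl function and enums imported,
    # the call would take this shape:
    # crawl(keywords, Library.IEEE, SearchWhere.Title)
    pass
```

Checking the arguments before starting a crawl avoids launching a browser session for a combination that is documented as unsupported.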

ToDos

  • Implement title+abstract search
  • Update ScienceDirect crawler
