Skip to content

rholang/AlgoliaSearch

Repository files navigation

DocSearch scraper

This repository holds the code of the DocSearch scraper used to power the hosted version of DocSearch. DocSearch scaper is used as search for the rholang/rholang.github.io implementation.

Pre-requisites

  • Install Python 3.6. Only this version have no error messages displayed.
  • Install pipenv
    • run command line with admin rights
    • pip install --user pipenv
    • print out where the path of python is with: py -m site --user-site
    • set path variable (change path to your path) : setx PATH "%PATH%;C:\Users\jetbrains\AppData\Roaming\Python\Python36\Scripts"

Setup project

  • Open this project with vscode -> cd into /DocSearch-Srapper
  • pipenv install --dev
  • rename .env.example to .env

Folder structure

  • .evn

    • keys from algolia and chromedriver path
    • change APPLICATION_ID, API_KEY to yours from algolia
    • change CHROMEDRIVER_PATH to your path
  • config/website.json

    • config, what css selector to parse on your website

Start scrapping

  • pipenv run python ./docsearch run ./configs/website.json

About

DocSearch Scrapper implementation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages