Web Scrape

One master server.
Multiple "workers" supply the network bandwidth and processing power for scraping.
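
A minimal sketch of the one-server, many-workers layout, assuming threads stand in for remote workers and an in-process queue stands in for the server's job list. The names `run_workers` and `handle` are hypothetical and not taken from this repository.

```python
import queue
import threading


def run_workers(jobs, handle, num_workers=3):
    """Distribute jobs from one shared queue (the "server") across worker threads."""
    pending = queue.Queue()
    for job in jobs:
        pending.put(job)

    results = []
    lock = threading.Lock()

    def worker():
        while True:
            try:
                job = pending.get_nowait()
            except queue.Empty:
                return  # no work left, so this worker exits
            outcome = handle(job)
            with lock:  # protect the shared results list
                results.append(outcome)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Results arrive in whatever order the workers finish, so callers should not rely on ordering.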
How to update worker code?
- The server sends the pre- and post-processing stages of the job pipeline to the workers.
- These stages are maintained as ordinary Python code.
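
One way the update step above could work, as a sketch only: the server packages the stage source with a version hash, and each worker reloads the stages when the hash changes. `PIPELINE_SOURCE`, `package_pipeline`, `Worker`, and the `before`/`after` signatures are all assumptions for illustration. Note that `exec` runs arbitrary code, so this only makes sense if workers fully trust the server.

```python
import hashlib

# Hypothetical stage source, as the server might read it from an ordinary .py file.
PIPELINE_SOURCE = '''
def before(job):
    """Build the request for a job (runs on the worker before fetching)."""
    return {"url": job["url"], "headers": {"User-Agent": "webscrape-worker"}}

def after(job, body):
    """Turn the fetched body into a result plus a list of follow-up jobs."""
    return {"url": job["url"], "length": len(body)}, []
'''


def package_pipeline(source):
    """Server side: bundle the source with a hash so workers can detect updates."""
    version = hashlib.sha256(source.encode("utf-8")).hexdigest()
    return {"version": version, "source": source}


class Worker:
    def __init__(self):
        self.version = None
        self.stages = {}

    def update(self, package):
        """Reload the stages only when the server's version differs."""
        if package["version"] == self.version:
            return False
        namespace = {}
        # exec runs arbitrary code: acceptable only if the server is trusted.
        exec(compile(package["source"], "<pipeline>", "exec"), namespace)
        self.stages = {"before": namespace["before"], "after": namespace["after"]}
        self.version = package["version"]
        return True

    def process(self, job, fetch):
        """Run before -> fetch -> after; `fetch` is supplied by the worker."""
        request = self.stages["before"](job)
        body = fetch(request)
        return self.stages["after"](job, body)
```

Keeping the actual fetch on the worker side means only the surrounding logic travels over the wire, which matches the "before/after" split described above.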
Scraped data is inserted into the database, and more jobs are created.
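
A hedged sketch of that last step, using the standard-library `sqlite3` module as a stand-in for whichever database the project actually uses. The `results` and `jobs` tables, their columns, and the function names are assumptions, not the repository's schema.

```python
import sqlite3


def init_db(conn):
    """Create the two hypothetical tables used in this sketch."""
    conn.execute("CREATE TABLE IF NOT EXISTS results (url TEXT PRIMARY KEY, data TEXT)")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS jobs ("
        "url TEXT PRIMARY KEY, status TEXT NOT NULL DEFAULT 'pending')"
    )


def store_result(conn, url, data, discovered_urls):
    """Save one result, mark its job done, and enqueue newly discovered URLs."""
    with conn:  # commits on success, rolls back if any statement raises
        conn.execute(
            "INSERT OR REPLACE INTO results (url, data) VALUES (?, ?)", (url, data)
        )
        conn.execute("UPDATE jobs SET status = 'done' WHERE url = ?", (url,))
        # OR IGNORE skips URLs that already have a job, so nothing is queued twice.
        conn.executemany(
            "INSERT OR IGNORE INTO jobs (url) VALUES (?)",
            [(u,) for u in discovered_urls],
        )
```

Doing all three writes in one transaction avoids a state where a result is stored but its follow-up jobs are lost.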

Repository contents:
- postgres
- tests
- webscrape
- .gitignore
- Dockerfile
- README.md
- poetry.lock
- pyproject.toml
- requirements.txt
- temp.sqlite
- x.py
