-
Notifications
You must be signed in to change notification settings - Fork 8
Description
@kaliiiiiiiiii this project is largely based on your Selenium-Driverless
would you be interested in collaboration?
(spoiler: i will use MIT license)
i have not-yet found an actual "headful-web-scraper"
where i can simply remote-control an actual chromium browser
to allow "semi-automatic web scraping" (solving captchas, debugging error states)
so i created my own : )
so far, my code is unreleased
im using it in my opensubtitles-scraper to bypass cloudflare
so far, my code (fetch-subs.py) is really messy
and it will need some serious refactoring
from 8000 lines in one file, to modules and classes
my goal is to make chromium usable just like any other http client in python
as a drop-in replacement for aiohttp
i have a working prototype for handling file downloads (and html error pages)
but i guess that will be too complex / out of scope for Selenium-Driverless
see also Selenium-Driverless#140