Skip to content

lets collaborate #1

@milahu

Description

@milahu

@kaliiiiiiiiii this project is largely based on your Selenium-Driverless
would you be interested in collaboration?
(spoiler: i will use MIT license)

i have not-yet found an actual "headful-web-scraper"
where i can simply remote-control an actual chromium browser
to allow "semi-automatic web scraping" (solving captchas, debugging error states)
so i created my own : )

so far, my code is unreleased
im using it in my opensubtitles-scraper to bypass cloudflare

so far, my code (fetch-subs.py) is really messy
and it will need some serious refactoring
from 8000 lines in one file, to modules and classes

my goal is to make chromium usable just like any other http client in python
as a drop-in replacement for aiohttp

i have a working prototype for handling file downloads (and html error pages)
but i guess that will be too complex / out of scope for Selenium-Driverless
see also Selenium-Driverless#140

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions