This crawler goes over nulled.to/forum/15-other-leaks/ and downloads all the topic pages. It uses Scrapy's AutoThrottle extension to adapt the delay between requests, which makes it less likely that the website will block it.
Clone the repo
git clone [email protected]:noampolak/crawler-nulled.git
Install dependencies
poetry install
Go to the crawler folder
cd crawler
Run the crawler
scrapy crawl NulledVipLeaksSpider
Alternatively, you can run it in debug mode with VS Code.
You can change the settings in the settings.py file, or override some of them via environment variables.
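As a rough illustration of how settings.py could read overrides from the environment, here is a minimal sketch. The AUTOTHROTTLE_* names are real Scrapy settings, but the CRAWLER_* environment variable names, the helper functions, and the fallback values are assumptions made for this example and may not match what the repository actually uses.

```python
import os


def env_bool(name: str, default: bool) -> bool:
    """Read a boolean from the environment; an unset variable yields `default`."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in {"1", "true", "yes", "on"}


def env_float(name: str, default: float) -> float:
    """Read a float from the environment; an unset variable yields `default`."""
    raw = os.environ.get(name)
    return default if raw is None else float(raw)


# Scrapy setting names on the left are real; the CRAWLER_* variable names
# and the fallback values on the right are hypothetical.
AUTOTHROTTLE_ENABLED = env_bool("CRAWLER_AUTOTHROTTLE_ENABLED", True)
# Initial download delay, in seconds.
AUTOTHROTTLE_START_DELAY = env_float("CRAWLER_AUTOTHROTTLE_START_DELAY", 5.0)
# Upper bound on the delay when the server responds slowly, in seconds.
AUTOTHROTTLE_MAX_DELAY = env_float("CRAWLER_AUTOTHROTTLE_MAX_DELAY", 60.0)
# Average number of requests to send in parallel to the remote site.
AUTOTHROTTLE_TARGET_CONCURRENCY = env_float(
    "CRAWLER_AUTOTHROTTLE_TARGET_CONCURRENCY", 1.0
)
```

Under these assumptions, running `CRAWLER_AUTOTHROTTLE_MAX_DELAY=120 scrapy crawl NulledVipLeaksSpider` would raise the maximum delay to 120 seconds for that run without editing settings.py.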