This repository has been archived by the owner on Feb 26, 2021. It is now read-only.

How to control concurrency? #3

Open
luzihang123 opened this issue Jul 10, 2020 · 1 comment

Comments

@luzihang123

luzihang123 commented Jul 10, 2020

Requirement: render 100,000 (10W) web pages per day to Base64 and upload them to OSS.

@elacuesta
Owner

First of all, thanks for taking an interest in this project.

Concurrency should be handled by the Scrapy Scheduler (which acts before the handler provided by this package), taking into account settings like CONCURRENT_REQUESTS, CONCURRENT_REQUESTS_PER_DOMAIN and CONCURRENT_REQUESTS_PER_IP. Could you elaborate a bit more on your use case, maybe with some minimal code to help me understand the scenario? Thanks again.
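
For reference, a minimal sketch of how the settings named above might look in a project's `settings.py`. The numeric values are illustrative assumptions, not recommendations from this thread, and the throughput figure at the end is only a rough check against the 100,000 pages per day target mentioned in the question.

```python
# settings.py (sketch): the values below are assumptions chosen for illustration.

# Upper bound on the number of requests Scrapy performs at the same time.
CONCURRENT_REQUESTS = 16

# Upper bound on simultaneous requests to any single domain.
CONCURRENT_REQUESTS_PER_DOMAIN = 8

# Upper bound on simultaneous requests to any single IP.
# 0 disables it; a nonzero value is used instead of the per-domain limit.
CONCURRENT_REQUESTS_PER_IP = 0

# Rough throughput check for the stated target (not a Scrapy setting;
# Scrapy only reads uppercase names from this module).
PAGES_PER_DAY = 100_000
SECONDS_PER_DAY = 24 * 60 * 60
required_pages_per_second = PAGES_PER_DAY / SECONDS_PER_DAY  # a bit over 1 page/s
```

Whether these limits are enough depends on how long each page takes to render, so the right values would have to be found by measuring the actual workload.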
