-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cache expiration #99
Comments
Hm, I like the idea. We would need another caching layer (the third), for a specific download block (so not per url, as in https://github.com/pipes-digital/pipes/blob/master/downloader.rb). And then a way to invalidate that cache. Or indeed as a per-pipe option, changing the cache logic in Line 85 in ea379d5
|
Something else related to this: when a pipe is configured to download two or more URLs from the same domain in a row (as in a |
It's not supposed to be as fast as possible. The downloads are put into a ThrottleQueue, divided by domain. See https://github.com/pipes-digital/pipes/blob/master/downloader.rb#L21-L25:
If that leads to as fast as possible in a foreach, that would be a bug :/ |
This is one of the blogs that stopped my pipe with captchas when I was trying different combinations of Pipes blocks: https://pastebin.mozilla.org/kN5HgRk6 The problematic pipe was downloading the latest 3 or 4 posts. As I kept making changes and previewing them, the blog rightfully detected the pipe as a bot. Another note on caching everything more aggressively: this doesn't need to be done with pipes that only connect to This site, on the other hand, can barely handle its human visitors, so the less often Pipes downloads its front page with the pipe I have for it, the better: |
When pipes have
download
blocks or are too slow to process, consider caching the output feed for a (much) longer time. You can also add these tags for them to suggest feed readers not to update them too often.I'd be fine with it if my feeds that fit this description were automatically cached for three days or even longer to save everyone's bandwidth and server resources.
This could also be added as an option for us to manually mark pipes that don't need to be updated for a long time. I have some pipes with
download
blocks that only really need to be checked once a month.The text was updated successfully, but these errors were encountered: