Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incomplete download problem (found 702 posts, download only 111 of them) #151

Closed
luboq opened this issue Dec 6, 2018 · 2 comments
Closed

Comments

@luboq
Copy link

luboq commented Dec 6, 2018

snip

@cebtenzzre
Copy link
Collaborator

@waldens Your "example post" gives a 404.

@cebtenzzre
Copy link
Collaborator

I was able to reproduce the issue with standard tumblr-utils, however with the changes proposed by #114 (tested using a git clone of this) I was able to successfully download 319 posts. I'm still not sure why it's less than the expected 700+.

I compared the results to an equivalent download done by TumblThree, and I found that the performance of tumblr-utils was far better -- not only did TumblThree find ~100 less likes, but it also just downloaded a dump of images and videos instead of a proper index. As far as I know, TumblThree scrapes the /liked/by page directly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants