Files with colons in the name do not save to disk after downloading #595

Open
dsdude123 opened this issue May 29, 2023 · 2 comments

Comments

@dsdude123

While trying to download the Computer Chronicles collection (linked below), I found that a number of files were not being saved to disk. I retried the download and saw in the console that the files were indeed downloading normally, but once a download completed the resulting file was 0 bytes in length and had no extension.

I then downloaded these files manually in my browser and noticed that what the affected files had in common was a colon in the name. This probably is a Windows specific issue as that character isn't allowed in file names and my browser when saving the file replaced it with an underscore.

Collection: https://archive.org/details/Computer_Chronicles

OS: Windows Server 2019 Datacenter
Python: 3.7.16

@maxz
Contributor

maxz commented May 30, 2023

> This probably is a Windows specific issue as that character isn't allowed in file names and my browser when saving the file replaced it with an underscore.

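That is correct. The problem is caused by the colon, which Windows does not allow in file or directory names.
Windows uses the colon as the separator that introduces so-called "Alternate Data Streams", so the part of the name after the colon is treated as a stream name rather than as part of the file name.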
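
To illustrate why the visible file ends up empty and without an extension, here is a simplified model of how a name containing a colon is interpreted on NTFS. This is only a sketch: `split_ntfs_stream` is a hypothetical helper, the example file name is made up, and the real parsing rules (stream types, multiple colons, drive letters) are more involved than a split at the first colon.

```python
from typing import Optional, Tuple


def split_ntfs_stream(name: str) -> Tuple[str, Optional[str]]:
    """Simplified model: split a name at its first colon.

    Returns (file_name, stream_name). stream_name is None when the
    name contains no colon, i.e. the data goes to the main stream.
    """
    file_name, sep, stream_name = name.partition(":")
    return file_name, (stream_name if sep else None)


# Hypothetical file name, similar in shape to the affected ones.
file_name, stream_name = split_ntfs_stream("Episode 101: Intro.mp4")
print(repr(file_name))    # the file that appears in the directory
print(repr(stream_name))  # where the downloaded bytes would end up
```

Under this model the directory entry is everything before the colon, which matches the symptom reported above: a file with no extension whose main stream holds 0 bytes.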

@pinkderg

Is there a solution or workaround for this? There are some pretty big collections I want to download, but most of the file names have colons in them and result in 0-byte files, with the data showing up under 'Size on disk' instead.
image
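
One possible workaround, sketched here under assumptions, is to do what the browser did and replace forbidden characters before the file is written. The function below is hypothetical and is not an existing option of the download tool discussed in this issue. It handles a single path component only, and it does not cover reserved device names such as `CON` or `NUL`.

```python
import re

# Characters Windows rejects in file names, plus ASCII control characters.
_WINDOWS_FORBIDDEN = re.compile(r'[<>:"/\\|?*\x00-\x1f]')


def sanitize_filename(name: str, replacement: str = "_") -> str:
    """Return a copy of one path component that is safe to create on Windows.

    Hypothetical helper: every forbidden character is replaced with
    `replacement`, and trailing spaces and dots are stripped because
    Windows does not accept them at the end of a name.
    """
    cleaned = _WINDOWS_FORBIDDEN.sub(replacement, name)
    cleaned = cleaned.rstrip(" .")
    # Fall back to the replacement if nothing usable is left.
    return cleaned or replacement


# Hypothetical file name, similar in shape to the affected ones.
print(sanitize_filename("Episode 101: Intro.mp4"))
```

Because archive file names can contain subdirectories, a real fix would need to apply this to each path component separately rather than to the whole path, and it would have to deal with two different names mapping to the same sanitized name.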
