Handle characters with different encodings #4

leafo · 2017-07-29T19:01:57Z

Filenames containing characters that aren't UTF-8 encoded cause the extraction to fail. We should be able to automatically convert between encodings to avoid the error.

https://twitter.com/Tryall_YT/status/891304284284997632

Go throws the error: 'ERROR: invalid byte sequence for encoding "UTF8"'


fasterthanlime · 2017-09-14T00:30:24Z

Sooooooooo

Path encoding in .zip files is a bit of a mess (in practice) (/cc @GranPC)

In theory, zip has a flag that you can set that says "all the paths are UTF-8" - if it's not set, then... they can be anything.
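For reference, here is a rough sketch of how that flag can be checked from Go with the standard `archive/zip` package. The flag is bit 11 (0x800) of each entry's general-purpose bit flag, exposed as `FileHeader.Flags`. The helper names `declaresUTF8` and `needsDetection` are made up for illustration and are not taken from zipserver or butler.

```go
package main

import (
	"archive/zip"
	"bytes"
	"fmt"
	"unicode/utf8"
)

// utf8Flag is bit 11 of the zip general-purpose bit flag. When it is set,
// the entry's name and comment are declared to be UTF-8.
const utf8Flag = 0x800

// declaresUTF8 reports whether the entry's flags mark its name as UTF-8.
// (Hypothetical helper, named for this sketch only.)
func declaresUTF8(flags uint16) bool {
	return flags&utf8Flag != 0
}

// needsDetection reports whether a name has no UTF-8 flag and is also not
// valid UTF-8, i.e. its encoding would have to be guessed.
// (Hypothetical helper, named for this sketch only.)
func needsDetection(flags uint16, name string) bool {
	return !declaresUTF8(flags) && !utf8.ValidString(name)
}

func main() {
	// Build a one-entry archive in memory whose name holds a raw Latin-1
	// byte (0xE9) and has the UTF-8 flag cleared.
	var buf bytes.Buffer
	w := zip.NewWriter(&buf)
	hdr := &zip.FileHeader{Name: "caf\xe9.txt", Method: zip.Store, NonUTF8: true}
	if _, err := w.CreateHeader(hdr); err != nil {
		panic(err)
	}
	if err := w.Close(); err != nil {
		panic(err)
	}

	// Read it back and inspect the flag on each entry.
	r, err := zip.NewReader(bytes.NewReader(buf.Bytes()), int64(buf.Len()))
	if err != nil {
		panic(err)
	}
	for _, f := range r.File {
		fmt.Printf("%q utf8 flag=%v needs detection=%v\n",
			f.Name, declaresUTF8(f.Flags), needsDetection(f.Flags, f.Name))
	}
}
```

Pure ASCII names are valid UTF-8 whether or not the flag is set, so only names that fail `utf8.ValidString` actually need any guessing.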

We've had a similar problem with butler's unzip command (for the app) and Jesus is working on a method that autodetects the encoding of paths if the utf-8 flag is not set. It hasn't landed yet, but when it does, we should switch zipserver over to https://github.com/itchio/arkive, which will solve this issue.

fasterthanlime self-assigned this Sep 14, 2017
