Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for wordlists in other languages #50

Open
simonft opened this issue Mar 9, 2018 · 5 comments
Open

Add support for wordlists in other languages #50

simonft opened this issue Mar 9, 2018 · 5 comments

Comments

@simonft
Copy link
Contributor

simonft commented Mar 9, 2018

I've been working on https://github.com/simonft/phrasebook and have been talking with @drebs about using diceware as the backend for it. One thing I want to add to phrasebook is the ability to choose from wordlists in multiple languages. Is adding wordlists in other languages something you'd consider adding (or allowing someone else to add) to diceware as well?

@ulif
Copy link
Owner

ulif commented Mar 10, 2018

By all means, yes! Non-english wordlists would be really nice!

Of course, if you have your own wordlist, you can store it in the diceware wordlist dir as wordlist_CC.txt with CC being a country code like fr, pt, or de. The wordlist will then be used when you call diceware -w CC.

If you want to share your wordlist with others, please do not hesitate to do a pull request.

I recommend, however, to check your wordlist with the diceware-list tools I use for exact that purpose. This package contains a few little scripts, one of them, wlflakes scanning existing lists for problems, while wldownload creates new lists out of Android-wordlists (even enabling you to choose whether to include offensive words or not).

Overall, I think ideal wordlists should be fairly long (say 2^15 terms?), be prefix code, contain no offensive words (which is not a hard criterion), and preferably contain short words (for easy typing). But this can all be discussed, of course.

@drebs
Copy link
Contributor

drebs commented Apr 22, 2019

I have proposed a "pt-br" wordlist: #60

Check the commit message for the method I used to come up with the list. I did not use the Android wordlist as a base because it comes with many non pt-br words.

@htgoebel
Copy link
Contributor

I have proposed a German, Italian and Latin wordlist: #61

@drebs
Copy link
Contributor

drebs commented Jul 2, 2024

FTR, some wordlists have been added since this issue was created (it, ca, es) but there's no release including them yet.

Tails, for example, has downported them, but it'd be great to get a new release so we ship them to the whole world! 😄

(In passing, if a next release could also include #86 i think it'd address the needs of several users.)

@drebs
Copy link
Contributor

drebs commented Jul 9, 2024

(In passing, if a next release could also include #86 i think it'd address the needs of several users.)

Actually, I wanted to refer to support for custom wordlists in ${XDG_DATA_HOME}/diceware/wordlists, which is related to the above but different.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants