synonames
generates aliases for human names across cultural environments. Names often
have different spellings in different languages and cultures - for example, Alexander
can also be Alexandr or Oleksandr. This repository reads a data dump from Wikidata to
filter out every human name from every language edition of Wikipedia and map them across
language editions.
The resulting file can be used as a set of synonyms, for example to expand search queries against a dataset about people.
Setup the build environment with make
, to generate the synonames files, run
make build
in the build environment.