Use pronunciation for transliteration #8

davidpomerenke · 2020-05-30T23:58:53Z

While transliterating letter-by-letter works nicely for German → *, most users appear to find it unintuitive for English → *.

There is a tool for retrieving the International Phonetic Alphabet (IPA) transcription of an English word: https://github.com/shukriadams/node-text-to-ipa

The main work would be to rewrite the transliteration rules for English → * using IPA characters as source characters. There are 107 characters plus diacritics, so this will get really complex. I don't know whether regexes work well with IPA characters.
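On the regex question, here is a minimal sketch in Python (chosen only for illustration; it is not the project's language). IPA symbols are ordinary Unicode code points, so they can appear directly in a pattern. The awkward part is that diacritics and length marks are separate code points that follow their base symbol. The name `segment_ipa` and the small set of affricates and marks handled here are assumptions for this example, not a complete IPA grammar.

```python
import re
import unicodedata

# One segment = an affricate (optionally joined by the tie bar U+0361)
# or any single base symbol, followed by zero or more combining
# diacritics (U+0300-U+036F) or length marks (U+02D0, U+02D1).
# Only two affricates are listed; this is an illustrative subset.
IPA_SEGMENT = re.compile(
    r"(?:t\u0361?ʃ|d\u0361?ʒ|.)"
    r"[\u0300-\u036F\u02D0\u02D1]*",
    re.DOTALL,
)


def segment_ipa(ipa: str) -> list[str]:
    """Split an IPA string into segments (hypothetical helper)."""
    # NFD makes sure diacritics are separate combining code points.
    return IPA_SEGMENT.findall(unicodedata.normalize("NFD", ipa))


print(segment_ipa("θɪŋ"))     # "thing"
print(segment_ipa("ˈtʃiːz"))  # "cheese", with a leading stress mark
```

Stress marks come out as their own segments here, so a rule set could either ignore them or use them as context.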

davidpomerenke · 2020-05-31T00:04:22Z

An advantage of using the IPA as source characters would be that it would no longer be necessary to distinguish between different source languages. The source part of a rule would probably no longer need to contain multiple characters, and rules would no longer need to be prioritized. (This would bring no performance improvement, however, as the prioritization happens during preprocessing.)
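To illustrate the point about single-character sources, here is a rough sketch in Python (again only for illustration). The rule table `IPA_RULES`, the Cyrillic target values, and the function name are made up for this example and are not taken from the project's rule files. With one IPA symbol per rule, transliteration reduces to a per-symbol lookup, with no ordering between rules.

```python
# Hypothetical IPA -> Cyrillic rules; the values are illustrative only.
IPA_RULES = {
    "ʃ": "ш",
    "k": "к",
    "æ": "э",
    "t": "т",
    "ɪ": "и",
    "p": "п",
}


def transliterate_ipa(ipa: str) -> str:
    """Map each IPA symbol independently (assumed helper name)."""
    # Symbols without a rule are passed through unchanged.
    return "".join(IPA_RULES.get(symbol, symbol) for symbol in ipa)


print(transliterate_ipa("kæt"))  # "cat"
print(transliterate_ipa("ʃɪp"))  # "ship"
```

A letter-based rule set needs "sch" to win over "s" in German, or "sh" over "s" in English, whereas here /ʃ/ is a single symbol. This sketch iterates over code points, so symbols carrying diacritics would first have to be grouped with their base symbol.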

However, this presupposes that suitable IPA dictionaries are available for all relevant languages (only German, so far). The package mentioned above only includes an IPA dictionary for American English, and the authors mention at https://github.com/surrsurus/text-to-ipa that even this one was hard to find.

davidpomerenke added the enhancement (New feature or request) label on May 30, 2020

davidpomerenke mentioned this issue May 31, 2020

Omit rules if they would be overridden by other rules (which are not yet triggered) #6

Open

davidpomerenke mentioned this issue Dec 7, 2024

Syllable boundaries #24

Open
