Skip to content

hkilang/TTS

Repository files navigation

香港圍頭話及客家話文字轉語音

Data Preprocessing

Inputs: dictionary.csv, public.csv, HakkaWords.csv, WaitauWords.csv Process: compile.py Outputs: chars.csv, hakka_words.csv, waitau_words.csv

In addition to words from HakkaWords.csv and WaitauWords.csv, extra words are automatically generated from collocations from the note column of dictionary.csv and entries with frequencies ≥ 10 from public.csv. Only entries which include at least one polyphone in the target language are included.