Chinese Text-to-Speech(TTS)
汉字 => ["han4", "zi4"] => .wav audio
Read this page in 简体中文
A list of all Chinese characters is obtained from 倉頡平台2012, a Chinese input software.
git clone https://github.com/junzew/HanTTS.git
cd HanTTS
pip install --user -r requires.txt
Download syllables.zip
from SourceForge, and decompress under the directory HanTTS
.
- Either run locally:
python main.py
- Or through web interface:
cd
into theserver
folderNavigate tonpm install node app.js
localhost:3000
in a browser
- Record five tones of each pinyin listed in
mapping.json
- Group them by the first letter (a,b,c,d, etc.), and save under folder
./recording
as{letter}.wav
- Then run
python process.py {letter}
for eachletter
to split recordings into individual pinyin - Move valid .wav files from
./pre
to the folder./syllables
Based on the text-to-speech-sample project by @alexram1313