Chinese text to Jyutping (粵拼; a standard of Cantonese romanization) conversion library in Python. Conversion is based on an officially published Cantonese Pronunciation List. This library performs the conversion using the Jyutping romanization table prepared by Dr. Samuel Cheng, which has been further converted to a dictionary file in JSON format.
pip install pyjyutping
>>> from pyjyutping import jyutping
>>> jyutping.convert("我係香港人")
'ngo5 hai6 hoeng1 gong2 jan4'
Tone number in output can be disabled:
>>> jyutping.convert("我係香港人", tone=False)
'ngo hai hoeng gong jan'
Alphabets and digits are preserved in the conversion:
>>> jyutping.convert("香港嘅英文係Hong Kong")
'hoeng1 gong2 ge3 jing1 man4 hai6 Hong Kong'
>>> jyutping.convert("1841年1月25號香港開埠")
'1841 nin4 1 jyut6 25 hou6 hoeng1 gong2 hoi1 bou6'
$ pyjyutping 我係香港人
ngo5 hai6 hoeng1 gong2 jan4
MIT