Sentence splitter
https://pypi.org/project/icu-tokenizer/
https://pypi.org/project/mosestokenizer/

Alternative tokenizer
https://pypi.org/project/fast-mosestokenizer/
https://pypi.org/project/fasttokenizer/
https://pypi.org/project/icu-tokenizer/
https://pypi.org/project/polyglot/
https://pypi.org/project/jieba/ (Chinese)
https://github.com/SamuraiT/mecab-python3 (Japanese)