Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion
The pylatexenc.latexencode
module provides a function unicode_to_latex()
which converts a unicode string into LaTeX text and escape sequences. It should
recognize accented characters and most math symbols. A couple of switches allow
you to alter how this function behaves.
You can also run latexencode
in command-line to convert plain unicode text
(from the standard input or from files given on the command line) into LaTeX
code, written on to the standard output.
A third party plug-in for Vim vim-latexencode by @Konfekt provides a corresponding command to operate on a given range.
The pylatexenc.latexwalker
module provides a series of routines that parse
the LaTeX structure of given LaTeX code and returns a logical structure of
objects, which can then be used to produce output in another format such as
plain text. This is not a replacement for a full (La)TeX engine, rather, this
module provides a way to parse a chunk of LaTeX code as mark-up code.
The pylatexenc.latex2text
module builds up on top of
pylatexenc.latexwalker
and provides functions to convert given LaTeX code to
plain text with unicode characters.
You can also run latex2text
in command-line to convert LaTeX input (either
from the standard input, or from files given on the command line) into plain
text written on the standard output.
Full documentation is available at https://pylatexenc.readthedocs.io/.
See LICENSE.txt (MIT License).
NOTE: See copyright notice and license information for file
tools/unicode.xml
provided in tools/unicode.xml.LICENSE
. (The file
tools/unicode.xml
was downloaded from
https://www.w3.org/2003/entities/2007xml/unicode.xml as linked from
https://www.w3.org/TR/xml-entity-names/#source.)