THIS LIBRARY MIGHT STILL WORK FOR CERTAIN USE CASES BUT ISN'T BEING ACTIVELY DEVELOPED OR MAINTAINED ANYMORE.
Purell is used to extract clean HTML from any kind of flat file format and prepare it for conversion into a raw Superbook. It is a pure
CLI utility to turn low quality markup available off of ordinary files into a high-quality Superbook-compatible markdown as per Bubblin-approved standards.
Purell is almost entirely designed to be used at the root of a Bookiza app. However, it can also be used as a transitioning tool between older file formats and the Superbook format,like so:
- MS Word ⭌ Extract ugly HTML ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- ePub ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- PDF ⭌ ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
- Webpage (Scroll) ⭌ Markdown ⭌ Sanitized HTML ⭌ Superbook
$ pure --help
$ pure fetch <url> // Will fetch original.html from source URI
$ pure defile <path to file> // Will extract original.html from source file.
$ pure sanitize // Markdown Compatible HTML (Sanitized)
The responsibility to paginate the sanitized.html
into a Superbook is held by h2s.
Blueoak Model License 1.0.0