Skip to content
/ topy Public

Python script to fix typos in text, based on the RegExTypoFix project from Wikipedia and AutoWikiBrowser

License

Notifications You must be signed in to change notification settings

intgr/topy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

69 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topy

Topy (anagram of "typo") is a Python script to fix typos in text, using rulesets developed by the RegExTypoFix project from Wikipedia. The English ruleset is included with Topy and is used by default. Other rulesets can be manually downloaded.

Topy works with Python 3.7-3.11.

The easiest way to install it is using pip:

pip install topy

Usage:

Usage: topy [options] FILES/DIRS...

Options:
  -h, --help            show this help message and exit
  -q, --quiet           silence information messages
  -a, --apply           overwrite files in place
  -r FILE, --rules=FILE
                        specify custom ruleset file to use
  -d RULE, --disable=RULE
                        disable rules by name
  --color=WHEN, --colour=WHEN
                        colorize the output; WHEN can be 'never', 'always', or
                        'auto'

For example, if you want to integrate topy in your CI pipeline, you can do something like:

sh -c "git ls-files | xargs topy --quiet --apply --; git --no-pager diff --exit-code"

The line above will check all files tracked by git, apply fixes to them and fail if any changes are applied. If no changes are to be applied, the command returns success.

Resources

Changelog

1.1.0 (2021-02-03)

  • Added colors to output. Thanks to Brian de Buiteach (@debuiteb) (#25)
  • Python 3.5 support is removed to allow newer syntax (#25)

1.0.1 (2021-01-30)

  • Updated bundled ruleset from Wikipedia (#29)
  • Python 3.9 is now officially supported and tested in CI (#27)
  • Migrated tests from travis-ci.org to travis-ci.com (#28)

1.0.0 (2020-09-08)

  • Updated bundled ruleset from Wikipedia (#23)
  • Removed Python 2.x and <3.5 compatibility code (#22)
  • Fixed Travis CI configuration (#21)

0.3.0 (2020-06-02)

Note: This was the last release to support Python 2.7.

  • Updated bundled ruleset (thanks to Oscar Caballero)
  • Added --disable option to disable individual rules (thanks to Oscar Caballero)
  • Fixed behavior when replacement string contains $ symbol (thanks to Oscar Caballero)
  • Prefer the faster lxml parser when it is installed. lxml is now an optional dependency (thanks to Oscar Caballero)
  • Added Python 3.7 support, deprecated Python 3.3 and 3.4.

0.2.2 (2016-12-16)

  • Update bundled ruleset
  • Officially add Python 3.6 support

0.2.1 (2016-07-15)

  • Update bundled ruleset
  • Update regex dependency version to avoid regex issue #216
  • Officially add Python 3.5 support

0.2.0 (2015-09-09)

  • Several fixes with Unicode on Python 2
  • Can safely deal with filenames that are invalid Unicode
  • Update bundled ruleset
  • Fix a few warnings from used libraries

0.1.0 (2014-08-24)

  • Initial public release

Contributing

Code style:

  • In general follow the Python PEP-8 coding style, except line length can go up to 120 chars.
  • Strings that have meaning for humans use double quotes ("), otherwise single quotes ('). When in doubt, don't worry about it.
  • Code should be compatible with both Python 2 and 3, preferably without version-specific conditionals.

Run the test suite using python setup.py test.

Submit your changes as pull requests on GitHub.

License

The Topy software is licensed under the MIT license (see LICENSE.txt)

The bundled retf.txt file, copied from Wikipedia AutoWikiBrowser/Typos by Wikipedia contributors is licensed under CC-BY-SA. See the page on Wikipedia for authorship information.

About

Python script to fix typos in text, based on the RegExTypoFix project from Wikipedia and AutoWikiBrowser

Resources

License

Stars

Watchers

Forks

Packages

No packages published