GitHub - TextpressoDevelopers/textpresso_classifiers: Textpressocentral classifiers

Introduction

tpclassifier is a Python library for training and applying classifiers to text documents. It is built on the Python scikit-learn library and provides a simple interface for training and using scikit-learn classifiers. In addition, tpclassifier includes functions that convert PDF files and Textpresso CAS files (generated from either PDF or XML files) into text, which simplifies importing documents into the library and using them for training, testing, and prediction.

Installing tpclassifier library

To install tpclassifier, run the following command from the root directory of the project:

pip3 install .

Installation requires Python 3 and pip3 to be installed on the system.

Using the library from Python

The library can be imported as a regular Python package:

from tpclassifier import TextpressoDocumentClassifier

classifier = TextpressoDocumentClassifier()

The complete documentation of the classes and functions provided by the library can be found here.
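Because the library builds on scikit-learn, the train-then-predict workflow it wraps can be illustrated with scikit-learn directly. The following is a minimal sketch, not the tpclassifier API: the toy documents, the labels, and the choice of a bag-of-words naive Bayes model are all assumptions made for illustration, and the methods of `TextpressoDocumentClassifier` may differ.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy training corpus (made-up documents and labels, for illustration only).
train_texts = [
    "gene expression in worm neurons",
    "protein mutation alters gene function",
    "stock market prices fell sharply",
    "bank raises interest rates again",
]
train_labels = ["biology", "biology", "finance", "finance"]

# Bag-of-words features followed by a multinomial naive Bayes classifier.
# This is plain scikit-learn, not the tpclassifier interface.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(train_texts, train_labels)

# Apply the trained model to unseen documents.
new_texts = ["gene mutation in neurons", "interest rates fell"]
predictions = model.predict(new_texts)

for text, label in zip(new_texts, predictions):
    print(f"{label}: {text}")
```

In practice the input text would come from converted PDF or CAS files rather than hard-coded strings.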

Using the executable scripts provided by the library

tpclassifier comes with a set of executable programs that use the library as a backend to provide a simple interface for training, testing, and applying classifiers to PDF or CAS documents. See the project wiki for the complete documentation of these programs and some example use cases.

