Skip to content

text2doc/python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

python.text2doc.com

# Clean up any previous build artifacts
rm -rf build/ dist/ *.egg-info/

# Build the package
python -m pip install --upgrade pip build
pip install --upgrade setuptools

python -m build

# Publish to PyPI (if needed)
python -m pip install --upgrade twine
python -m twine upload dist/*

python -m venv venv
source venv/bin/activate

Clean up the build artifacts and rebuild

rm -rf build/ dist/ src/text2doc.egg-info/
python -m pip install -e .
python -m build

text2doc implements a virtual file system in memory. This module provides an interface compatible with the os module and provides operations on files and directories stored in RAM rather than on disk.

Installation

pip install text2doc

Usage

from text2doc import text2doc

# Example usage
texts = ["This is a sample text.", "Another text to tokenize."]
tokenizer = YourTokenizer()  # Replace with your tokenizer instance
docs = text2doc(texts, tokenizer)
print(docs)

Development

Clean up any previous build artifacts

rm -rf build/ dist/ *.egg-info/

Build the package

python -m pip install --upgrade pip build
python -m build

Publish to PyPI (if needed)

python -m pip install --upgrade twine
python -m twine upload dist/*