LLAMA OCR

Using LLAMA Vision Model for OCR, allowing configuring any OpenAI compliant endpoints and model names. This is the python version of llama-ocr.

Free software: MIT license

Installation

pip install llama-ocr

Usage

from llama_ocr import ocr

data = ocr(
  file_path="./test.png",
  api_key="xxxxx",
  base_url="https://openrouter.ai/api",
  model="meta-llama/llama-3.2-11b-vision-instruct:free"
)
# file_path: Path to the image file
# api_key: Your LLM API key
# base_url: The base URL of the LLM API
# model: The model to use

By default, this project will use the free model from OpenRouter. So you just need to provide your API key and image path.

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github		.github
docs		docs
src/llama_ocr		src/llama_ocr
tests		tests
.cursorrules		.cursorrules
.editorconfig		.editorconfig
.gitignore		.gitignore
.travis.yml		.travis.yml
AUTHORS.rst		AUTHORS.rst
CODE_OF_CONDUCT.rst		CODE_OF_CONDUCT.rst
CONTRIBUTING.rst		CONTRIBUTING.rst
HISTORY.rst		HISTORY.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.rst		README.rst
pyproject.toml		pyproject.toml
requirements_dev.txt		requirements_dev.txt
ruff.toml		ruff.toml
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLAMA OCR

Installation

Usage

Credits

About

Releases 1

Packages

Languages

License

1WorldCapture/llama_ocr

Folders and files

Latest commit

History

Repository files navigation

LLAMA OCR

Installation

Usage

Credits

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages