LLAMA OCR

Using LLAMA Vision Model for OCR, allowing configuring any OpenAI compliant endpoints and model names. This is the python version of llama-ocr.

Free software: MIT license

Installation

pip install llama-ocr

Usage

from llama_ocr import ocr

data = ocr(
  file_path="./test.png",
  api_key="xxxxx",
  base_url="https://openrouter.ai/api",
  model="meta-llama/llama-3.2-11b-vision-instruct:free"
)
# file_path: Path to the image file
# api_key: Your LLM API key
# base_url: The base URL of the LLM API
# model: The model to use

By default, this project will use the free model from OpenRouter. So you just need to provide your API key and image path.

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.rst

README.rst

LLAMA OCR

Installation

Usage

Credits

Files

README.rst

Latest commit

History

README.rst

File metadata and controls

LLAMA OCR

Installation

Usage

Credits