Skip to content

Latest commit

 

History

History
26 lines (21 loc) · 572 Bytes

README.md

File metadata and controls

26 lines (21 loc) · 572 Bytes

This is a python script to extract table from a pdf file.

PREREQUISITES

python3

INSTALLATION

install virtual environment

sudo apt install python3.11-venv python3 -m venv venv source venv/bin/activate

install needed packages

pip install "camelot-py[base]" pip install --upgrade PyPDF2==2.12.1 pip install opencv-python

SETUP

  • copy pdf file in "extract_table_from_pdf" folder
  • update the variables "file_name" and "pages" below

RUN

python3 extract_table_from_pdf.py

POSTPROCESSING

  • to be done manually
  • deactivate the virtual environment: deactivate