Skip to content

benbenben2/extract_table_from_pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

This is a python script to extract table from a pdf file.

PREREQUISITES

python3

INSTALLATION

install virtual environment

sudo apt install python3.11-venv python3 -m venv venv source venv/bin/activate

install needed packages

pip install "camelot-py[base]" pip install --upgrade PyPDF2==2.12.1 pip install opencv-python

SETUP

  • copy pdf file in "extract_table_from_pdf" folder
  • update the variables "file_name" and "pages" below

RUN

python3 extract_table_from_pdf.py

POSTPROCESSING

  • to be done manually
  • deactivate the virtual environment: deactivate

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages