scanned-documents

A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

ocr image-processing scanned-documents image-segmentation optical-character-recognition signature-verification ocr-engine signature-recognition signature-detection handwritten-signatures signature-extractor signature-extraction-algorithm

Updated Apr 20, 2023
Python

ad-si / awesome-scanning

Sponsor

Star

A curated list of awesome projects to simplify and improve paper and document scanning.

scanner scanned-documents dms document-scanner scanning book-scanning book-scanner digitization book-digitization page-scanning

Updated Nov 19, 2024

susam / tucl

Star

The first-ever paper on the Unix shell written by Ken Thompson in 1976 scanned, transcribed, and redistributed with permission

shell pdf unix paper conservation scanned-documents scanned-pages unix-shell

Updated Dec 3, 2022
Makefile

papermerge / papermerge-core

Star

Papermerge DMS core backend, REST API server, and frontend UI

pdf ocr documents scanned-documents dms records-management digital-archives document-management-system

Updated Nov 19, 2024
Python

brakmic / OpenCV

Star

📷 Computer-Vision Demos

opencv ocr computer-vision vision scanned-documents scanning ocr-recognition scanimage

Updated Jan 25, 2016
C#

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

html pdf ocr table-of-contents excel html-parser docx documents doc scanned-documents txt document-analysis odt pdf-parser table-recognition docx-parser document-content-extraction logical-structure-extraction

Updated Nov 19, 2024
Python

atgreen / paperless

Star

Emacs-assisted PDF document filing

pdf emacs melpa scanned-documents paperless

Updated Jan 30, 2024
Emacs Lisp

karolzak / boxdetect

Star

BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.

opencv computer-vision forms checkbox documents checkboxes scanned-documents boxes handwritten-documents cv2 opencv-python bounding-boxes box-detection scanned-images rectangle-detection handwritten-character-recognition handwritten-characters scanned-image-pdfs handwritten-forms

Updated Jan 18, 2023
Python

apurvmishra99 / pdf-to-scan

Star

Make your PDFs look like they were scanned

ghostscript imagemagick scan scanned-documents pdfs

Updated May 14, 2020
Python

beast / react-native-scan-doc

Star

A document scanner that automatically trims the edge with perspective transform

react-native scanned-documents

Updated Jun 14, 2018
Java

ApryseSDK / pdftron-android-ocr-scanner-sample

Star

Android Scanner with OCR support using PDFTron

android ocr scanner scanned-documents document-scanner pdf-scanner document-ocr

Updated Jul 7, 2021
Kotlin

maxim2266 / go-ocr

Star

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

go ocr scanned-documents extract-images

Updated Feb 20, 2020
Go

NjoyimPeguy / ICDAR-2019-RRC-SROIE

Star

ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

ocr text-classification scanned-documents keyword-extraction receipts text-localization icdar2019 sroie sroie2019 scanned-receipts

Updated Jul 20, 2022
Python

baltpeter / scanprep

Star

Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.

pdf image-processing scanned-documents scanning hacktoberfest

Updated Aug 13, 2024
Python

papermerge / documentation

Star

Documentation for Papermerge DMS - Installation, Help, User Manual, REST API

documentation ocr archives help scan installation scanned-documents dms document-management user-manual contrbuting

Updated Jul 29, 2024
HTML

goodday451999 / Character-Segmentation-of-Scanned-Text

Star

Segmentation of Scanned Text upto Character Level

scanned-documents handwritten-documents character-segmentation

Updated Dec 3, 2019
Python

AdroitAnandAI / Multilingual-Text-Inversion-Detection-of-Scanned-Images

Star

Efficient Text Localization Algorithm, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and Traditional Computer Vision.

multilingual computer-vision shape text images detection inversion efficient scanned-documents language-identification shape-context scanned-images image-inversion text-localization traditional-algorithm inversion-detection

Updated Dec 18, 2021
Python

Improve this page

Add a description, image, and links to the scanned-documents topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the scanned-documents topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scanned-documents

Here are 48 public repositories matching this topic...

ciur / papermerge

4lex4 / scantailor-advanced

Udayraj123 / OMRChecker

ahmetozlu / signature_extractor

ad-si / awesome-scanning

susam / tucl

papermerge / papermerge-core

brakmic / OpenCV

ispras / dedoc

atgreen / paperless

karolzak / boxdetect

apurvmishra99 / pdf-to-scan

beast / react-native-scan-doc

ApryseSDK / pdftron-android-ocr-scanner-sample

maxim2266 / go-ocr

NjoyimPeguy / ICDAR-2019-RRC-SROIE

baltpeter / scanprep

papermerge / documentation

goodday451999 / Character-Segmentation-of-Scanned-Text

AdroitAnandAI / Multilingual-Text-Inversion-Detection-of-Scanned-Images

Improve this page

Add this topic to your repo