MSc project investigating multi-modal fusion approaches to combining textual and visual features for multi-page document classification, applied to documents in the North Sea Transition Authority (NSTA, formerly the Oil and Gas Authority, OGA) National Data Repository (NDR), using deep multimodal fusion convolutional long short-term memory (C-LSTM) neural networks. This readme gives a brief overview of the project and the code in this repository; see the accompanying report for full details. Note that this is experimental code for my masters project; I would advise against running any of it in production.
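As a miniature illustration of the fusion idea (this sketch is not the project's code; the function name, feature dimensions, and values are all illustrative assumptions), early fusion can be as simple as concatenating per-page text and image feature vectors before a sequence model such as an LSTM consumes the page sequence:

```python
def fuse_page_features(text_feats, image_feats):
    """Early fusion: concatenate text and image feature vectors per page.

    text_feats, image_feats: lists of per-page feature vectors (lists of floats).
    Returns one fused vector per page, ready for a sequence model (e.g. an LSTM).
    """
    if len(text_feats) != len(image_feats):
        raise ValueError("need one text and one image vector per page")
    return [t + v for t, v in zip(text_feats, image_feats)]

# Toy example: a 2-page document with 3-dim text and 2-dim image features.
text = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
image = [[1.0, 2.0], [3.0, 4.0]]
fused = fuse_page_features(text, image)  # two 5-dim fused vectors
```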
All Python dependencies for the project can be installed by building a conda environment from the `environment.yml` file.
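For example (assuming conda is installed; the environment name is whatever the `name:` field in `environment.yml` specifies, so the placeholder below is not a real name):

```shell
# Build the environment from the spec file (assumes conda is on PATH)
conda env create -f environment.yml

# Activate it, substituting the name declared in environment.yml
conda activate <env-name>
```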
The data source used in this project was compiled from a corpus of raw documents uploaded by oil companies to the National Data Repository (NDR), a repository for UK petroleum exploration and production data maintained by the North Sea Transition Authority (NSTA). The corpus consists of a sample of 6,541 documents, mostly PDF, Microsoft Office, plain-text and image files.
The documents in this corpus are split into 6 classes:
- geol_geow - Geological end of well reports.
- geo_sed - Geological sedimentary reports.
- gphys_gen - General geophysical reports.
- log_sum - Well log summaries.
- pre-site - Pre-site reports.
- vsp_file - Vertical seismic profiles.
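When training a classifier over these classes, the labels above are typically encoded as integer indices. A minimal mapping might look like the following (the ordering is an assumption for illustration, not taken from the project code):

```python
# Hypothetical label-to-index mapping for the six NDR document classes.
CLASS_TO_INDEX = {
    "geol_geow": 0,  # geological end of well reports
    "geo_sed": 1,    # geological sedimentary reports
    "gphys_gen": 2,  # general geophysical reports
    "log_sum": 3,    # well log summaries
    "pre-site": 4,   # pre-site reports
    "vsp_file": 5,   # vertical seismic profiles
}

# Reverse mapping, useful for turning model predictions back into labels.
INDEX_TO_CLASS = {i: label for label, i in CLASS_TO_INDEX.items()}
```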