
LayoutLMv3 notebooks

In this directory, you can find notebooks that illustrate how to use LayoutLMv3, both for fine-tuning on custom data and for inference.

Important note

LayoutLMv3 models are capable of getting > 90% F1 on FUNSD. This is thanks to the use of segment position embeddings (inspired by StructuralLM), as opposed to word-level position embeddings. It means that words belonging to the same "segment" (say, an address) share the same bounding box coordinates, and thus the same 2D position embeddings. You can see here how the authors did this for the FUNSD dataset.

So it's always advised to use segment position embeddings over word-level position embeddings.
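To make the distinction concrete, here is a minimal sketch (the words and coordinates are made up for illustration; boxes use the 0-1000 normalized format LayoutLMv3 expects):

```python
words = ["123", "Main", "Street"]

# Word-level boxes: every word has its own bounding box.
word_level_boxes = [
    [80, 110, 115, 130],
    [120, 110, 170, 130],
    [175, 110, 240, 130],
]

# Segment-level boxes: all words of the same segment (here, one address line)
# share the bounding box of the whole segment, and therefore get the same
# 2D position embeddings.
segment_box = [80, 110, 240, 130]
segment_level_boxes = [segment_box] * len(words)
```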

Training tips

Note that LayoutLMv3 is identical to LayoutLMv2 in terms of training/inference, except that:

  • images need to be resized and normalized, such that they become pixel_values of shape (batch_size, num_channels, height, width). The channels need to be in RGB format. This was not the case for LayoutLMv2, which expected the channels in BGR format (due to its Detectron2 visual backbone) and normalized the images internally. See the sketch after this list.
  • tokenization of text is based on RoBERTa, hence byte-level Byte-Pair-Encoding. This is in contrast to LayoutLMv2, which used BERT-like WordPiece tokenization.
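If you only need the image side, the resizing and normalization can be handled by LayoutLMv3FeatureExtractor on its own. A minimal sketch (the file name is just a placeholder):

```python
from PIL import Image
from transformers import LayoutLMv3FeatureExtractor

# apply_ocr=False: we only want resizing + normalization here,
# not the Tesseract OCR the feature extractor would run by default.
feature_extractor = LayoutLMv3FeatureExtractor(apply_ocr=False)

image = Image.open("document.png").convert("RGB")  # channels must be RGB, not BGR
encoding = feature_extractor(image, return_tensors="pt")

print(encoding.pixel_values.shape)  # torch.Size([1, 3, 224, 224])
```

In practice you will rarely call the feature extractor directly, because the processor described below wraps it together with the tokenizer.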

Because of this, I've created a new LayoutLMv3Processor, which combines a LayoutLMv3FeatureExtractor (for the image modality) and a LayoutLMv3TokenizerFast (for the text modality) into a single processor. Its usage is identical to that of its predecessor, LayoutLMv2Processor.
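As a rough sketch of how the processor is typically used to prepare one training example (the words, boxes and labels are made up for illustration, with boxes in the 0-1000 normalized format; the checkpoint is the public microsoft/layoutlmv3-base model):

```python
from PIL import Image
from transformers import LayoutLMv3Processor

# apply_ocr=False because we provide our own words and (segment-level) boxes;
# by default the feature extractor would run Tesseract OCR itself.
processor = LayoutLMv3Processor.from_pretrained("microsoft/layoutlmv3-base", apply_ocr=False)

image = Image.open("document.png").convert("RGB")
words = ["123", "Main", "Street"]
boxes = [[80, 110, 240, 130]] * len(words)  # segment-level box shared by all words
word_labels = [1, 2, 2]                     # e.g. B-ADDRESS, I-ADDRESS, I-ADDRESS

encoding = processor(image, words, boxes=boxes, word_labels=word_labels,
                     truncation=True, padding="max_length", return_tensors="pt")

# The processor handles both modalities: RoBERTa byte-level BPE for the text,
# resizing/normalization for the image.
print(encoding.keys())  # input_ids, attention_mask, bbox, labels, pixel_values
```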

The full documentation can be found here.

The models on the hub can be found here.