Signex

Signex is an open-source signature and stamp recognition tool that uses a YOLOv7-based model for signature detection and EfficientNet v2 S for signature embeddings.

Here you can find our Swagger API description.

Introduction

Signature & stamp recognition is a valuable tool in various domains, including banking, legal, and security applications. This architecture provides a framework for building a signature recognition system using machine learning algorithms.

Requirements

To run Signex, you need:

- A Linux or Windows machine (the project has not been tested on macOS)
- A GPU and CUDA 11.8 for training

Installation

Step-by-step guide


If you are going to train custom models, install CUDA 11.8

Clone the repository:

git clone --depth 1 --recurse-submodules https://github.com/ATMI/Signex.git

Navigate to the project:
```
cd Signex
```
Create Python virtual environment:
```
python -m venv venv
```

Activate venv:

Linux:

. venv/bin/activate

Windows:

venv\Scripts\activate

Install the requirements:
```
pip install -r requirements.txt
```

Linux:

git clone --depth 1 --recurse-submodules https://github.com/ATMI/Signex.git
cd Signex
python -m venv venv
. venv/bin/activate
pip install -r requirements.txt

Windows:

git clone --depth 1 --recurse-submodules https://github.com/ATMI/Signex.git
cd Signex
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
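
After installation, a quick sanity check can confirm that the virtual environment is active and whether PyTorch can see a CUDA device. The sketch below is not part of the repository: the helper names `in_virtualenv` and `cuda_status` are hypothetical, and it assumes that `requirements.txt` installs `torch`.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv."""
    # Inside a venv, sys.prefix points at the venv, base_prefix at the system Python.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def cuda_status() -> str:
    """Describe whether PyTorch is installed and can use CUDA."""
    try:
        import torch  # assumed to be installed by requirements.txt
    except ImportError:
        return "torch is not installed"
    if torch.cuda.is_available():
        return f"CUDA is available (torch built for CUDA {torch.version.cuda})"
    return "CUDA is not available; GPU training will not work"


print("virtual environment active:", in_virtualenv())
print(cuda_status())
```

A CPU-only result is likely acceptable for running the detector, since the requirements above list the GPU only for training.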

Usage

Detection Model

To run the trained detection model, execute the following commands:

cd detection
python ../yolov7/detect.py --weights weights/best.pt --conf 0.5 --img-size 640 --source images_dir

Comparison Model

For now, the comparison model is only available in comparison/main.ipynb. You can test the existing model by running the following cell:

MODEL = torch.load("weights/model.pt")
TEST_DATASET = TriDataset(TEST_PATH, transform=TRANSFORM, display=True)
test(MODEL, DEVICE, TEST_DATASET)

Before that, run all the preceding cells except the training one:

MODEL = train(DEVICE)
torch.save(MODEL, "model.pt")
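
Since the comparison model produces signature embeddings, two signatures are typically compared by measuring the distance between their embedding vectors. The sketch below illustrates that idea only; the Euclidean metric, the `threshold` value, and the function names are assumptions, not taken from the notebook.

```python
import math
from typing import Sequence


def euclidean_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two embedding vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def same_signature(a: Sequence[float], b: Sequence[float], threshold: float = 1.0) -> bool:
    """Treat two embeddings as the same signature when they are close enough.

    The threshold is a hypothetical value; a real one would be tuned on test data.
    """
    return euclidean_distance(a, b) <= threshold
```

In practice the threshold would be chosen from the distances observed on the test dataset.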

Training

Detection Model

Structure

cfg - neural network configurations folder
data - dataset configurations folder
hyp - hyperparameters for training neural network, such as learning rate, augmentation strategies, etc.

Dataset preparation

To train your custom model:

Collect images for the dataset
Convert all images to .jpg format and check their validity
Label the data; we recommend using YOLO Label. This project uses the standard YOLO label format:
```
<class_id> <cx> <cy> <w> <h>   
```
Where:
1. class_id - class of the labeled object, an integer ∈ [0, nc - 1], where nc is the number of classes
2. cx - x coordinate of the bounding box center, ∈ [0, 1]
3. cy - y coordinate of the bounding box center, ∈ [0, 1]
4. w - width of the bounding box, ∈ [0, 1]
5. h - height of the bounding box, ∈ [0, 1]
Note: cx, cy, w, and h are values relative to the corresponding image dimensions
Put or symlink all images and labels into the dataset/images and dataset/labels folders. Each label file name should match the name of its image file:
```
image_1.jpg <-> image_1.txt
ball.jpg <-> ball.txt
```
Change the number of classes in cfg/net.yaml:
```
nc: 2 # number of classes
```
Create the dataset/train.lst and dataset/test.lst files, which contain the paths to the training and testing images. You can use the shufflels tool to create them automatically:
1. Build shufflels:
```
g++ shufflels.cpp -o shufflels
```
2. Run shufflels:
```
cd dataset
./shufflels images jpg 80
```
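
If building the C++ tool is inconvenient, a similar shuffled split can be sketched in Python. This is not a port of shufflels: it assumes that the arguments `images jpg 80` mean image directory, file extension, and training percentage, and that each list file holds one image path per line. The function names are hypothetical.

```python
import random
from pathlib import Path


def split_image_list(images_dir, extension="jpg", train_percent=80, seed=0):
    """Shuffle the image paths and split them into (train, test) lists."""
    paths = sorted(str(p) for p in Path(images_dir).glob(f"*.{extension}"))
    random.Random(seed).shuffle(paths)  # fixed seed keeps the split reproducible
    cut = len(paths) * train_percent // 100
    return paths[:cut], paths[cut:]


def write_list(paths, list_file):
    """Write one image path per line (assumed list file format)."""
    Path(list_file).write_text("".join(f"{p}\n" for p in paths))
```

For example, `train, test = split_image_list("dataset/images")` followed by `write_list(train, "dataset/train.lst")` and `write_list(test, "dataset/test.lst")` would produce the two lists. Whether the paths should be relative to the dataset folder or to the working directory depends on how the training script resolves them.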

You can specify custom train.lst and test.lst paths in the data/data.yaml file:

train: dataset/list.lst # path to images list used for training
val: dataset/list.lst # path to images list used for testing

Specify the number and names of the classes in the data/data.yaml file:

nc: 2 # number of classes in the dataset
names: ['signature', 'stamp'] # names of the classes

Optionally, you can modify the hyperparameters in hyp/hyp.net.yaml.
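
To make the label format from the labeling step concrete, the sketch below converts a pixel-space bounding box into a YOLO label line and checks that a line is well formed. The helper names are hypothetical, and the check assumes zero-based class ids, so the largest valid id is one less than `nc`.

```python
def to_yolo_label(class_id, box, image_size):
    """Build a label line from a pixel box.

    box is (x_min, y_min, x_max, y_max) in pixels; image_size is (width, height).
    """
    x_min, y_min, x_max, y_max = box
    width, height = image_size
    cx = (x_min + x_max) / 2 / width   # box center, relative to image width
    cy = (y_min + y_max) / 2 / height  # box center, relative to image height
    w = (x_max - x_min) / width
    h = (y_max - y_min) / height
    return f"{class_id} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


def is_valid_label(line, num_classes):
    """Check field count, class id range, and that coordinates lie in [0, 1]."""
    parts = line.split()
    if len(parts) != 5:
        return False
    try:
        class_id = int(parts[0])
        values = [float(v) for v in parts[1:]]
    except ValueError:
        return False
    return 0 <= class_id < num_classes and all(0.0 <= v <= 1.0 for v in values)
```

Running `is_valid_label` over every line of every file in dataset/labels before training is a cheap way to catch labels that were exported in pixel coordinates by mistake.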

Start training

cd detection
python ../yolov7/train.py --workers 8 --device 0 --batch-size 64 --data data/data.yaml --img 640 640 --cfg cfg/net.yaml --weights weights/best.pt --name net --hyp hyp/hyp.net.yaml

Comparison Model

To train and test the comparison model, run comparison/main.ipynb. The training dataset should be placed under comparison/dataset/train, and each sub-folder should contain different variants of the same signature. The testing data should be placed under comparison/dataset/test:

dataset/
├── train/
│   ├── 1/
│   │   ├── variant_1.jpg
│   │   └── variant_2.jpg
│   └── 2/
│       ├── variant_1.jpg
│       └── variant_2.jpg
└── test/
    ├── 3/
    │   ├── variant_1.jpg
    │   └── variant_2.jpg
    └── 4/
        ├── variant_1.jpg
        └── variant_2.jpg
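
The layout above groups images by signature, which is what triplet-style training needs: an anchor, a positive from the same folder, and a negative from a different folder. The sketch below shows how such triplets could be drawn from this directory structure. It is an illustration only; the actual sampling in `TriDataset` is not shown here, and the function names are hypothetical.

```python
import random
from pathlib import Path


def list_variants(split_dir):
    """Map each signature folder name to its sorted .jpg image paths."""
    groups = {}
    for folder in sorted(Path(split_dir).iterdir()):
        if folder.is_dir():
            images = sorted(str(p) for p in folder.glob("*.jpg"))
            if images:
                groups[folder.name] = images
    return groups


def sample_triplet(groups, rng):
    """Pick (anchor, positive, negative) image paths.

    Anchor and positive come from one folder, negative from another.
    """
    eligible = [name for name, images in groups.items() if len(images) >= 2]
    if not eligible or len(groups) < 2:
        raise ValueError("need two folders, one of them with two or more variants")
    anchor_class = rng.choice(eligible)
    negative_class = rng.choice([name for name in groups if name != anchor_class])
    anchor, positive = rng.sample(groups[anchor_class], 2)
    negative = rng.choice(groups[negative_class])
    return anchor, positive, negative
```

For example, `sample_triplet(list_variants("comparison/dataset/train"), random.Random(0))` would return three image paths from the training split.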

Testing

Detection Model

To test the trained model, run:

cd detection
python ../yolov7/test.py --weights weights/best.pt --img-size 640 --data data/data.yaml

With the latest model, we obtained the following results (none of the test images were included in the training dataset):

| Class | Images | Labels | Precision | Recall | mAP@.5 | mAP@.5:.95 |
|---|---|---|---|---|---|---|
| all | 696 | 1393 | 0.969 | 0.926 | 0.966 | 0.676 |
| signature | 696 | 935 | 0.953 | 0.92 | 0.965 | 0.56 |
| stamp | 696 | 458 | 0.985 | 0.932 | 0.966 | 0.791 |
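
Precision and recall can be combined into a single F1 score, the harmonic mean of the two. The values below are copied from the results above; the F1 figures themselves are derived here and are not reported by the test script output shown in this README.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs taken from the results above
results = {
    "all": (0.969, 0.926),
    "signature": (0.953, 0.92),
    "stamp": (0.985, 0.932),
}

for name, (precision, recall) in results.items():
    print(f"{name}: F1 = {f1_score(precision, recall):.3f}")
```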

Confusion matrix:

Comparison Model

For now, see the Comparison Model part of the Training section.

API

To start the API, run:

python api.py

By default, it listens on port 8080 and loads the detection/weights/best.pt weights for the detector.
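
To confirm that the server has started, you can check whether something is accepting connections on the default port. The sketch below uses only the Python standard library and does not call any Signex endpoint; the helper name `is_port_open` is hypothetical, and the host `127.0.0.1` assumes the API runs on the same machine.

```python
import socket


def is_port_open(host="127.0.0.1", port=8080, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


print("API reachable:", is_port_open())
```

The available routes are described in the Swagger API description mentioned at the top of this README.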

Contributing

We welcome contributions to enhance the signature recognition architecture. If you would like to contribute, please follow these steps:

Fork the repository on GitLab.
Create a new branch with the name feature/feature_name for your feature or bug fix.
Implement your changes or additions.
Commit and push your changes to your forked repository.
Submit a merge request, clearly describing the changes you have made.

Future work

Signex is still under development, and the following tasks remain:

Development of an in-stamp signature extraction model. Our model is also trained to detect stamps for potential further processing. We can try to find signatures inside stamps to improve signature detection accuracy, as the current accuracy may seem relatively low.

Further comparison model training and API method implementation.

License

Signex is licensed under the WTFPL.
