Multi-label Image Recognition with GCN and its Variants

Overview

This repo contains an implementation (and a few variants) of the paper Multi-label Image Recognition with Graph Convolutional Networks. This repo is created following Lightning Hydra template.

In general, the model is the combination of a CNN-based as the image representation extractor and a GCN-based as the label embedding. Figure 1 describes architecture of the model.

Figure 1. The overall architecture of the model.

Dataset

Currently we have ready-to-use VOCDectection pre-processor. More datasets will be added soon.

Installation

You should have Python 3.7 or higher. I highly recommend creating a virual environment like venv or Conda. For example:

# clone project
git clone https://github.com/thanhtvt/multi-label-gcns.git
cd multi-label-gcns

# [OPTIONAL] create conda environment
conda create -n mlgcn python=3.8
conda activate mlgcn

# install requirements
pip install -r requirements.txt

Results

These results are not fully optimized. Updates will be added in the (unknown :D) future.

Model	Params	Dataset	Accuracy	F1	Checkpoint
ResNet-50 + 2xGCN	25.9M	VOC2007	97.8%	85.2%	model

🚀 Quick start

Train

To train model with default configuration, run:

# train on CPU
python src/train.py trainer=cpu

# train on GPU
python src/train.py trainer=gpu

To train model with chosen experiment configuration from configs/experiment folder, run:

python src/train.py experiment=multi-label_base

To override any parameter from commandline, run:

python src/train.py logger=csv trainer.max_epochs=10

Evaluate

To evaluate model with default configuration, run:

python src/eval.py

To evaluate model with chosen checkpoint, run:

python src/eval.py ckpt_path=best.ckpt

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
configs		configs
data		data
logs		logs
notebooks		notebooks
src		src
static		static
.env		.env
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-label Image Recognition with GCN and its Variants

Overview

Dataset

Installation

Results

🚀 Quick start

Train

Evaluate

About

Releases

Packages

Languages

thanhtvt/multi-label-gcns

Folders and files

Latest commit

History

Repository files navigation

Multi-label Image Recognition with GCN and its Variants

Overview

Dataset

Installation

Results

🚀 Quick start

Train

Evaluate

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages