Andreas Bär, Marvin Klingner, Jonas Löhdefink, Fabian Hüger, Peter Schlicht, and Tim Fingscheidt
Link to supplementary material
System Overview
Performance Prediction Module
In our work, we propose a novel per-image performance prediction method for semantic segmentation. Our method is sensor-efficient, data-efficient, and training-efficient: we extend an already trained semantic segmentation network, whose parameters remain fixed, with an image reconstruction decoder. After training the decoder and fitting a subsequent regression, the image reconstruction quality is used to predict the semantic segmentation performance.
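The following minimal PyTorch sketch illustrates the idea (it is not the repository's actual code; module architectures, shapes, and regression coefficients are illustrative assumptions): a frozen encoder is extended with a reconstruction decoder, the reconstruction PSNR is computed, and a previously calibrated linear regression maps PSNR to a predicted mIoU.

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins for the SwiftNet encoder and the reconstruction
# decoder; layer choices and shapes are illustrative only.
class TinyEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )

    def forward(self, x):
        return self.features(x)

class TinyReconstructionDecoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.up = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 3, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, z):
        return self.up(z)

encoder, decoder = TinyEncoder(), TinyReconstructionDecoder()

# The (pretrained) segmentation encoder stays frozen; only the decoder is trained.
for p in encoder.parameters():
    p.requires_grad = False

x = torch.rand(1, 3, 64, 64)      # dummy input image with values in [0, 1]
x_hat = decoder(encoder(x))       # self-supervised image reconstruction

# Reconstruction PSNR; a degraded input degrades both PSNR and segmentation quality.
mse = torch.mean((x - x_hat) ** 2)
psnr = 10.0 * torch.log10(1.0 / mse)

# A previously calibrated linear regression (hypothetical coefficients) maps
# the per-image PSNR to a per-image mIoU prediction.
slope, intercept = 2.1, -20.0
miou_pred = slope * psnr.item() + intercept
print(f"PSNR: {psnr.item():.2f} dB -> predicted mIoU: {miou_pred:.2f}%")
```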
If you find our code helpful or interesting for your research, please consider citing:
```bibtex
@InProceedings{Baer2022,
  author    = {Andreas B\"{a}r and Marvin Klingner and Jonas L\"{o}hdefink and Fabian H\"{u}ger and Peter Schlicht and Tim Fingscheidt},
  booktitle = {Proc.\ of CVPR - Workshops},
  title     = {{Performance Prediction for Semantic Segmentation by a Self-Supervised Image Reconstruction Decoder}},
  year      = {2022},
  address   = {New Orleans, LA, USA},
  month     = jun,
  pages     = {4399--4408},
}
```
I would like to thank Daniel Kusuma for helping me set up this repository. I would also especially like to thank my co-author Jonas Löhdefink, who passed away far too early. We will never forget you, my friend.
Results are reported on the Cityscapes Lindau validation set.
SwiftNet semantic segmentation baseline:
mIoU | Download link |
---|---|
65.02 | model |
The following two tables report the Pearson correlation values as given in the paper.
SwiftNet-based reconstruction decoder models:
Reconstruction Decoder | Lat. | SPP | Clean | FGSM | PGD | Gaussian | S&P | all | Download link |
---|---|---|---|---|---|---|---|---|---|
SwiftNet | | | 0.19 | 0.77 | 0.78 | 0.70 | 0.78 | 0.76 | model |
SwiftNet | x | | 0.20 | 0.77 | 0.79 | 0.70 | 0.77 | 0.76 | model |
SwiftNet | | x | -0.01 | 0.88 | 0.85 | 0.83 | 0.86 | 0.85 | model |
SwiftNet | x | x | -0.01 | 0.88 | 0.84 | 0.83 | 0.85 | 0.84 | model |
ResNet-based reconstruction decoder models:
Reconstruction Decoder | Block config | Clean | FGSM | PGD | Gaussian | S&P | all | Download link |
---|---|---|---|---|---|---|---|---|
ResNet10 | 1-1-1-1 | 0.24 | 0.82 | 0.82 | 0.76 | 0.84 | 0.81 | model |
ResNet26 | 3-3-3-3 | 0.21 | 0.84 | 0.83 | 0.77 | 0.82 | 0.82 | model |
ResNet18L | 2-2-2-2 | 0.08 | 0.88 | 0.85 | 0.83 | 0.86 | 0.85 | model |
First, clone this repository.
To install the environment from scratch, follow these steps:

```bash
conda create --name swiftnet-pp python=3.7.12
source activate swiftnet-pp
conda install pytorch=1.10.2 torchvision=0.11.3 torchaudio=0.10.2 cudatoolkit=10.2 -c pytorch
conda install matplotlib=3.2.2 scipy=1.7.3 scikit-image=0.19.3 numba=0.55.1
conda install -c conda-forge wand=0.6.5
pip install opencv-python==4.5.5.62
pip install "git+https://github.com/ifnspaml/IFN_Dataloader.git"
pip install "git+https://github.com/ifnspaml/TUBSRobustCheck.git"
```
To install the environment via `environment.yml`, follow these steps:

```bash
conda env create --file environment.yml
source activate swiftnet-pp
pip install "git+https://github.com/ifnspaml/IFN_Dataloader.git"
pip install "git+https://github.com/ifnspaml/TUBSRobustCheck.git"
```
For reference: the `environment.yml` was created by exporting the environment via `conda env export > environment.yml` on our Linux cluster.
We use a dataloader similar to the one in AdvAttackDet, which builds on IFN_Dataloader. Please refer to the Prerequisites and Requirements section of AdvAttackDet to prepare the `cityscapes` and `kitti_2015` datasets. Both are used in our experiments.
Further, if you do not want to pre-generate train ID segmentation labels (i.e., mapping the segmentation label IDs to a form suitable for training), or if you struggle with the process described here, you can set `keys_to_load=('color', 'segmentation')` and `labels_mode='fromid'` in the respective scripts, as sketched below.
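The following is a hypothetical usage sketch: only the two keyword arguments are taken from this README, while the surrounding dataloader call is an assumption about the IFN_Dataloader interface.

```python
# Only keys_to_load and labels_mode are documented in this README; the
# surrounding dataloader call is an assumption about the IFN_Dataloader API.
loader_kwargs = dict(
    keys_to_load=('color', 'segmentation'),  # load RGB images and segmentation labels
    labels_mode='fromid',                    # map label IDs to train IDs on the fly
)
# dataset = CityscapesDataset(data_root, **loader_kwargs)  # class name is illustrative
```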
For training according to our method, please first use `train_swiftnet.py` (trains SwiftNet for semantic segmentation) and then `train_swiftnet_rec.py` (loads the trained SwiftNet for semantic segmentation with frozen weights and trains an additional reconstruction decoder). Please refer to `swiftnet.sh` and `swiftnet_rec.sh` for example usage.
For evaluation according to our method, please use `eval/eval_attacks_n_noise.py`. Please refer to `eval/attacks.sh` for example usage. After running `eval/eval_attacks_n_noise.py`, you can compute metrics from the generated output file. For metric computation, please use `metrics/compute_metrics.py`. Example output files can be found in the folder `output`.
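For reference, PSNR, the reconstruction quality measure underlying the prediction, can be computed as in the following generic NumPy sketch (this is not the repository's `metrics/compute_metrics.py`):

```python
import numpy as np

def psnr(img, img_hat, max_val=1.0):
    """Peak signal-to-noise ratio (dB) between an image and its reconstruction."""
    mse = np.mean((img.astype(np.float64) - img_hat.astype(np.float64)) ** 2)
    if mse == 0.0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)

# Example: a clean image versus a noisy reconstruction
rng = np.random.default_rng(0)
img = rng.random((64, 64, 3))
img_hat = np.clip(img + rng.normal(scale=0.05, size=img.shape), 0.0, 1.0)
print(f"PSNR: {psnr(img, img_hat):.2f} dB")
```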
To perform a regression analysis (predicting mIoU from PSNR), you first need to run `eval/eval_attacks_n_noise.py` to produce output files containing mIoU and PSNR statistics (e.g., of the Cityscapes validation subset). Next, run `metrics/regression.py` to perform the regression analysis. Please choose a calibration output file (for calibrating the regression) and a regression output file (for performing the mIoU prediction). Example output files can be found in the folder `output`.
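The regression step boils down to the following minimal scipy sketch (not the repository's `metrics/regression.py`; the sample values are made up):

```python
import numpy as np
from scipy import stats

# Made-up calibration data: per-image reconstruction PSNR (dB) and the
# corresponding measured per-image mIoU (%).
psnr_calib = np.array([14.2, 16.8, 19.5, 21.1, 23.4, 25.0])
miou_calib = np.array([21.0, 30.5, 41.2, 47.8, 55.1, 60.3])

# Calibrate a linear regression mIoU ~ slope * PSNR + intercept ...
res = stats.linregress(psnr_calib, miou_calib)
print(f"Pearson correlation: {res.rvalue:.2f}")

# ... then predict mIoU from PSNR on the regression output file.
psnr_test = np.array([15.0, 20.0, 24.0])
miou_pred = res.slope * psnr_test + res.intercept
print("Predicted mIoU:", np.round(miou_pred, 1))
```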
The original SwiftNet model in this project was developed by Marin Oršić et al. here. The project was released under the GNU General Public License v3.0. This code modifies some parts of the original code and is therefore also licensed under the GNU General Public License v3.0. Please feel free to use it within the boundaries of this license.