This repository is an attempt to implement the U-Net paper. We implement the convolutional neural network architecture described in the paper and use it for semantic segmentation over 32 classes in total.
- Minimal implementation of the U-Net
- Complete README
- Visualization of the model output
- Additional metrics to evaluate performance
Semantic segmentation refers to the process of mapping/classifying each pixel in an image to a class label; it can be thought of as image classification at the pixel level. Its primary applications include autonomous vehicles, human-computer interaction, robotics, and photo-editing tools. It is especially useful for self-driving cars, which require contextual information about the environment at every step while traversing a route.
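The toy snippet below illustrates this pixel-level labeling: the target (or prediction) for an image is a 2-D array of class indices with the same height and width as the image. The class names here are hypothetical and only serve as an illustration.

```python
import numpy as np

# Hypothetical class indices, for illustration only.
class_names = {0: "sky", 1: "road", 2: "car"}

image = np.zeros((4, 4, 3), dtype=np.uint8)  # a tiny 4x4 RGB image
# One class label per pixel: the mask has the same height and width as the image.
mask = np.array([[0, 0, 0, 0],
                 [0, 0, 0, 0],
                 [1, 1, 2, 2],
                 [1, 1, 2, 2]])
assert mask.shape == image.shape[:2]
```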
We use the network architecture described in the U-Net paper. The first half of the U-Net performs encoding and the second half performs reconstruction. To obtain confidence scores as probabilities, we apply the softmax activation function at the output layer, and we use cross entropy as the loss function.
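As a rough illustration, the sketch below shows a heavily scaled-down version of this encoder-decoder idea in PyTorch. The depth, channel sizes, and padding are assumptions chosen for brevity and do not match the full architecture used in this repository. Note that `nn.CrossEntropyLoss` applies log-softmax internally, so the explicit softmax is only used to report probabilities.

```python
import torch
import torch.nn as nn

def double_conv(in_ch, out_ch):
    # Two 3x3 convolutions with ReLU, the basic building block of U-Net.
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
    )

class MiniUNet(nn.Module):
    def __init__(self, num_classes=32):
        super().__init__()
        # Encoding half: convolutions followed by max pooling.
        self.enc1 = double_conv(3, 64)
        self.enc2 = double_conv(64, 128)
        self.pool = nn.MaxPool2d(2)
        # Reconstruction half: upsample and concatenate the skip features.
        self.up = nn.ConvTranspose2d(128, 64, kernel_size=2, stride=2)
        self.dec1 = double_conv(128, 64)
        # 1x1 convolution maps features to per-pixel class logits.
        self.head = nn.Conv2d(64, num_classes, kernel_size=1)

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        d1 = self.dec1(torch.cat([self.up(e2), e1], dim=1))
        return self.head(d1)  # raw logits, shape (N, num_classes, H, W)

model = MiniUNet(num_classes=32)
images = torch.randn(2, 3, 96, 96)            # dummy batch
labels = torch.randint(0, 32, (2, 96, 96))    # per-pixel class indices
logits = model(images)
# nn.CrossEntropyLoss applies log-softmax internally, so logits are passed
# directly; softmax is only applied to obtain per-pixel confidence scores.
loss = nn.CrossEntropyLoss()(logits, labels)
probs = torch.softmax(logits, dim=1)
```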
The Cambridge-driving Labeled Video Database (CamVid) is the first collection of videos with object class semantic labels, complete with metadata. The database provides ground truth labels that associate each pixel with one of 32 semantic classes. The dataset can be found here.
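For readers unfamiliar with the dataset layout, the hedged sketch below shows one common way to convert a CamVid-style RGB label image into the per-pixel class indices expected by the cross entropy loss. The two example colors and the function name are assumptions for illustration; the full 32-entry color-to-class mapping comes from the dataset's class dictionary.

```python
import numpy as np
from PIL import Image

# Hypothetical subset of the color -> class-index mapping.
COLOR_TO_CLASS = {
    (128, 128, 128): 0,  # e.g. "Sky"
    (128, 64, 128): 1,   # e.g. "Road"
}

def rgb_label_to_indices(path):
    # Read the RGB label image and map each pixel's color to its class index.
    rgb = np.array(Image.open(path).convert("RGB"))
    indices = np.zeros(rgb.shape[:2], dtype=np.int64)
    for color, idx in COLOR_TO_CLASS.items():
        indices[np.all(rgb == color, axis=-1)] = idx
    return indices  # (H, W) array of class ids, ready for the cross entropy loss
```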