Skip to content

harshit17chaudhary/videolabeler

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spatio-Temporal Road Detection from Aerial Imagery using CNNs

This repository gathers a semi-automatic tool for labelling videos, as well as a labelled database of videos of roads from the point of view of a small commercial drone. The work done is detailed in the article Spatio-Temporal Road Detection from Aerial Imagery using CNNs, presented at the VISAPP 2017, the International Conference on Computer Vision Theory and Applications.

The author of this project is Belén Luque, under the supervision of Javier Ruiz Hidalgo and Josep Ramon Morros.

Abstract of the article

The main goal of this paper is to detect roads from aerial imagery recorded by drones. To achieve this, we propose a modification of SegNet, a deep fully convolutional neural network for image segmentation. In order to train this neural network, we have put together a database containing videos of roads from the point of view of a small commercial drone. Additionally, we have developed an image annotation tool based on the watershed technique, in order to perform a semi-automatic labeling of the videos in this database. The experimental results using our modified version of SegNet show a big improvement on the performance of the neural network when using aerial imagery, obtaining over 90% accuracy.

Publication

You can find our article published in ScitePress, as part of the Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - (Volume 4).

Labelling tool

This labelling tool allows to extend the labels from a frame to the rest of video, using the watershed and histogram backprojection techniques. The details on how the tool works are detailed in the article mentioned above. To use the labelling tool, follow the instructions in videoLabeler.py and run the script. Note that the numpy and OpenCV libraries are required.

Database

The labelled database consists on 3.313 images, gathered from snippets of different Youtube videos. These images have been labelled using the developed semi-automatic tool. The results have been supervised and corrected when needed by the author.

The videos used are the following:

As the videos are not from the author, only the labels are provided (drones_labels.zip). These labels are in .png format. Each pixel can have one of the following values:

  • (0,0,0) for the class sky
  • (4,4,4) for the class road
  • (6,6,6) for the class other

Below are some examples of labels in the database. For visualization purposes, the values of the images were modified, where gray corresponds to the class sky, red to the class road and violet to the class other.

A script is provided (videosToFrames.py) to automatically obtain the exact frames of each video used to create the database. When using this script, the needed frames of each video will be saved in a directory, with their names matching the names of the labels.



The frames used for each video are:

Video Frames from the video
Video 1 [1878-1908] / [2505-2555] / [2918-2948] / [4768-4798] / [7118-7158] / [8509-8528]
Video 2 [841-916] / [3040-3136] / [2846-2851] / [2856] / [2862-2863] / [2868] / [2875-2879] / [2884-2887] / [2892] / [2899] / [2901-2902] / [2906] / [2910-2911] / [2914-2915] / [2922] / [2926-2927] / [2932] / [2934] / [2940] / [357-436] / [541-638] / [640-749] / [943-1001] / [1057-1232] / [1916-2033] / [3644-3744] / [3814-3874] / [3904-4032] / [4536-4556] / [4616-4656] / [5083-5113] / [5183-5213] / [5304-5334]
Video 3 [227-257]
Video 4 [2110-2150] / [2404-2444]
Video 5 [2785-2839] / [3337-3450] / [3588-3722] / [4894-5030]
Video 6 [4299-4329] / [4673-4703] / [4956-4986] / [5026-5036] / [6711-6741]
Video 7 [1705-1735] / [1821-1836] / [2914-2944] / [3404-3444]
Video 8 [2681-2721] / [2751-2781]
Video 9 [2-402] / [809-989] / [1026-1280] / [1673-1769]

The order in which the frames are provided follows the order of the labels' names.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%