keywords: Deep learning, semantic segmentation, ALS point cloud, aerial imagery, multi-modal features
Master Thesis at Insititute of Photogrammetry (Ifp), University of Stuttgart, Germany
Author: Fangwen Shu
Betreuer: M.Sc. Dominik Laupheimer
Prüfer: Prof. Norbert Haala
framework: keras 2.0
pre-processing: python3
semantic segmentation: python2
End in 01.04.2018
https://www.ifp.uni-stuttgart.de/lehre/masterarbeiten/581-shu/
./conda_env/py2.yml
./conda_env/py3.yml
conda env export > py2.yml
conda env create -f py2.yml
pre-processing code for aerial imagery and LiDAR point cloud, including 3D-2D projection, frustum culling, Hidden-point-removal (HPR), gird interpolation and operator of Morphology.
functions of each of algorithms implemented in pre-processing.
some classes related to frustum culling, traslated from C++ code, detailed explanation in OpenGL.
functions used to visualize data.
functions used to calculate statistic information of the data.
code for generating depth image.
Old code backup, including point splatting achieved in C++ if you needed.
- testNet (multi-stream costume CNN based on SegNet, early or late fusion, multi-input stream)
- SegNet (main model used in thesis)
- PSPnet (runnable, one of baselines)
- FCN8,32 (runnable)
- U-net (runnable)
- TernausNet (runnable)
where you save tensorboard file.
where you save train/validation/test-set and VGG pre-treained weights.
where you save trained weights.
some dirty code of SegNet and pre-processing implemented in pytorch.
train and prediction your data.
data generator with pre-processing such as normalization, random rotation, random cropping
randomly brightness jitter, contrast normalization..
cropping images if you need.
evaluation semantic result in 2D and 3D space.