Fast Image Reconstruction with an Event Camera (forked from High Speed and High Dynamic Range Video with an Event Camera)
This code for the paper Fast Image Reconstruction with an Event Camera is based off code for the paper High Speed and High Dynamic Range Video with an Event Camera by Henri Rebecq, Rene Ranftl, Vladlen Koltun and Davide Scaramuzza:
You can find a pdf of the paper here. If you use any of this code, please cite the following publications:
@InProceedings{Scheerlinck20wacv,
author = {Cedric Scheerlinck and Henri Rebecq and Daniel Gehrig and Nick Barnes and Robert Mahony and Davide Scaramuzza},
title = {Fast Image Reconstruction with an Event Camera},
booktitle = {{IEEE} Winter Conf. Appl. Comput. Vis. {(WACV)}},
year = {2020},
pages = {156--163}
}
@Article{Rebecq19pami,
author = {Henri Rebecq and Ren{\'{e}} Ranftl and Vladlen Koltun and Davide Scaramuzza},
title = {High Speed and High Dynamic Range Video with an Event Camera},
journal = {{IEEE} Trans. Pattern Anal. Mach. Intell. (T-PAMI)},
url = {http://rpg.ifi.uzh.ch/docs/TPAMI19_Rebecq.pdf},
year = 2019
}
@Article{Rebecq19cvpr,
author = {Henri Rebecq and Ren{\'{e}} Ranftl and Vladlen Koltun and Davide Scaramuzza},
title = {Events-to-Video: Bringing Modern Computer Vision to Event Cameras},
journal = {{IEEE} Conf. Comput. Vis. Pattern Recog. (CVPR)},
year = 2019
}
Dependencies:
The installation requires Anaconda3. You can create a new Anaconda environment with the required dependencies as follows (make sure to adapt the CUDA toolkit version according to your setup):
conda create -n E2VID
conda activate E2VID
conda install pytorch torchvision cudatoolkit=10.0 -c pytorch
conda install pandas
conda install -c conda-forge opencv
- Download the pretrained FireNet model
or
- Download the pretrained E2VID model:
wget "http://rpg.ifi.uzh.ch/data/E2VID/models/E2VID_lightweight.pth.tar" -O pretrained/E2VID_lightweight.pth.tar
- Download an example file with event data:
wget "http://rpg.ifi.uzh.ch/data/E2VID/datasets/ECD_IJRR17/dynamic_6dof.zip" -O data/dynamic_6dof.zip
Before running the reconstruction, make sure the conda environment is sourced:
conda activate E2VID
- Run reconstruction:
python run_reconstruction.py \
-c firenet_1000.pth.tar \
-i data/dynamic_6dof.zip \
--auto_hdr \
--display \
--show_events
Below is a description of the most important parameters:
--window_size
/-N
(default: None) Number of events per window. This is the parameter that has the most influence of the image reconstruction quality. If set to None, this number will be automatically computed based on the sensor size, as N = width * height * num_events_per_pixel (see description of that parameter below). Ignored if--fixed_duration
is set.--fixed_duration
(default: False) If True, will use windows of events with a fixed duration (i.e. a fixed output frame rate).--window_duration
/-T
(default: 33 ms) Duration of each event window, in milliseconds. The value of this parameter has strong influence on the image reconstruction quality. Its value may need to be adapted to the dynamics of the scene. Ignored if--fixed_duration
is not set.--Imin
(default: 0.0),--Imax
(default: 1.0): linear tone mapping is performed by normalizing the output image as follows:I = (I - Imin) / (Imax - Imin)
. If--auto_hdr
is set to True,--Imin
and--Imax
will be automatically computed as the min (resp. max) intensity values.--auto_hdr
(default: False) Automatically compute--Imin
and--Imax
. Disabled when--color
is set.--color
(default: False): if True, will perform color reconstruction as described in the paper. Only use this with a color event camera such as the Color DAVIS346.
--output_folder
: path of the output folder. If not set, the image reconstructions will not be saved to disk.--dataset_name
: name of the output folder directory (default: 'reconstruction').
--display
(default: False): display the video reconstruction in real-time in an OpenCV window.--show_events
(default: False): show the input events side-by-side with the reconstruction. If--output_folder
is set, the previews will also be saved to disk in/path/to/output/folder/events
.
--num_events_per_pixel
(default: 0.35): Parameter used to automatically estimate the window size based on the sensor size. The value of 0.35 was chosen to correspond to ~ 15,000 events on a 240x180 sensor such as the DAVIS240C.--no-normalize
(default: False): Disable event tensor normalization: this will improve speed a bit, but might degrade the image quality a bit.--no-recurrent
(default: False): Disable the recurrent connection (i.e. do not maintain a state). For experimenting only, the results will be flickering a lot.--hot_pixels_file
(default: None): Path to a file specifying the locations of hot pixels (such a file can be obtained with this tool for example). These pixels will be ignored (i.e. zeroed out in the event tensors).
We provide a list of example (publicly available) event datasets to get started with E2VID.
- High Speed (gun shooting!) and HDR Dataset
- Event Camera Dataset
- Bardow et al., CVPR'16
- Scherlinck et al., ACCV'18
- Color event sequences from the CED dataset Scheerlinck et al., CVPR'18
Because PyTorch recommends Python 3 and ROS is only compatible with Python2, it is not straightforward to have the PyTorch reconstruction code and ROS code running in the same environment. To make things easy, the reconstruction code we provide has no dependency on ROS, and simply read events from a text file or ZIP file. We provide convenience functions to convert ROS bags (a popular format for event datasets) into event text files. In addition, we also provide scripts to convert a folder containing image reconstructions back to a rosbag (or to append image reconstructions to an existing rosbag).
Note: it is not necessary to have a sourced conda environment to run the following scripts. However, ROS needs to be installed and sourced.
To extract the events from a rosbag to a zip file containing the event data:
python scripts/extract_events_from_rosbag.py /path/to/rosbag.bag \
--output_folder=/path/to/output/folder \
--event_topic=/dvs/events
python scripts/image_folder_to_rosbag.py \
--datasets dynamic_6dof \
--image_folder /path/to/image/folder \
--output_folder /path/to/output_folder \
--image_topic /dvs/image_reconstructed
cd scripts
python embed_reconstructed_images_in_rosbag.py \
--rosbag_folder /path/to/rosbag/folder \
--datasets dynamic_6dof \
--image_folder /path/to/image/folder \
--output_folder /path/to/output_folder \
--image_topic /dvs/image_reconstructed
It can be convenient to convert an image folder to a video with a fixed framerate (for example for use in a video editing tool). You can proceed as follows:
export FRAMERATE=30
python resample_reconstructions.py -i /path/to/input_folder -o /tmp/resampled -r $FRAMERATE
ffmpeg -framerate $FRAMERATE -i /tmp/resampled/frame_%010d.png video_"$FRAMERATE"Hz.mp4
This code borrows from the following open source projects, whom we would like to thank: