Learning 3D Computer Vision

3D computer vision enables us to understand the spatial arrangement, orientation, shape, and volumetric characteristics of objects in the 3D world, leading to high-level semantic insights. This repository is dedicated to tutorials on 3D computer vision, focusing solely on learning-based methodologies, particularly with neural networks.

3D reconstruction from a single view is very similar to the process through which we recognize objects in the real world. When we look at a chair from one angle, we know it is a chair and can intuitively imagine what it would look like from other angles. It’s not like a chair viewed from one angle will look like an airplane from another angle. That being said, if you were determined to design an airplane that looks like a chair from a specific viewpoint, then everything in this post is inapplicable. 🤣

Left: Ground truth point cloud. Middle: Single view image. Right: Predicted point cloud.

Anatomy of NeRF, Neural Radiance Field

Neural Radiance Fields (NeRF) is a revolutionary approach to computer graphics and vision for synthesizing highly realistic images from sparse sets of images. At its core, NeRF models the continuous volumetric scene function using a multi-layer perceptron (MLP), mapping spatial coordinates and viewing directions to color and density. In this tutorial, I aim to demystify NeRF, explaining NeRF in detail and implementing it using PyTorch from scratch.

Left: A sequence of training images. Right: Synthesized views under continuous viewing directions.

NOTE: More work is required to make inverse sampling and fine sampling work.

Deep Dive into 3D Gaussian Splatting

The figure shows the process of 3D Gaussian changing its properties to align to viewing directions.

3D Gaussian Splatting (3DGS) is a powerful technique for generating novel views from a set of images and their poses. In this section, I will cover the basics of 3DGS.

NOTE: Advanced techniques in 3DGS, such as splitting and deleting 3D Gaussians, are not yet implemented. The code in Deep_Dive_into_3D_Gaussian_Splatting/ has been tested only on an NVIDIA L4 GPU with 24 GB memory.

Getting Started

All the results can be reproduced by following the instructions below.

System Dependencies

NVIDIA driver, CUDA Toolkit, and cuDNN libraries: The system must installed recent NVIDIA driver, CUDA Toolkit, and cuDNN libraries, which are prerequisites for PyTorch and PyTorch3D. Here are the software versions which have been tested:
```
NVIDIA driver: 530.30.02
cuda toolkit: 12.1
cudnn: 8.9.7
```
If you encounter any problems while installing or updating them, you could consult this guide.
Python3: Python 3.10 is used throughout all the developments for the problems.

Install Python and NVIDIA requirements

Create and activate a Python virtual environment named venv_3d_cv, and update pip, setuptools, and wheel:

python3.10 -m venv venv_3d_cv \
&& source venv_3d_cv/bin/activate \
&& python3 -m pip install --upgrade pip setuptools wheel

Install required general Python packages:

python3 -m pip install -r requirements.txt

Install required NVIDIA Python packages:

python3 -m pip install nvidia-pyindex && python3 -m pip install -r nvidia_requirements.txt

Install PyTorch and PyTorch3D

Install python3-dev by sudo apt install python3-dev.

The ~/.bashrc has the following lines:

# To Export or Not To Export LD_LIBRARY_PATH. Make Python find in venv*
export PATH=/usr/local/cuda-12.1/bin:$PATH
# export LD_LIBRARY_PATH=/usr/local/cuda-12.1/lib64:$LD_LIBRARY_PATH

Please visit the PyTorch official website to find the command to use for your system (CUDA 12.1):
```
python3 -m pip install torch torchvision torchaudio
```

The installation guide of PyTorch3D can be found here:

python3 -m pip install fvcore iopath && python3 -m pip install "git+https://github.com/facebookresearch/pytorch3d.git@stable"

Running Jupyter Notebook

Now you are ready to go to each folder and run the python script and the Jupyter Notebook. Please remember to select the kernel you just created in your virtual environment venv_3d_cv.

Acknowledgments

This repository is a compilation of materials gathered from various online sources, each cited to acknowledge their origin.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
3D_Object_Representations		3D_Object_Representations
Anatomy_of_NeRF		Anatomy_of_NeRF
Deep_Dive_into_3D_Gaussian_Splatting		Deep_Dive_into_3D_Gaussian_Splatting
Supervised_Single_View_to_3D_Objects		Supervised_Single_View_to_3D_Objects
.gitignore		.gitignore
README.md		README.md
nvidia_requirements.txt		nvidia_requirements.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning 3D Computer Vision

Table of Contents

3D Object Representations

Supervised Single-View to 3D Objects

Anatomy of NeRF, Neural Radiance Field

Deep Dive into 3D Gaussian Splatting

Getting Started

System Dependencies

Install Python and NVIDIA requirements

Install PyTorch and PyTorch3D

Running Jupyter Notebook

Acknowledgments

About

Releases

Packages

Languages

lionlai1989/Learning-3D-Computer-Vision

Folders and files

Latest commit

History

Repository files navigation

Learning 3D Computer Vision

Table of Contents

3D Object Representations

Supervised Single-View to 3D Objects

Anatomy of NeRF, Neural Radiance Field

Deep Dive into 3D Gaussian Splatting

Getting Started

System Dependencies

Install Python and NVIDIA requirements

Install PyTorch and PyTorch3D

Running Jupyter Notebook

Acknowledgments

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages