DisCo:Diffusion-based Cross-modal Shape Reconstruction

Installation

The following steps have been tested on Ubuntu20.04.

You must have an NVIDIA graphics card with at least 12GB VRAM and have CUDA installed.
Install Python >= 3.8.
Install PyTorch==2.3.0 and torchvision==0.18.0.

pip install torch==2.3.0 torchvision==0.18.0 --index-url https://download.pytorch.org/whl/cu118
pip install torch-scatter -f https://data.pyg.org/whl/torch-2.3.0+cu118.html

Install dependencies:

pip install -r requirements.txt

Install DisCo:

pip install -e .

Data preparation

Download and Organize Data
- Download the preprocessed data from BaiduYun (code: r7vs).
- After downloading, place all the data under the LASA directory.
- Unzip align_mat_all.zip manually.
Unzip All Data
- You can use the provided script to unzip all data in occ_data and other_data directories.
- Run the script to unzip the data:
```
python datasets_preprocess/unzip_all_data.py --unzip_occ --unzip_other
```
(Optional)Generate Augmented Partial Point Cloud
Extract Image Features
- Navigate to the process_scripts directory:
```
cd process_scripts
```
- Run the script to extract image features:
```
bash dist_extract_vit.sh
```

Generate Train/Validation Splits

Navigate to the process_scripts directory:
```
cd process_scripts
```

For the LASA dataset, run:

python generate_split_for_arkit.py --cat arkit_chair arkit_stool ...

For the synthetic dataset, run:

python generate_split_for_synthetic_data.py --cat 03001627 future_chair ABO_chair future_stool ...

Training

All experiments are conducted on 8 A100 GPUs with a batch size of 22. Ensure you have access to similar hardware for optimal performance.

Train the VAE Model
- Open the script train_VAE.sh with your preferred text editor and ensure the --category entry specifies the category you wish to train on. Possible options include:
  - Individual categories: chair, cabinet, table, sofa, bed, shelf
  - All categories: all
  - (Make sure you have downloaded and preprocessed all necessary sub-category data as outlined in ./datasets/taxonomy.py)
- Run the script to start training the VAE model:
```
 python train.py --gpus 0,1,2,3,4,5,6,7 --data_path ./data/ --train_type vae
```
- Note: Ensure you use early stopping by manually stopping the training at 150 epochs if needed.
Pre-Extract VAE Features Comming Soon
Train the Diffusion Model on Synthetic Dataset Comming Soon
Finetune the Diffusion Model on LASA Dataset Comming Soon

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
datasets_preprocess		datasets_preprocess
disco		disco
output		output
.gitignore		.gitignore
README.md		README.md
demo.py		demo.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DisCo:Diffusion-based Cross-modal Shape Reconstruction

Installation

Data preparation

Training

About

Releases

Packages

Languages

GAP-LAB-CUHK-SZ/DisCo

Folders and files

Latest commit

History

Repository files navigation

DisCo:Diffusion-based Cross-modal Shape Reconstruction

Installation

Data preparation

Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages