Skip to content

manyana72/inspiration_tree

 
 

Repository files navigation

Concept Decomposition for Visual Exploration and Inspiration


Concept Decomposition for Visual Exploration and Inspiration

Yael Vinker, Andrey Voynov, Daniel Cohen-Or, Ariel Shamir
Our method provides a tree-structured visual exploration space for a given unique concept. The nodes of the tree ("v_i") are newly learned textual vector embeddings, injected to the latent space of a pretrained text-to-image model. The nodes encode different aspects of the subject of interest. Through examining combinations within > and across trees, the different aspects can inspire the creation of new designs and concepts, as can be seen below.

Setup

git clone https://github.com/yael-vinker/inspiration_tree.git

Environment (with pip)

Our code relies on the enviornment in the official Stable Diffusion repository. To set up their environment, please run:

python -m venv .tree_venv
source .tree_venv/bin/activate
pip install -r requirements.txt

Technical details:

  • CUDA 11.6
  • torch 1.7.1+cu110

Hugging Face Diffusers Library
Our code relies on the diffusers library and the official Stable Diffusion v1.4 model.

Usage


This code will allow you to generate a tree per concept, and play with the different prompts and combinations for the generated tree (under "inspiration_tree_playground.ipynb").

Pretrained Models and Datasets

As part of our code release and to assist with comparisons, we have also provided some of the trained models and datasets used in the paper.

All of our models (learned tokens from the paper) can be found undeer the "learned_tokens" directory. The notebook inspiration_tree_playground.ipynb shows how to load them and reproduce the results from the paper (under "Play with learned aspects from the paper"). All datasets used from Textual Inversion can be found under "datasets".

Generate your tree

The logic for generating the tree is under "main_multiseed.py", which runs the framwork for a single node. You can generate the tree by passing your own parameters to main_multiseed.py. An example is given in "run_decompose.sh":

python main_multiseed.py --parent_data_dir "cat_sculpture/" --node v0 --test_name "v0" --GPU_ID "${GPU_ID}" --multiprocess 0

Notes:

  • The "test_name" should be identical to the chosen node Results:
  • The results will be saved to "outputs/<parent_data_dir>/"
  • In "final_samples.jpg" you can see a batch of random samples of the learned nodes
  • Under "consistency_test" we save the results of the seed selection procedure
  • Once training is finished, you can continue with splitting the generated nodes as well

Inference

We made a notebook ("inspiration_tree_playground.ipynb") to play with the results, and this should be the most convenient option. Notes:

  • The notebook shows the concepts learned in each node
  • You can also load checkpoints from early iterations
  • You can generate new concepts using natural language sentences

Acknowledgements

Our code builds on the diffusers implementation of textual inversion

Citation

If you find this useful for your research, please cite the following:

@article{vinker2023concept,
  title={Concept Decomposition for Visual Exploration and Inspiration},
  author={Yael Vinker and Andrey Voynov and Daniel Cohen-Or and Ariel Shamir},
  journal={arXiv preprint arXiv:2305.18203},
  year={2023}
}

Disclaimer

This is not an officially supported Google product.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.2%
  • Other 0.8%