Hi, I'm new to diffusion models.

According to the DALL·E 2 paper, the prior model predicts CLIP image embeddings from CLIP text embeddings. I think it is designed this way to minimize the modality gap.

I just don't understand why a diffusion model is needed for the prior. I know it works, but why not use a simpler network (say, an MLP) to implement the mapping from text embeddings to image embeddings? A diffusion model is quite expensive in terms of time and compute.
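For concreteness, here is a minimal sketch (in NumPy, with placeholder random weights and a hypothetical 768-dimensional embedding, as in CLIP ViT-L/14) of the kind of simple MLP prior the question proposes — a direct deterministic map from a text embedding to a predicted image embedding:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions; CLIP ViT-L/14 uses 768-d embeddings.
EMB_DIM = 768
HIDDEN = 1024

# Placeholder random weights; a real model would be trained on paired
# (text embedding, image embedding) data with e.g. an MSE or cosine loss.
W1 = rng.standard_normal((EMB_DIM, HIDDEN)) * 0.02
b1 = np.zeros(HIDDEN)
W2 = rng.standard_normal((HIDDEN, EMB_DIM)) * 0.02
b2 = np.zeros(EMB_DIM)

def mlp_prior(text_emb: np.ndarray) -> np.ndarray:
    """Deterministically map a text embedding to one image embedding."""
    h = np.maximum(text_emb @ W1 + b1, 0.0)   # ReLU hidden layer
    out = h @ W2 + b2
    return out / np.linalg.norm(out)          # CLIP embeddings are unit-norm

text_emb = rng.standard_normal(EMB_DIM)
img_emb = mlp_prior(text_emb)
print(img_emb.shape)  # (768,)
```

One relevant design difference: a deterministic map like this produces exactly one image embedding per caption, whereas text-to-image is one-to-many — a diffusion prior instead samples from a distribution over plausible image embeddings for the same text.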