-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reconstruction Error #10
Comments
Thank you! The reconstruction error from Table 1 is pixel-level so the VAE encoding/decoding process introduces some level of error (which is why the |
Thanks for the reply, that makes sense! Is it possible to modify the VAE to fix the error? |
I also want to know how to reduce the error further by modifying the VAE. Can anyone provide some suggestions? |
The VAE is trained separately from the diffusion model. If you were to modify the VAE, it would make the outputs inconsistent with the diffusion model. So, to improve the reconstruction error from VAE you would have to train both the VAE and diffusion model from scratch. This is exactly what was done in latest Stable Diffusion versions. The latest versions of Stable Diffusion have improved their VAE, see the table below from SDXL paper. |
Very nice work! I have a question, while EDICT is invertible by design, why the reconstruction error is not zero as shown in Table 1?
The text was updated successfully, but these errors were encountered: